What are the sampling methods in Spark? Why not reservoir sampling?
I know reservoir sampling can be applied in parallel, but Spark seems to use other sampling methods that I have no idea about. Could someone describe them briefly?
According to @tristan's answer, I guess the purpose of not using reservoir sampling is to keep the classes balanced. But I went through the source code and found nothing about labels.
I know that stratified sampling exists.
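For reference, this is roughly what I mean by reservoir sampling: a minimal single-pass sketch (the classic Algorithm R) in plain Python. It is not taken from Spark's source; the name `reservoir_sample` and its parameters are my own, used only for illustration.

```python
import random


def reservoir_sample(stream, k, rng=None):
    """Return k items chosen uniformly at random from an iterable of unknown length.

    Hypothetical helper for illustration only; not a Spark API.
    """
    rng = rng or random.Random()
    reservoir = []
    for i, item in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Keep the new item with probability k / (i + 1)
            # by picking a slot index in [0, i] (both ends inclusive).
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = item
    return reservoir


if __name__ == "__main__":
    print(reservoir_sample(range(1000), 5, random.Random(42)))
```

Each element ends up in the sample with equal probability, and only `k` items are held in memory at a time, which is why I expected something like it to fit a distributed setting.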