what are the sampling methods in spark? Why not reservoir sampling? -


i know reservoir sampling can applied in parallel, spark seems use other sampling methods have no idea about. describe them briefly?

according @tristan answer, guess purpose of not using reservoir sampling keep balance of classes. go though source code , found noting labels.

i know existence of stratified sampling


Comments

Popular posts from this blog

scala - 'wrong top statement declaration' when using slick in IntelliJ -

c# - DevExpress.Wpf.Grid.InfiniteGridSizeException was unhandled -

PySide and Qt Properties: Connecting signals from Python to QML -