Spark Programming Guide(四)

Shuffle operations Certain operations within Spark trigger an event known as the shuffle. The shuffle is Spark’s mechanism for re-distributing data so that it’s grouped differently across partitions.
相關文章
相關標籤/搜索