擅长:python、mysql、java
<p>这可能会有帮助:
<a href="https://spark.apache.org/docs/latest/programming-guide.html#shuffle-operations" rel="nofollow">https://spark.apache.org/docs/latest/programming-guide.html#shuffle-operations</a></p>
<p>或者这个:
<a href="http://www.slideshare.net/SparkSummit/dev-ops-training" rel="nofollow">http://www.slideshare.net/SparkSummit/dev-ops-training</a>,从幻灯片208开始</p>
<p>从幻灯片209:
“使用诸如distinct之类的“numPartitions”的转换可能会洗牌”</p>