Recently a reader of my article "Integrating Kafka into Spark Streaming: Code Examples and Challenges" wrote a job that puts Kafka producer objects into a pool and ships the pool to the executors via broadcast.
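Roughly, the pattern looks like the sketch below, assuming Apache Commons Pool 2 and the Kafka producer API. The method name createKafkaProducerPool comes up again later; the other class names and config values are illustrative, not the reader's actual code.

```scala
import java.util.Properties

import org.apache.commons.pool2.{BasePooledObjectFactory, PooledObject}
import org.apache.commons.pool2.impl.{DefaultPooledObject, GenericObjectPool}
import org.apache.kafka.clients.producer.{KafkaProducer, ProducerRecord}

// Factory the pool uses to create producers on demand (illustrative name).
class KafkaProducerFactory(brokers: String)
    extends BasePooledObjectFactory[KafkaProducer[String, String]] {

  override def create(): KafkaProducer[String, String] = {
    val props = new Properties()
    props.put("bootstrap.servers", brokers)
    props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer")
    props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer")
    new KafkaProducer[String, String](props)
  }

  override def wrap(producer: KafkaProducer[String, String]): PooledObject[KafkaProducer[String, String]] =
    new DefaultPooledObject(producer)
}

object KafkaProducerPool {
  def createKafkaProducerPool(brokers: String): GenericObjectPool[KafkaProducer[String, String]] =
    new GenericObjectPool(new KafkaProducerFactory(brokers))
}

// Driver side: broadcast the pool, then reuse producers inside each partition.
// Note that GenericObjectPool itself is NOT java-serializable, which is exactly
// the kind of thing that makes a broadcast like this blow up; see below.
// val pool = ssc.sparkContext.broadcast(KafkaProducerPool.createKafkaProducerPool("broker1:9092"))
// stream.foreachRDD { rdd =>
//   rdd.foreachPartition { records =>
//     val producer = pool.value.borrowObject()
//     records.foreach(msg => producer.send(new ProducerRecord("output-topic", msg)))
//     pool.value.returnObject(producer)
//   }
// }
```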
Then, during development and testing, the job failed with a serialization error.
So they came to me and asked: "My code is pretty much the same as yours, so why does it throw this error?"
In fact, whatever you broadcast, Spark has to serialize, so the broadcast value must be serializable; you should also avoid broadcasting large objects.

Later I asked for the code and looked it over: the createKafkaProducerPool method had been pulled out into a separate class, and that class extends Serializable. My immediate reaction was that if createKafkaProducerPool were written in the main method, that is, on the driver side, this problem would surely not occur, and that is what I suggested. I also suspected the cluster had Kryo serialization enabled, and yet the class containing createKafkaProducerPool extends Serializable, which left me puzzled.
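That suggestion, creating the pool on the driver while keeping the broadcast value itself trivially serializable, corresponds roughly to the well-known "lazy sink" pattern sketched below. This is a minimal sketch, not the reader's code; KafkaSink and its config values are illustrative names of my own. The heavyweight KafkaProducer is never serialized: it is built lazily, once per executor JVM, from a small serializable factory closure.

```scala
import java.util.Properties

import org.apache.kafka.clients.producer.{KafkaProducer, ProducerRecord}

// The broadcast value is just a thin serializable shell plus a factory closure.
class KafkaSink(createProducer: () => KafkaProducer[String, String]) extends Serializable {
  // @transient + lazy: never serialized; each executor builds its own producer
  // the first time send() is called.
  @transient private lazy val producer = createProducer()

  def send(topic: String, value: String): Unit =
    producer.send(new ProducerRecord[String, String](topic, value))
}

object KafkaSink {
  def apply(brokers: String): KafkaSink = new KafkaSink(() => {
    val props = new Properties()
    props.put("bootstrap.servers", brokers)
    props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer")
    props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer")
    new KafkaProducer[String, String](props)
  })
}

// In main(), on the driver:
// val sink = ssc.sparkContext.broadcast(KafkaSink("broker1:9092"))
// stream.foreachRDD(_.foreach(msg => sink.value.send("output-topic", msg)))
```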
The closures (the anonymous functions passed into RDD.map(…)) are serialized by Spark before being distributed. Hadoop does not have this problem because it serializes the whole .jar in binary form and copies it over the network. Spark uses Java serialization by default, but that is very slow compared to, say, Kryo, so we use Kryo through a wrapper (Spark doesn't support Kryo serialization for closures, not yet).
And up till now the org.dbpedia.extraction.spark.serialize.KryoSerializationWrapper class has been working perfectly. Some freak extractors seem to fail, though.
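The wrapper idea in that quote can be sketched as a small Java-serializable shell that carries the wrapped value as a Kryo-encoded byte array. What follows is a simplified illustration assuming the Kryo library on the classpath; it is not the actual DBpedia KryoSerializationWrapper source, and real code would configure Kryo more carefully (e.g. via Twitter chill, as Spark itself does).

```scala
import java.io.{ByteArrayOutputStream, ObjectOutputStream}

import com.esotericsoftware.kryo.Kryo
import com.esotericsoftware.kryo.io.{Input, Output}

// Java-serializable shell around a Kryo-encoded payload. The wrapped value is
// transient, so plain Java serialization only ever sees the byte array.
class KryoSerializationWrapper[T](@transient private var value: T) extends Serializable {

  private var bytes: Array[Byte] = _

  def getValue: T = {
    if (value == null && bytes != null) {
      // First access after Java deserialization: decode the Kryo payload.
      val kryo = new Kryo()
      kryo.setRegistrationRequired(false) // sketch only; tune registration in real code
      value = kryo.readClassAndObject(new Input(bytes)).asInstanceOf[T]
      bytes = null
    }
    value
  }

  // Hook invoked by Java serialization: encode the value with Kryo first,
  // then let the default mechanism write the resulting byte array.
  private def writeObject(out: ObjectOutputStream): Unit = {
    val kryo = new Kryo()
    kryo.setRegistrationRequired(false)
    val buffer = new ByteArrayOutputStream()
    val output = new Output(buffer)
    kryo.writeClassAndObject(output, value)
    output.close()
    bytes = buffer.toByteArray
    out.defaultWriteObject()
  }
}

// Hypothetical usage inside a Spark job:
// val wrapped = new KryoSerializationWrapper(someNonJavaSerializableThing)
// rdd.map(x => wrapped.getValue.process(x))
```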
Copyright notice: this is the blogger's original article; it may not be reproduced without the blogger's permission.