Spark保存HDFS示例

時間 2019-11-06

標籤 spark 保存 hdfs 示例欄目 Spark 简体版

原文原文鏈接

def saveAsNewAPIHadoopFile(
path: String,
keyClass: Class[_],
valueClass: Class[_],
outputFormatClass: Class[_ <: NewOutputFormat[_, _]],
conf: Configuration = self.context.hadoopConfiguration): Unit = self.withScope {
// Rename this as hadoopConf internally to avoid shadowing (see SPARK-2038).
val hadoopConf = conf
val job = NewAPIHadoopJob.getInstance(hadoopConf)
job.setOutputKeyClass(keyClass)
job.setOutputValueClass(valueClass)
job.setOutputFormatClass(outputFormatClass)
val jobConfiguration = job.getConfiguration
jobConfiguration.set("mapreduce.output.fileoutputformat.outputdir", path)
saveAsNewAPIHadoopDataset(jobConfiguration)
}
oop

1. Spark整合HDFS、WordCount示例
2. hadoop: hdfs API示例
3. Spark GraphX示例
4. spark示例
5. Spark Streaming示例
6. Spark MLLib示例
7. Spark SQL示例
8. Spark從hdfs下讀取txt文件並保存到hdfs目錄下
9. 使用spark streaming使用snappy壓縮保存數據到HDFS中
10. 示例vuex commit保存數據技巧
更多相關文章...
• Thymeleaf+SpringMVC5示例 - Thymeleaf 教程
• Thymeleaf Servlet Hellow World示例 - Thymeleaf 教程
• 三篇文章瞭解 TiDB 技術內幕——說存儲
• SpringBoot中properties文件不能自動提示解決方法

相關標籤/搜索

hadoop+hdfs+yarn+spark