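This series reposts some official Spark documentation and Databricks blog posts. Most entries are bilingual (Chinese and English), mainly to help me improve my own English.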
A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets (Chinese-English bilingual), July 14, 2016
Apache Spark as a Compiler: Joining a Billion Rows per Second on a Laptop (Chinese-English bilingual), May 23, 2016
Deep Dive into Spark SQL’s Catalyst Optimizer (Chinese-English bilingual), April 13, 2015
What’s new for Spark SQL in Apache Spark 1.3 (Chinese-English bilingual), March 24, 2015
Introducing DataFrames in Apache Spark for Large Scale Data Science (Chinese-English bilingual), February 17, 2015