Building a distributed big data platform on Hadoop is a common approach. Hadoop grew out of Google's GFS (Google File System) and is an open source implementation of it. This post collects references for studying big data technology, especially those useful when setting up a Hadoop environment. The list will be updated as better references turn up during later study.
References:
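1. Principles and Applications of Big Data Technology: Concepts, Storage, Processing, Analysis, and Application (in Chinese)
(Edited by Lin Ziyu, Posts & Telecom Press, 2nd edition, February 2017)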
2. Hadoop: The Definitive Guide, Tom White, 4th Edition, April 2015.
(http://vdisk.weibo.com/s/u5ntMYF7_5pe)
3. https://www.tutorialspoint.com/hadoop/index.htm
(Introduces the basic concepts of big data and focuses mainly on Hadoop environment setup, covered in detail.)
4. http://www.apache.org/
"The Apache Software Foundation is a cornerstone of the modern Open Source software ecosystem - supporting some of the most widely used and important software solutions powering today's Internet economy." - Mark Driver, Research Vice President, Gartner
The Apache project list covers most of the software used in big data technology, for example Hadoop, Spark, Mahout, ZooKeeper, Sqoop, Pig, Hive, HBase, Flume, and so on. You can download the releases you need and then learn to install each one by following its official guide. Being able to do this is a basic requirement for studying big data technology.