大數據學習 概念篇

1.理解概念sql

  Hadoop 數據庫

Hadoop是一個由Apache基金會所開發的分佈式系統基礎架構。
用戶可在不瞭解分佈式底層細節的狀況下,開發分佈式程序。充分利用集羣的威力進行高速運算和存儲

  spark架構

Spark是當前最流行的開源大數據內存計算框架,用Scala語言實現,由UC伯克利大學AMPLab實驗室開發並於2010年開源。
以通用、易用爲目標,高速發展後成爲最活躍的Apache開源項目。

  hive   app

Hive是基於Hadoop的一個數據倉庫工具。
能夠將結構化的數據文件映射爲一張數據庫表,並提供簡單的sql查詢功能,能夠將sql語句轉換爲MapReduce任務進行運行。

  Hbase框架

HBase – Hadoop Database,是一個高可靠性、高性能、面向列、可伸縮的分佈式存儲系統。
利用HBase技術可在廉價PC Server上搭建起大規模結構化存儲集羣。

  zookeeper分佈式

 官方介紹 工具

ZooKeeper is a centralized service for maintaining configuration information, naming, providing distributed synchronization, and providing group services. All of these kinds of services are used in some form or another by distributed applications. Each time they are implemented there is a lot of work that goes into fixing the bugs and race conditions that are inevitable. Because of the difficulty of implementing these kinds of services, applications initially usually skimp on them ,which make them brittle in the presence of change and difficult to manage. Even when done correctly, different implementations of these services lead to management complexity when the applications are deployed.oop

相關文章
相關標籤/搜索