大數據技術框架淺析

 IBMVolume()Velocity()Variety()Value()Veracity() & 
node

 

    OLTP(On-Line Transaction Processing:)OLAP(On-Line Analytical Processing:)spa

    OLTPblog

    OLAP(Data WareHouse)class

    OLAPHDFS(Hadoop Distributed File System:Hadoop)Googleim

    GFS(Google File System:Google)
img

    Page Rank()(BigTable)()
co

    BigTableGoogle
360

 

    Hadoop(Map Reduce)SparkImpalaStorm(Redis)data

 

 

    
ps

 

    (JBus)SqoopFlume

    Sqoop使SQLHadoopSqoop使JDBC

    FlumeFlumeSourceChannelSinkSourceChannelChannelSink使SinkChannelHDFSFlumeFlume使Source使Avro FlumeAvro()ThriftSyslogNetcat

 

    (Master/Slave)MasterZookeeper

    HDFSNameNodeDataNodesHDFS1HDFS 2.0128M

    YarnMapReduceMapperReducerMapperReducer2.0MapReduceIOSpark

    SparkMapReduceSpark CoreSpark SQLSpark StreamingSpark MLLib(ALS)Spark Graphx()

    HiveSQLMapReduce

    PigMapReduce  

    Storm()HDFSStorm

 

    ()

 

    

相關文章
相關標籤/搜索