linux系統: CentOS6.7 Hadoop版本: 2.6.5 zookeeper版本: 3.4.8
192.168.179.201: m1 192.168.179.202: m2 192.168.179.203: m3 m1: Zookeeper, Namenode, DataNode, ResourceManager, NodeManager, Master, Worker m2: Zookeeper, Namenode, DataNode, ResourceManager, NodeManager, Worker m3: Zookeeper, DataNode, NodeManager, Worker
官方資料: https://cwiki.apache.org/confluence/display/Hive/Parquet
Hive擴展Parquet有如下兩種方案:(hive自0.13版本之後已自帶paruqet功能, 沒必要再擴展parquet功能)
經過yum下載parquet資源包 (此方案大多數狀況下沒法找到parquet, 緣由是沒有配置YUM源, 如何配置YUM源請自行查找資料)java
sudo yum -y install parquet
將下載到/usr/lib/parquet
目錄下的jar
(除javadoc.jar
和sources.jar
外)拷貝到$HIVE_HOME/lib目錄下node
cp /usr/lib/parquet/* $HIVE_HOME/lib
$HIVE_HOME/lib
目錄cd /home/centos/soft/hive/lib
執行如下指令, 下載parquet資源包(注意:parquet1.2.5
版本對應Hive 0.12
版本,每一個版本的Hive對應的Parquet版本不一樣, 具體Hive所對應的版本請自行下載源碼查看)linux
for f in parquet-avro parquet-cascading parquet-column parquet-common parquet-encoding parquet-generator parquet-hadoop parquet-hive parquet-pig parquet-pig-bundle parquet-scrooge parquet-test-hadoop2 parquet-thrift do curl -O https://oss.sonatype.org/service/local/repositories/releases/content/com/twitter/${f}/1.2.5/${f}-1.2.5.jar curl -O https://oss.sonatype.org/service/local/repositories/releases/content/com/twitter/parquet-format/1.2.5/parquet-format-1.2.5.jar done