Configure a virtual host on the cluster's Master node so that its directories can be browsed through Chrome.
Before setting up the virtual host, first install Apache2:
sudo apt-get install apache2
Then install PHP5:
sudo apt-get install php5
Next, go to the /etc/apache2/sites-available directory and add a "*.conf" file.
Write the following into that file:
<VirtualHost *:80>
    ServerName author.xxx.com
    ServerAdmin author.xxx.com
    DocumentRoot "/home/author"
    <Directory "/home/author">
        Options Indexes
        AllowOverride all
        Order allow,deny
        IndexOptions Charset=UTF-8
        Allow from all
        Require all granted
    </Directory>
    <IfModule dir_module>
        DirectoryIndex index.html
    </IfModule>
    ErrorLog ${APACHE_LOG_DIR}/authors_errors.log
    CustomLog ${APACHE_LOG_DIR}/authors_access.log combined
</VirtualHost>
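Note that this block mixes the Apache 2.2 access-control style (Order/Allow) with the 2.4 style (Require all granted); on any given Apache version only the matching style is actually needed. For a quick sanity check, here is a sketch (author.xxx.com and 192.168.223.1 are placeholders for your own host name and server IP):
echo "192.168.223.1  author.xxx.com" | sudo tee -a /etc/hosts
curl -I http://author.xxx.com/    # expect HTTP 200 and the directory index page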
The result is that when you visit author.xxx.com in the browser, a tree-style directory listing is shown. If you want to turn the listing off (for security), change
Options Indexes
IndexOptions Charset=UTF-8
to
Options FollowSymLinks
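After editing, validate the syntax and restart Apache so the change takes effect:
sudo apachectl -t
sudo /etc/init.d/apache2 restart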
The log files for this virtual host,
authors_errors.log
authors_access.log
are both located under /var/log/apache2 (the default ${APACHE_LOG_DIR}), where you can inspect the Apache logs with root privileges.
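To follow them live, for example (file names taken from the VirtualHost block above):
sudo tail -f /var/log/apache2/authors_errors.log
sudo tail -n 50 /var/log/apache2/authors_access.log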
Once the configuration file is finished, enable it and restart Apache by running the following commands:
sudo a2ensite xxx.conf
sudo /etc/init.d/apache2 restart
The preceding steps are basically the same elsewhere; only the commands for restarting with the new configuration differ:
sudo apachectl -v           # show the Apache version
sudo apachectl -t           # check the virtual-host configuration syntax
sudo apachectl -k restart   # restart Apache
For the specific parameters of this cluster setup, see this author's previous article, 「機房4臺服務器集羣網絡配置」 (network configuration for the four-server cluster in the machine room).
After executing the following status query on the Master, the error below appeared:
hdfs dfsadmin -report
Configured Capacity: 0 (0 B)
Present Capacity: 0 (0 B)
DFS Remaining: 0 (0 B)
DFS Used: 0 (0 B)
DFS Used%: NaN%
Under replicated blocks: 0
Blocks with corrupt replicas: 0
Missing blocks: 0
Missing blocks (with replication factor 1):
All values are 0, and there is no feedback from slave1, slave2, or slave3; in other words, none of the DataNodes has registered with the NameNode.
Solution:
Create a new configuration directory:
mkdir /home/hadoop/usr/hadoop/conf
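For the Hadoop daemons to actually read this directory, it has to be the active configuration directory; one common way (an assumption on this editor's part, adjust to your own setup) is to export HADOOP_CONF_DIR in the hadoop user's shell profile:
export HADOOP_CONF_DIR=/home/hadoop/usr/hadoop/conf
Then place the following configuration files in the directory: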
core-site.xml
<?xml version="1.0" encoding="UTF-8"?> <?xml-stylesheet type="text/xsl" href="configuration.xsl"?> <configuration> <property> <name>hadoop.tmp.dir</name> <value>/usr/hadoop/tmp</value> <description>A base for other temporary directories.</description> </property> <!--file system properties--> <property> <name>fs.default.name</name> <value>hdfs://192.168.223.1:9000</value> </property> </configuration>
hdfs-site.xml
<?xml version="1.0" encoding="UTF-8"?> <?xml-stylesheet type="text/xsl" href="configuration.xsl"?> <configuration> <property> <name>dfs.replication</name> <value>1</value> </property> </configuration>
mapred-site.xml (job/task tracker configuration for older Hadoop versions):
<?xml version="1.0"?> <?xml-stylesheet type="text/xsl" href="configuration.xsl"?> <configuration> <property> <name>mapred.job.tracker</name> <value>http://192.168.223.1:9001</value> </property> </configuration>
mapred-site.xml (configuration for Hadoop 2.2 and later):
<configuration>
    <property>
        <name>mapreduce.framework.name</name>
        <value>yarn</value>
    </property>
</configuration>
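Assuming the configuration directory above is the active one, you can sanity-check the effective values before starting anything (hdfs getconf reads the same configuration the daemons will use):
hdfs getconf -confKey fs.default.name             # expect hdfs://192.168.223.1:9000
hdfs getconf -confKey mapreduce.framework.name    # expect yarn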
yarn-site.xml (configuration file on the Master):
<?xml version="1.0" encoding="UTF-8"?> <configuration> <property> <name>yarn.resourcemanager.hostname</name> <value>resourcemanager.company.com</value> </property> <property> <description>Classpath for typical applications.</description> <name>yarn.application.classpath</name> <value> $HADOOP_CONF_DIR, $HADOOP_COMMON_HOME/*,$HADOOP_COMMON_HOME/lib*/, $HADOOP_HDFS_HOME/*,$HADOOP_HDFS_HOME/lib/*, $HADOOP_MAPRED_HOME/*,$HADOOP_MAPRED_HOME/lib/*, $HADOOP_YARN_HOME/*,$HADOOP_YARN_HOME/lib/* </value>
</property>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.local-dirs</name>
<value>file:///data/1/yarn/local,file:///data/2/yarn/local,file:///data/3/yarn/local</value>
</property>
<property>
<name>yarn.nodemanager.log-dirs</name>
<value>file:///data/1/yarn/logs,file:///data/2/yarn/logs,file:///data/3/yarn/logs</value>
</property>
<property>
<name>yarn.log.aggregation-enable</name>
<value>true</value>
</property>
<property>
<description>Where to aggregate logs</description>
<name>yarn.nodemanager.remote-app-log-dir</name>
<value>hdfs://<namenode-host.company.com>:8020/var/log/hadoop-yarn/apps</value>
</property><!-- Site specific YARN configuration properties --></configuration>
To go with the settings in yarn-site.xml, the corresponding directories need to be created:
$ sudo mkdir -p /data/1/yarn/local /data/2/yarn/local /data/3/yarn/local /data/4/yarn/local
$ sudo mkdir -p /data/1/yarn/logs /data/2/yarn/logs /data/3/yarn/logs /data/4/yarn/logs
Set the owner of the yarn.nodemanager.local-dirs directories to the hadoop user:
$ sudo chown -R hadoop:hadoop /data/1/yarn/local /data/2/yarn/local /data/3/yarn/local /data/4/yarn/local
Set the owner of the yarn.nodemanager.log-dirs directories to the hadoop user:
$ sudo chown -R hadoop:hadoop /data/1/yarn/logs /data/2/yarn/logs /data/3/yarn/logs /data/4/yarn/logs
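A quick way to confirm the ownership took effect:
ls -ld /data/*/yarn/local /data/*/yarn/logs    # each entry should be owned by hadoop:hadoop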
The yarn-site.xml configuration on the slaves is used to communicate with the master node, so the IP addresses and port numbers are all the master's:
<?xml version="1.0"?> <configuration> <property> <name> yarn.nodemanager.aux-services </name> <value> mapreduce_shuffle </value> </property> <property> <name> yarn.nodemanager.auxservices.mapreduce.shuffle.class </name> <value> org.apache.hadoop.mapred.ShuffleHandler </value> </property> <property> <name> yarn.resourcemanager.address </name> <value> 192.168.223.1:8032 </value> </property> <property> <name> yarn.resourcemanager.scheduler.address </name> <value> 192.168.223.1:8030 </value> </property> <property> <name> yarn.resourcemanager.resource-tracker.address </name> <value> 192.168.223.1:8031 </value> </property> <property> <name> yarn.resourcemanager.hostname </name> <value> 192.168.223.1 </value> </property> <!-- Site specific YARN configuration properties --> </configuration>
masters (configuration file on the master node):
192.168.223.1
slaves (in this configuration file on the master node, fill in the corresponding IPs as follows):
192.168.223.2
192.168.223.3
192.168.223.4
slaves (this configuration file on the slave nodes):
localhost
Start the cluster as follows:
hadoop@master:/usr/hadoop$ hadoop namenode -format
hadoop@master:/usr/hadoop$ sbin/start-all.sh    (if the cluster is already running, run sbin/stop-all.sh first)
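Once start-all.sh finishes, jps (shipped with the JDK) should show the expected daemons; roughly, for a Hadoop 2.x layout like this one:
hadoop@master:/usr/hadoop$ jps    # expect NameNode, SecondaryNameNode and ResourceManager
On each slave, jps should list DataNode and NodeManager.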
Check the cluster by executing the following command:
hadoop@master:/usr/hadoop$ hdfs dfsadmin -report
If you get a result like the following, the installation is correct:
Configured Capacity: 4958160830464 (4.51 TB)
Present Capacity: 4699621490688 (4.27 TB)
DFS Remaining: 4699621404672 (4.27 TB)
DFS Used: 86016 (84 KB)
DFS Used%: 0.00%
Under replicated blocks: 0
Blocks with corrupt replicas: 0
Missing blocks: 0
Missing blocks (with replication factor 1): 0

-------------------------------------------------
Live datanodes (3):

Name: 192.168.223.3:50010 (slave3)
Hostname: slave3
Decommission Status : Normal
Configured Capacity: 1697554399232 (1.54 TB)
DFS Used: 28672 (28 KB)
Non DFS Used: 88462258176 (82.39 GB)
DFS Remaining: 1609092112384 (1.46 TB)
DFS Used%: 0.00%
DFS Remaining%: 94.79%
Configured Cache Capacity: 0 (0 B)
Cache Used: 0 (0 B)
Cache Remaining: 0 (0 B)
Cache Used%: 100.00%
Cache Remaining%: 0.00%
Xceivers: 1
Last contact: Sat Nov 14 21:40:02 CST 2015

Name: 192.168.223.2:50010 (slave2)
Hostname: slave2
Decommission Status : Normal
Configured Capacity: 1697938153472 (1.54 TB)
DFS Used: 28672 (28 KB)
Non DFS Used: 88474435584 (82.40 GB)
DFS Remaining: 1609463689216 (1.46 TB)
DFS Used%: 0.00%
DFS Remaining%: 94.79%
Configured Cache Capacity: 0 (0 B)
Cache Used: 0 (0 B)
Cache Remaining: 0 (0 B)
Cache Used%: 100.00%
Cache Remaining%: 0.00%
Xceivers: 1
Last contact: Sat Nov 14 21:40:02 CST 2015

Name: 192.168.223.4:50010 (slave4)
Hostname: slave4
Decommission Status : Normal
Configured Capacity: 1562668277760 (1.42 TB)
DFS Used: 28672 (28 KB)
Non DFS Used: 81602646016 (76.00 GB)
DFS Remaining: 1481065603072 (1.35 TB)
DFS Used%: 0.00%
DFS Remaining%: 94.78%
Configured Cache Capacity: 0 (0 B)
Cache Used: 0 (0 B)
Cache Remaining: 0 (0 B)
Cache Used%: 100.00%
Cache Remaining%: 0.00%
Xceivers: 1
Last contact: Sat Nov 14 21:40:02 CST 2015
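You can likewise confirm that the NodeManagers registered with the ResourceManager:
hadoop@master:/usr/hadoop$ yarn node -list    # should report one RUNNING node per slave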
Command to create a user directory in the HDFS file system:
hadoop fs -mkdir -p /user/[current login user]
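A minimal usage check once the directory exists (assuming the login user is hadoop, so the directory is /user/hadoop):
hadoop fs -put /etc/hosts /user/hadoop/    # upload a local file
hadoop fs -ls /user/hadoop                 # list the directory
hadoop fs -cat /user/hadoop/hosts          # print the file contents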
After the HDFS user directory has been created, you can access the HDFS file system; for the full set of HDFS shell commands, refer to the following link.
Viewing the current Hadoop status through a web page:
http://10.1.8.200:50070/ (this IP is the externally reachable IP of the master node; substitute your own IP)
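Assuming default ports, the YARN ResourceManager web UI is served on port 8088 of the same host, e.g.:
http://10.1.8.200:8088/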
For the detailed installation steps, refer to the link.