大數據概述 Hadoop配置
阿新 • • 發佈:2019-04-28
examples tracing interact req .cn ctu dom ins cli Top
NSD ARCHITECTURE DAY05
- 案例1:安裝Hadoop
- 案例2:安裝配置Hadoop
1 案例1:安裝Hadoop
1.1 問題
本案例要求安裝單機模式Hadoop:
- 單機模式安裝Hadoop
- 安裝JAVA環境
- 設置環境變量,啟動運行
1.2 步驟
實現此案例需要按照如下步驟進行。
步驟一:環境準備
1)配置主機名為nn01,ip為192.168.1.21,配置yum源(系統源)
備註:由於在之前的案例中這些都已經做過,這裏不再重復,不會的學員可以參考之前的案例
2)安裝java環境
- [[email protected] ~]# yum -y install java-1.8.0-openjdk-devel
- [[email protected] ~]# java -version
- openjdk version "1.8.0_131"
- OpenJDK Runtime Environment (build 1.8.0_131-b12)
- OpenJDK 64-Bit Server VM (build 25.131-b12, mixed mode)
- [[email protected] ~]# jps
- 1235 Jps
3)安裝hadoop
- [[email protected] ~]# tar -xf hadoop-2.7.6.tar.gz
- [[email protected] ~]# mv hadoop-2.7.6 /usr/local/hadoop
- [[email protected] ~]# cd /usr/local/hadoop/
- [[email protected] hadoop]# ls
- bin include libexec NOTICE.txt sbin
- etc lib LICENSE.txt README.txt share
- [[email protected] hadoop]# ./bin/hadoop //報錯,JAVA_HOME沒有找到
- Error: JAVA_HOME is not set and could not be found.
- [[email protected] hadoop]#
4)解決報錯問題
- [[email protected] hadoop]# rpm -ql java-1.8.0-openjdk
- [[email protected] hadoop]# cd ./etc/hadoop/
- [[email protected] hadoop]# vim hadoop-env.sh
- 25 export \
- JAVA_HOME="/usr/lib/jvm/java-1.8.0-openjdk-1.8.0.131-11.b12.el7.x86_64/jre"
- 33 export HADOOP_CONF_DIR="/usr/local/hadoop/etc/hadoop"
- [[email protected] ~]# cd /usr/local/hadoop/
- [[email protected] hadoop]# ./bin/hadoop
- Usage: hadoop [--config confdir] [COMMAND | CLASSNAME]
- CLASSNAME run the class named CLASSNAME
- or
- where COMMAND is one of:
- fs run a generic filesystem user client
- version print the version
- jar <jar> run a jar file
- note: please use "yarn jar" to launch
- YARN applications, not this command.
- checknative [-a|-h] check native hadoop and compression libraries availability
- distcp <srcurl> <desturl> copy file or directories recursively
- archive -archiveName NAME -p <parent path> <src>* <dest> create a hadoop archive
- classpath prints the class path needed to get the
- credential interact with credential providers
- Hadoop jar and the required libraries
- daemonlog get/set the log level for each daemon
- trace view and modify Hadoop tracing settings
- Most commands print help when invoked w/o parameters.
- [[email protected] hadoop]# mkdir /usr/local/hadoop/aa
- [[email protected] hadoop]# ls
- bin etc include lib libexec LICENSE.txt NOTICE.txt aa README.txt sbin share
- [[email protected] hadoop]# cp *.txt /usr/local/hadoop/aa
- [[email protected] hadoop]# ./bin/hadoop jar \
- share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.6.jar wordcount aa bb //wordcount為參數 統計aa這個文件夾,存到bb這個文件裏面(這個文件不能存在,要是存在會報錯,是為了防止數據覆蓋)
- [[email protected] hadoop]# cat bb/part-r-00000 //查看
2 案例2:安裝配置Hadoop
2.1 問題
本案例要求:
- 另備三臺虛擬機,安裝Hadoop
- 使所有節點能夠ping通,配置SSH信任關系
- 節點驗證
2.2 方案
準備四臺虛擬機,由於之前已經準備過一臺,所以只需再準備三臺新的虛擬機即可,安裝hadoop,使所有節點可以ping通,配置SSH信任關系,如圖-1所示:
圖-1
2.3 步驟
實現此案例需要按照如下步驟進行。
步驟一:環境準備
1)三臺機器配置主機名為node1、node2、node3,配置ip地址(ip如圖-1所示),yum源(系統源)
2)編輯/etc/hosts(四臺主機同樣操作,以nn01為例)
- [[email protected] ~]# vim /etc/hosts
- 192.168.1.21 nn01
- 192.168.1.22 node1
- 192.168.1.23 node2
- 192.168.1.24 node3
3)安裝java環境,在node1,node2,node3上面操作(以node1為例)
- [[email protected] ~]# yum -y install java-1.8.0-openjdk-devel
4)布置SSH信任關系
- [[email protected] ~]# vim /etc/ssh/ssh_config //第一次登陸不需要輸入yes
- Host *
- GSSAPIAuthentication yes
- StrictHostKeyChecking no
- [[email protected] .ssh]# ssh-keygen
- Generating public/private rsa key pair.
- Enter file in which to save the key (/root/.ssh/id_rsa):
- Enter passphrase (empty for no passphrase):
- Enter same passphrase again:
- Your identification has been saved in /root/.ssh/id_rsa.
- Your public key has been saved in /root/.ssh/id_rsa.pub.
- The key fingerprint is:
- SHA256:Ucl8OCezw92aArY5+zPtOrJ9ol1ojRE3EAZ1mgndYQM [email protected]
- The key‘s randomart image is:
- +---[RSA 2048]----+
- | o*E*=. |
- | +XB+. |
- | ..=Oo. |
- | o.+o... |
- | .S+.. o |
- | + .=o |
- | o+oo |
- | o+=.o |
- | o==O. |
- +----[SHA256]-----+
- [[email protected] .ssh]# for i in 21 22 23 24 ; do ssh-copy-id 192.168.1.$i; done
- //部署公鑰給nn01,node1,node2,node3
5)測試信任關系
- [[email protected] .ssh]# ssh node1
- Last login: Fri Sep 7 16:52:00 2018 from 192.168.1.21
- [[email protected] ~]# exit
- logout
- Connection to node1 closed.
- [[email protected] .ssh]# ssh node2
- Last login: Fri Sep 7 16:52:05 2018 from 192.168.1.21
- [[email protected] ~]# exit
- logout
- Connection to node2 closed.
- [[email protected] .ssh]# ssh node3
步驟二:配置hadoop
1)修改slaves文件
- [[email protected] ~]# cd /usr/local/hadoop/etc/hadoop
- [[email protected] hadoop]# vim slaves
- node1
- node2
- node3
2)hadoop的核心配置文件core-site
- [[email protected] hadoop]# vim core-site.xml
- <configuration>
- <property>
- <name>fs.defaultFS</name>
- <value>hdfs://nn01:9000</value>
- </property>
- <property>
- <name>hadoop.tmp.dir</name>
- <value>/var/hadoop</value>
- </property>
- </configuration>
- [[email protected] hadoop]# mkdir /var/hadoop //hadoop的數據根目錄
- [[email protected] hadoop]# ssh node1 mkdir /var/hadoop
- [[email protected] hadoop]# ssh node2 mkdir /var/hadoop
- [[email protected] hadoop]# ssh node3 mkdir /var/hadoop
3)配置hdfs-site文件
- [[email protected] hadoop]# vim hdfs-site.xml
- <configuration>
- <property>
- <name>dfs.namenode.http-address</name>
- <value>nn01:50070</value>
- </property>
- <property>
- <name>dfs.namenode.secondary.http-address</name>
- <value>nn01:50090</value>
- </property>
- <property>
- <name>dfs.replication</name>
- <value>2</value>
- </property>
- </configuration>
4)同步配置到node1,node2,node3
- [[email protected] hadoop]# yum –y install rsync //同步的主機都要安裝rsync
- [[email protected] hadoop]# for i in 22 23 24 ; do rsync -aSH --delete /usr/local/hadoop/
- \ 192.168.1.$i:/usr/local/hadoop/ -e ‘ssh‘ & done
- [1] 23260
- [2] 23261
- [3] 23262
5)查看是否同步成功
- [[email protected] hadoop]# ssh node1 ls /usr/local/hadoop/
- bin
- etc
- include
- lib
- libexec
- LICENSE.txt
- NOTICE.txt
- bb
- README.txt
- sbin
- share
- aa
- [[email protected] hadoop]# ssh node2 ls /usr/local/hadoop/
- bin
- etc
- include
- lib
- libexec
- LICENSE.txt
- NOTICE.txt
- bb
- README.txt
- sbin
- share
- aa
- [[email protected] hadoop]# ssh node3 ls /usr/local/hadoop/
- bin
- etc
- include
- lib
- libexec
- LICENSE.txt
- NOTICE.txt
- bb
- README.txt
- sbin
- share
- aa
步驟三:格式化
- [[email protected] hadoop]# cd /usr/local/hadoop/
- [[email protected] hadoop]# ./bin/hdfs namenode -format //格式化 namenode
- [[email protected] hadoop]# ./sbin/start-dfs.sh //啟動
- [[email protected] hadoop]# jps //驗證角色
- 23408 NameNode
- 23700 Jps
- 23591 SecondaryNameNode
- [[email protected] hadoop]# ./bin/hdfs dfsadmin -report //查看集群是否組建成功
- Live datanodes (3): //有三個角色成功
大數據概述 Hadoop配置