hadoop0.20.2伪分布式环境搭建
虽然现在hadoop版本已经到了主流2点多的时代,但是对于学习大数据而言,我还是选择从以前老的版本0.20.2学起。下面就是伪分布式的环境搭建过程。
hadoop下载地址:
http://archive.apache.org/dist/hadoop/core/hadoop-0.20.2/hadoop-0.20.2.tar.gz
linux系统版本:centos7
1、配置主机名
# vi /etc/sysconfig/network
# Created by anaconda
master1
# hostname master1
2、创建管理hadoop的组和用户
# groupadd hduser
# useradd -g hduser hduser
# passwd hduser
3、hosts主机名ip解析
# vi /etc/hosts
192.168.11.131master1
4、配置hadoop的sudoers权限
# vi /etc/sudoers
hduserALL=(ALL) NOPASSWD:ALL
5、关闭selinux和防火墙
# vi /etc/sysconfig/selinux
SELINUX=enforcing --> SELINUX=disabled
# systemctl stop firewalld
# systemctl disable firewalld
6、解压包
# su hduser
$ cd
$ ll *tar*
-rw-r--r--. 1 root root44575568 Jun 16 17:24 hadoop-0.20.2.tar.gz
-rw-r--r--. 1 root root 288430080 Mar 162016 jdk1.7.0_79.tar
$ tar xf jdk1.7.0_79.tar
$ tar zxf hadoop-0.20.2.tar.gz
$ mv jdk1.7.0_79 jdk
$ mv hadoop-0.20.2 hadoop
7、配置java环境
$ vi .bashrc
export JAVA_HOME=/home/hduser/jdk
export JRE_HOME=$JAVA_HOME/jre
export PATH=$PATH:$JAVA_HOME/bin
export> $ source .bashrc
$ java -version
java version "1.7.0_79"
Java(TM) SE Runtime Environment (build 1.7.0_79-b15)
Java HotSpot(TM) 64-Bit Server VM (build 24.79-b02, mixed mode)
8、配置hadoop
$ pwd
/home/hduser/hadoop/conf
$ vi hadoop-env.sh
export JAVA_HOME=/home/hduser/jdk
$ vi core-site.xml
fs.default.name
hdfs://master1:9000
$ sudo mkdir -p /data/hadoop/data
$ sudo chown -R hduser:hduser /data/hadoop/data
$ vi hdfs-site.xml
dfs.data.dir
/data/hadoop/data
dfs.replication
1
$ vi mapred-site.xml
mapred.job.tracker
master1:9001
9、做无密码认证
$ cd
$ ssh-keygen -t rsa
Generating public/private rsa key pair.
Enter file in which to save the key (/home/hduser/.ssh/id_rsa):
Created directory '/home/hduser/.ssh'.
Enter passphrase (empty for no passphrase):
Enter same passphrase again:
Your> Your public key has been saved in /home/hduser/.ssh/id_rsa.pub.
The key fingerprint is:
SHA256:qRJhPSF32QDs9tU3e0/mAx/EBC2MHamGv2WPvUw19/M hduser@master1
The key's randomart image is:
+-------+
| ..+.o++o=|
| +.o. .. = o |
| o.o ... +|
| . .o. o.o. oo |
| .. .S.o..+o|
| . ... +..O|
| . . + *B+|
| . . .o==|
| oE|
+---------+
一路enter键
$ cd .ssh
$ ls
id_rsa >
$ cp> 10、格式化文件系统
$ cd
$ cdhadoop/bin
$ ./hadoop namenode -format
18/06/19 04:02:12 INFO namenode.NameNode: STARTUP_MSG:
/************************************************************
STARTUP_MSG: Starting NameNode
STARTUP_MSG: host = master1/192.168.11.131
STARTUP_MSG: args = [-format]
STARTUP_MSG: version = 0.20.2
STARTUP_MSG: build = https://svn.apache.org/repos/asf/hadoop/common/branches/branch-0.20 -r 911707; compiled by 'chrisdo' on Fri Feb 19 08:07:34 UTC 2010
************************************************************/
18/06/19 04:02:13 INFO namenode.FSNamesystem: fsOwner=hduser,hduser
18/06/19 04:02:13 INFO namenode.FSNamesystem: supergroup=supergroup
18/06/19 04:02:13 INFO namenode.FSNamesystem: isPermissionEnabled=true
18/06/19 04:02:13 INFO common.Storage: Image file of> 18/06/19 04:02:13 INFO common.Storage: Storage directory /tmp/hadoop-hduser/dfs/name has been successfully formatted.
18/06/19 04:02:13 INFO namenode.NameNode: SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down NameNode at master1/192.168.11.131
************************************************************/
11、启动服务
$ ./start-all.sh
starting namenode, logging to /home/hduser/hadoop/bin/../logs/hadoop-hduser-namenode-master1.out
The authenticity of host 'localhost (::1)' can't be established.
ECDSA key fingerprint is SHA256:OXYl4X6F6g4TV7YriZaSvuBIFM840h/qTg8/B7BUil0.
ECDSA key fingerprint is MD5:b6:b6:04:2d:49:70:8b:ed:65:00:e2:05:b0:95:5b:6d.
Are you sure you want to continue connecting (yes/no)? yes
localhost: Warning: Permanently added 'localhost' (ECDSA) to the list of known hosts.
localhost: starting datanode, logging to /home/hduser/hadoop/bin/../logs/hadoop-hduser-datanode-master1.out
localhost: starting secondarynamenode, logging to /home/hduser/hadoop/bin/../logs/hadoop-hduser-secondarynamenode-master1.out
starting jobtracker, logging to /home/hduser/hadoop/bin/../logs/hadoop-hduser-jobtracker-master1.out
localhost: starting tasktracker, logging to /home/hduser/hadoop/bin/../logs/hadoop-hduser-tasktracker-master1.out
12、查看服务
$ jps
1867 JobTracker
1804 SecondaryNameNode
1597 NameNode
1971 TaskTracker
2011 Jps
1710 DataNode
$
13、浏览器查看服务状态
使用web查看HSFS运行状态
在浏览器输入
http://192.168.11.131:50030 使用web查看MapReduce运行状态
在浏览器输入
http://192.168.11.131:50070
页:
[1]