111 posted on 2018-10-30 13:11:49

Hadoop-2.5.1 Cluster Installation and Configuration Notes

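1.Environment
1.1.Virtual machines
  Prepare three virtual machines running 64-bit CentOS, using the minimal installation.
  (I wanted to run more virtual machines, but my laptop has limited memory and can run at most three at once.)
  Every virtual machine has a static IP address, hostname resolution configured, and its clock synchronized with the others.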
  192.168.17.100 nameNode
  192.168.17.101 dataNode1
  192.168.17.102 dataNode2
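
  The three mappings above can be made resolvable on every VM through /etc/hosts. The following is a minimal sketch, assuming resolution is done with /etc/hosts rather than a DNS server (the notes do not say which); add_cluster_hosts is a hypothetical helper name, not part of the original setup.

```shell
# Hypothetical helper: append the cluster entries to a hosts file,
# skipping any hostname that is already present.
add_cluster_hosts() {
    hosts_file="${1:-/etc/hosts}"
    while read -r ip name; do
        # -w matches the hostname as a whole word only.
        if ! grep -qw "$name" "$hosts_file" 2>/dev/null; then
            printf '%s %s\n' "$ip" "$name" >> "$hosts_file"
        fi
    done <<'EOF'
192.168.17.100 nameNode
192.168.17.101 dataNode1
192.168.17.102 dataNode2
EOF
}
# Usage (as root, on every node): add_cluster_hosts /etc/hosts
```

  Running it twice leaves the file unchanged the second time, so it is safe to repeat on each node.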
2.Installation
2.1.Before installing
2.1.1.Install wget and ssh
  These are used for downloading files and for SSH login.

[*]  yum -y install wget
[*]
[*]  yum -y install openssh*
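2.1.2.Install the JDK and configure environment variables
  Omitted...
2.1.3.Configure SSH key-based automatic login
  In a Hadoop cluster, the nameNode node must be able to log in to the dataNode nodes over SSH without a password.
  Change to the SSH directory: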

[*]  # cd .ssh
[*]  #
  Generate the public/private key pair:

[*]  # ssh-keygen -t rsa
[*]  Generating public/private rsa key pair.
[*]  Enter file in which to save the key (/root/.ssh/id_rsa):
[*]  Enter passphrase (empty for no passphrase):
[*]  Enter same passphrase again:
[*]  Your identification has been saved in /root/.ssh/id_rsa.
[*]  Your public key has been saved in /root/.ssh/id_rsa.pub.
[*]  The key fingerprint is:
[*]  98:3c:31:5c:23:21:73:a0:a0:1f:c6:d3:c3:dc:58:32 root@gifer
[*]  The key's randomart image is:
[*]  +--[ RSA 2048]----+
[*]  |.   E.=.o      |
[*]  |.o = @ o .       |
[*]  |. * * =          |
[*]  | o o o =         |
[*]  |.   = S      |
[*]  |       .         |
[*]  |               |
[*]  |               |
[*]  |               |
[*]  +-----------------+
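  The randomart output indicates that the key pair was generated. Two new files appear in the directory:
  Private key file: id_rsa
  Public key file: id_rsa.pub
  Append the contents of the public key file id_rsa.pub to the authorized_keys file: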

[*]  cat id_rsa.pub >> authorized_keys
  Distribute the authorized_keys file to each dataNode node:

[*]  scp authorized_keys root@dataNode1:/root/.ssh/
[*]  scp authorized_keys root@dataNode2:/root/.ssh/
  Verify passwordless SSH login:

[*]  # ssh root@dataNode1
[*]  Last login: Sun Sep 21 11:38:05 2014 from 192.168.17.1
  If you see the output above, the configuration succeeded. If you are still prompted to enter a password, the configuration failed.
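
  The manual login test above can be repeated for every dataNode in one pass. The following is a rough sketch, not part of the original notes; check_passwordless_ssh is a hypothetical helper name. It relies on the OpenSSH options BatchMode and ConnectTimeout so that a host still requiring a password is reported as failed instead of stopping at a prompt.

```shell
# Hypothetical helper: report whether root can log in to each host without a password.
check_passwordless_ssh() {
    for host in "$@"; do
        # BatchMode=yes makes ssh fail instead of prompting for a password.
        if ssh -o BatchMode=yes -o ConnectTimeout=5 "root@$host" true 2>/dev/null; then
            echo "$host: ok"
        else
            echo "$host: FAILED"
        fi
    done
}
# Usage (on nameNode): check_passwordless_ssh dataNode1 dataNode2
```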
2.2.Installing
  Download hadoop-2.5.1, the latest release at the time of writing:

[*]  wget http://mirrors.hust.edu.cn/apache/hadoop/common/hadoop-2.5.1/hadoop-2.5.1.tar.gz
  Extract the archive:

[*]  tar -zxf hadoop-2.5.1.tar.gz
2.3.Configuration files
  Change to the configuration directory: cd hadoop-2.5.1/etc/hadoop
2.3.1.core-site.xml

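[*]  <configuration>
[*]    <property>
[*]      <name>hadoop.tmp.dir</name>
[*]      <value>/home/hadoop/tmp</value>
[*]      <description>A base for other temporary directories.</description>
[*]    </property>
[*]    <property>
[*]      <name>fs.defaultFS</name>
[*]      <value>hdfs://nameNode:9000</value>
[*]    </property>
[*]    <property>
[*]      <name>io.file.buffer.size</name>
[*]      <value>4096</value>
[*]    </property>
[*]  </configuration>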
2.3.2.hdfs-site.xml

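[*]  <configuration>
[*]    <property>
[*]      <name>dfs.nameservices</name>
[*]      <value>hadoop-cluster1</value>
[*]    </property>
[*]    <property>
[*]      <name>dfs.namenode.secondary.http-address</name>
[*]      <value>nameNode:50090</value>
[*]    </property>
[*]    <property>
[*]      <name>dfs.namenode.name.dir</name>
[*]      <value>file:///home/hadoop/dfs/name</value>
[*]    </property>
[*]    <property>
[*]      <name>dfs.datanode.data.dir</name>
[*]      <value>file:///home/hadoop/dfs/data</value>
[*]    </property>
[*]    <property>
[*]      <name>dfs.replication</name>
[*]      <value>2</value>
[*]    </property>
[*]    <property>
[*]      <name>dfs.webhdfs.enabled</name>
[*]      <value>true</value>
[*]    </property>
[*]  </configuration>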
2.3.3.mapred-site.xml

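[*]  <configuration>
[*]    <property>
[*]      <name>mapreduce.framework.name</name>
[*]      <value>yarn</value>
[*]    </property>
[*]    <property>
[*]      <name>mapreduce.jobtracker.http.address</name>
[*]      <value>nameNode:50030</value>
[*]    </property>
[*]    <property>
[*]      <name>mapreduce.jobhistory.address</name>
[*]      <value>nameNode:10020</value>
[*]    </property>
[*]    <property>
[*]      <name>mapreduce.jobhistory.webapp.address</name>
[*]      <value>nameNode:19888</value>
[*]    </property>
[*]  </configuration>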
2.3.4.yarn-site.xml

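[*]  <configuration>
[*]    <property>
[*]      <name>yarn.nodemanager.aux-services</name>
[*]      <value>mapreduce_shuffle</value>
[*]    </property>
[*]    <property>
[*]      <name>yarn.resourcemanager.address</name>
[*]      <value>nameNode:8032</value>
[*]    </property>
[*]    <property>
[*]      <name>yarn.resourcemanager.scheduler.address</name>
[*]      <value>nameNode:8030</value>
[*]    </property>
[*]    <property>
[*]      <name>yarn.resourcemanager.resource-tracker.address</name>
[*]      <value>nameNode:8031</value>
[*]    </property>
[*]    <property>
[*]      <name>yarn.resourcemanager.admin.address</name>
[*]      <value>nameNode:8033</value>
[*]    </property>
[*]    <property>
[*]      <name>yarn.resourcemanager.webapp.address</name>
[*]      <value>nameNode:8088</value>
[*]    </property>
[*]  </configuration>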
2.3.5.slaves

[*]  dataNode1
[*]  dataNode2
2.3.6.Set JAVA_HOME
  Add the JAVA_HOME setting to both hadoop-env.sh and yarn-env.sh.
  vi hadoop-env.sh

[*]  export JAVA_HOME=/usr/java/jdk1.7.0_65
  vi yarn-env.sh

[*]  export JAVA_HOME=/usr/java/jdk1.7.0_65
2.4.Format the file system
  Format the file system (run from the hadoop-2.5.1 directory on nameNode):

[*]  bin/hdfs namenode -format
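
  The notes do not show how the configured installation reaches the dataNode nodes, yet each of them needs the same Hadoop directory and configuration before the services are started. The following is a minimal sketch under stated assumptions: root SSH access as set up in 2.1.3, the same parent directory on every node, and a JDK already installed there. sync_hadoop_to_slaves is a hypothetical helper name, and the paths in the usage line are placeholders.

```shell
# Hypothetical helper: copy a configured Hadoop directory to every host
# listed in a slaves file, into the same parent directory as on this node.
sync_hadoop_to_slaves() {
    hadoop_dir="$1"      # the extracted hadoop-2.5.1 directory
    slaves_file="$2"     # the slaves file from section 2.3.5
    parent_dir=$(dirname "$hadoop_dir")
    while read -r host; do
        # Skip blank lines in the slaves file.
        [ -n "$host" ] || continue
        # Redirect stdin so scp cannot consume the rest of the slaves file.
        scp -r -q "$hadoop_dir" "root@$host:$parent_dir/" < /dev/null
    done < "$slaves_file"
}
# Usage (on nameNode):
#   sync_hadoop_to_slaves /path/to/hadoop-2.5.1 /path/to/hadoop-2.5.1/etc/hadoop/slaves
```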
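2.5.Starting and stopping services
  The services can now be started. The scripts below are in the hadoop-2.5.1/sbin directory.
2.5.1.Start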

[*]  # ./start-dfs.sh

[*]  # ./start-yarn.sh
2.5.2.Stop

[*]  # ./stop-dfs.sh

[*]  # ./stop-yarn.sh
3.Verification
3.1.Check the running processes

[*]  # jps

[*]  7854 Jps
[*]  7594 ResourceManager
[*]  7357 NameNode
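
  Reading the jps listing by eye works for three nodes, but the same check can be scripted. The following is a small sketch, not from the original notes; missing_daemons is a hypothetical helper name. It reads jps output on standard input and prints every expected daemon that is absent. On the dataNode nodes the expected daemons would normally be DataNode and NodeManager.

```shell
# Hypothetical helper: print each expected daemon name that does not
# appear in the `jps` output supplied on stdin.
missing_daemons() {
    jps_output=$(cat)
    for daemon in "$@"; do
        # The second column of jps output is the process name; -x matches it exactly.
        if ! printf '%s\n' "$jps_output" | awk '{print $2}' | grep -qx "$daemon"; then
            echo "$daemon"
        fi
    done
}
# Usage on nameNode:    jps | missing_daemons NameNode ResourceManager
# Usage on a dataNode:  jps | missing_daemons DataNode NodeManager
```

  Empty output means every named daemon was found.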
3.2.Access through a browser
  http://192.168.17.100:50070/
http://img.blog.csdn.net/20140922213112359?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvZ3JlZW5zdXJmZXI=/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center
  http://192.168.17.100:8088/
http://img.blog.csdn.net/20140922213309023?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQvZ3JlZW5zdXJmZXI=/font/5a6L5L2T/fontsize/400/fill/I0JBQkFCMA==/dissolve/70/gravity/Center
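
  On a minimal installation without a browser, the two web interfaces above can also be probed from the command line. The following is a rough sketch, assuming curl is installed (the notes install only wget); check_web_ui is a hypothetical helper name. A 200 or a redirect status suggests the interface is up, while 000 means no response arrived.

```shell
# Hypothetical helper: print the HTTP status code returned by each URL.
check_web_ui() {
    for url in "$@"; do
        # -m 5 caps each request at five seconds; -w prints only the status code.
        code=$(curl -s -o /dev/null -m 5 -w '%{http_code}' "$url")
        echo "$url -> HTTP $code"
    done
}
# Usage: check_web_ui http://192.168.17.100:50070/ http://192.168.17.100:8088/
```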
