安装nagios检测hadoop
Nagios是常用的系统监控工具,提供了很多基本服务的监控脚本,如HTTP,MYSQL等,同时具有不错的可扩展性,自己可定制针对特定参数的监控脚本以及报警的方式.我现在有三台机器:192.168.0.26 namenode2 主机
192.168.0.27 datanode1 从机
192.168.0.28 datanode2 从机
一.在机子上面安装nagios
1.先在主机上安装apache2 nagios3nagios-nrpe-plugin这三个,apahce2有的话就不用安装了
sudo apt-get install apache2 nagios3 nagios-nrpe-plugin
在装的过程中,会让你设置nagios报警邮件,选择【Internet Site】,使用SMTP协议发送邮件
要求输入邮件服务器的域名,由于不需要搭建邮件服务器,直接点击【OK】
设置nagios web登陆密码
登陆nagios,http://192.168.0.26/nagios3 账号/密码:nagiosadmin/123456
2.sudo apt-get install nagios3-doc可以在网页上看nagios的doc
3.在从机上面只需要安装nagios-nrpe-server
sudo apt-get install nagios-nrpe-server
4.启动nagios的命令:
sudo /etc/init.d/nagios3 start
5.cd /etc/nagios3
sudo nagios3 –v nagios.cfg
没有出现错误,一般在启动前,你想查看有没有错误信息,就可以用这个
二.配置检测其它机子
1.在/etc/nagios3/conf.d目录下创建要监控的主机文件hosts.cfg
sudo nano /etc/nagios3/conf.d/hosts.cfg,添加以下内容:
#add namenode2
define host{
use generic-host ; Name of host template to use
; This host definition will inherit all variables that are defined
; in (or inherited by) the linux-server host template definition.
host_name namenode2
alias namenode2
address 192.168.0.26
}
#add datanode1
define host{
use generic-host ; Name of host template to use
; This host definition will inherit all variables that are defined
; in (or inherited by) the linux-server host template definition.
host_name datanode1
alias datanode1
address 192.168.0.27
}
#add datanode2
define host{
use generic-host ; Name of host template to use
; This host definition will inherit all variables that are defined
; in (or inherited by) the linux-server host template definition.
host_name datanode2
alias datanode2
address 192.168.0.28
}
2.修改/etc/nagios3/conf.d目录下的hostgroups_nagios2.cfg文件,新增主机组,增加以下内容:
# Some generic hostgroup definitions
# A simple wildcard hostgroup
define hostgroup {
hostgroup_nameall
alias All Servers
members *
}
# A list of your Debian GNU/Linux servers
define hostgroup {
hostgroup_namedebian-servers
alias Debian GNU/Linux Servers
members namenode2,datanode1,datanode2
}
# A list of your web servers
define hostgroup {
hostgroup_namehttp-servers
alias HTTP servers
members namenode2,datanode1,datanode2
}
# A list of your ssh-accessible servers
define hostgroup {
hostgroup_namessh-servers
alias SSH servers
members namenode2,datanode1,datanode2
}
修改完这2个以后,重启nagios
sudo /etc/init.d/nagios3 restart
输入用http://192.168.0.26/nagios3输入nagiosadmin密码,就进入nagios监控画面。
页:
[1]