check_mk for nagios 笔记
all WATO config files:/usr/local/check_mk/etc/conf.d/wato
/usr/local/check_mk/etc/multisite.d/wato
Global settings tuning
http://s3.运维网.com/wyfs02/M00/12/5B/wKioL1MEXxrg8CbjAATyhnUoFNE735.jpg
http://s3.运维网.com/wyfs02/M02/12/76/wKioL1MIL3OCMpWPAAjZOAndAxs256.jpg
add host/service/contact to hostgroups/servicegroups/contactgroups
1. create hostgroups/servicegroups/contactgroups
http://s3.运维网.com/wyfs02/M02/12/43/wKioL1MAvrqzj67iAAeEmVMj8DI713.jpg
2. add host/service/contact to hostgroups/servicegroups/contactgroups
http://s3.运维网.com/wyfs02/M01/12/43/wKiom1MAvyOyUt12AAexgyTRbYk843.jpg
http://s3.运维网.com/wyfs02/M00/12/43/wKioL1MAvv7wal0kAAYPxo1Jf7Y679.jpg
3. for add contact to contactgroups
http://s3.运维网.com/wyfs02/M00/12/43/wKioL1MAv9GyrCeEAAWrvy1eE8k877.jpg
4. apply changes
http://s3.运维网.com/wyfs02/M01/12/43/wKiom1MAv3nChey5AAXvdQoh38g629.jpg
SNMP Checks config
1. on Cisco router/switch to set snmp community:
snmp-server community public ro
2. on Check_MK to check snmp manually:
snmpwalk -v 2c -c public cisco_router_ip
3. on Check_mk server WATO
http://s3.运维网.com/wyfs02/M02/12/63/wKiom1MFjiKTiBF5AAIfYPpucPw702.jpg
http://s3.运维网.com/wyfs02/M01/12/64/wKioL1MFjinCMt37AAVCOZsqWlA976.jpg
Notes: default community is "public", you can config individually in WATO
/usr/local/check_mk/bin/cmk -D cisco3745 to check it's config or in WATO
Add just ping check for devices
1. add hosts in WATO
http://s3.运维网.com/wyfs02/M02/12/63/wKioL1MFgG_wIcSLAAIWcOkwSU8400.jpg
http://s3.运维网.com/wyfs02/M01/12/62/wKiom1MFgNjQZWD8AALXXxOsqq0819.jpg
5. check it in WATO
http://s3.运维网.com/wyfs02/M02/12/5D/wKiom1MEdwuQxOanAAYIU71DFCA370.jpg
Host Tags
1. create host tags in WATO
http://s3.运维网.com/wyfs02/M01/12/63/wKioL1MFgYCSJFovAAIgU864QhM752.jpg
http://s3.运维网.com/wyfs02/M00/12/62/wKiom1MFgabxzjBgAASBMqSKRng264.jpg
http://s3.运维网.com/wyfs02/M02/12/62/wKiom1MFgaax8tW8AAUgmzyDgbA021.jpg
Monitoring of Logfiles with Logwatch
1. make sure to install logwatch rpm
rpm -ivh check_mk-agent-1.2.4-1.noarch.rpm
rpm -ivh check_mk-agent-logwatch-1.2.4-1.noarch.rpm
2. vi /etc/check_mk/logwatch.cfg
/var/log/messages
I OK # lines containing OK are ignored, priority from top to bottom
C Critical # lines containing Critical
W Warn # lines containing Warn
W Error # lines containing Error
W Fail # lines containing Fail
/var/log/maillog
I OK # lines containing OK are ignored, priority from top to bottom
C Critical # lines containing Critical
W Warn # lines containing Warn
W Error # lines containing Error
W Fail # lines containing Fail
/usr/local/nagios/var/nagios.log
I OK # lines containing OK are ignored, priority from top to bottom
C Critical # lines containing Critical
W Warn # lines containing Warn
W Error # lines containing Error
W Fail # lines containing Fail
Notes: it's case sensitive for messages, so you should define it carefully
3. You can also use WATO
/usr/local/check_mk/bin/cmk -I localhost
vi /usr/local/check_mk/var/autochecks/localhost.mk to remove unwanted item as you needed
/usr/local/check_mk/bin/cmk -R
4. you can test it with logger command
logger "this is a Error message"
http://s3.运维网.com/wyfs02/M01/12/6D/wKiom1MG0v-ToA9kAARKcdcI-2U919.jpg
Monitoring MySQL
/usr/local/check_mk/bin/cmk -d localhost | head
PluginsDirectory: /usr/lib/check_mk_agent/plugins
LocalDirectory: /usr/lib/check_mk_agent/local
SpoolDirectory: /etc/check_mk/spool
AgentDirectory: /etc/check_mk
1. cp -a /usr/local/check_mk/share/agents/plugins/mk_mysql /usr/lib/check_mk_agent/plugins/
chmod 755 /usr/lib/check_mk_agent/plugins/*
2. You can also use WATO
/usr/local/check_mk/bin/cmk -I localhost
vi /usr/local/check_mk/var/autochecks/localhost.mk to remove unwanted item as you needed
/usr/local/check_mk/bin/cmk -R
http://s3.运维网.com/wyfs02/M01/12/77/wKiom1MIQ4agE_JWAAWjpx0XtJ0008.jpg
Monitoring any linux job
1. please refer http://hj192837.blog.运维网.com/655995/1352370 for testing script
2. vi /etc/cron.d/check-cisco
MAILTO=
0 10,17 * * * root mk-job ping-cisco-job /tmp/pingtest.sh
service crond restart
3. after the first run of the job the only thing to do is to run a service inventory on the host and restart your monitoring process.
You can also use WATO
/usr/local/check_mk/bin/cmk -I localhost
vi /usr/local/check_mk/var/autochecks/localhost.mk to remove unwanted item as you needed
/usr/local/check_mk/bin/cmk -R
http://s3.运维网.com/wyfs02/M02/12/78/wKiom1MIfkWySkHqAARYzKh6Npk345.jpg
Setup Distributed Monitoring
1. on distributed monitored check_mk server
cp -a /usr/local/check_mk/bin/unixcat /usr/bin
vi /etc/xinetd.d/livestatus
service livestatus
{
type = UNLISTED
port = 6557
socket_type = stream
protocol = tcp
wait = no
# limit to 100 connections per second. Disable 3 secs if above.
cps = 100 3
# set the number of maximum allowed parallel instances of unixcat.
# Please make sure that this values is at least as high as
# the number of threads defined with num_client_threads in
# etc/mk-livestatus/nagios.cfg
instances = 500
# limit the maximum number of simultaneous connections from
# one source IP address
per_source = 250
# Disable TCP delay, makes connection more responsive
flags = NODELAY
user = nagios
server = /usr/bin/unixcat
server_args = /usr/local/nagios/var/rw/live
# configure the IP address(es) of your Nagios server here:
# only_from = 127.0.0.1 10.0.20.1 10.0.20.2
disable = no
}
2. service xinetd restart
3. on distributed monitoring check_mk server
http://s3.运维网.com/wyfs02/M00/12/9A/wKiom1MLB4vwe8QfAASZ0HssnSw586.jpg
http://s3.运维网.com/wyfs02/M01/12/AF/wKioL1MMGiXhs6kWAAPDrK0S4E0317.jpg
4. now it's ok, only works for Perf-O-Meter column for PNP graph in current version
http://s3.运维网.com/wyfs02/M01/12/AF/wKioL1MMGnPSuoJfAAKFVYCSM1k226.jpg
5. using mod_proxy to get PNP graph -- undergoing
Setup Legacy Checks (no agent or SNMP on client)
1. on Nagios check_mk server
vi /etc/hosts
192.168.1.11 linuxtestclient
vi /usr/local/check_mk/etc/main.mk
all_hosts = [ 'localhost','linuxtestclient' ]
#Legacy Checks
extra_nagios_conf += r"""
# You can get command definition from nagios command.cfg
define command {
command_name check_ssh
command_line $USER1$/check_ssh $ARG1$ $HOSTADDRESS$
}
"""
legacy_checks = [
# command_name!argument, service description, pnp enable, hostname
( ( "check_ssh", "SSH Status", True), [ "linuxtestclient" ] ),
]
2. /usr/local/check_mk/bin/cmk -R
http://s3.运维网.com/wyfs02/M00/16/5B/wKiom1MRW5mgGNblAACZWzW6zHw725.jpg
Setup MRPE on client with agent
1. on client
vi /etc/check_mk/mrpe.cfg
#service description, command line for calling the plugin
LOAD /usr/local/nagios/libexec/check_load -w 5.0,4.0,3.0 -c 10.0,6.0,4.0
2. on Nagios check_mk server
vi /etc/hosts
192.168.1.11 linuxtestclient
vi /usr/local/check_mk/etc/main.mk
all_hosts = [ 'localhost','linuxtestclient' ]
3. /usr/local/check_mk/bin/cmk -I --checks=mrpe linuxtestclient
/usr/local/check_mk/bin/cmk -R
http://s3.运维网.com/wyfs02/M00/16/66/wKiom1MRYdaheQWxAADIfwLfohM724.jpg
使用check_mk MRPE监控percona xtradb cluster
on pxc node:
rpm -ivh percona-nagios-plugins-1.1.4-1.noarch.rpm
yum -y install xinetd
rpm -ivh check_mk-agent-1.2.4p5-1.noarch.rpm check_mk-agent-logwatch-1.2.4p5-1.noarch.rpm
mkdir /etc/nagios
vi /etc/nagios/mysql.cnf
user = root
password = s3cret
vi /etc/check_mk/mrpe.cfg
#service description, command line for calling the plugin
Threads_running /usr/lib64/nagios/plugins/pmp-check-mysql-status -x Threads_running -w 20 -c 40
页:
[1]