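Building a highly available MCollective setup for a more secure and stable Puppet architecture
Consider this scenario: once your Puppet environment is built on MCollective, you need to think about high availability for the message queue (MQ); otherwise, when the MQ goes down, you can no longer push anything with the mco command. There are two ways to make the MQ highly available:
Method 1: cluster two MQ servers, keep them in sync by replicating queue information, and let the nodes reach them through a floating IP.
Method 2: keep two independent MQ servers and do the failover on the MCollective server side, using the rabbitmq connector's plugin parameters, which control automatic detection, reconnect timing and so on.
The configuration below follows method 2.
I. Configure RabbitMQ
Installation is skipped here; see http://rsyslog.org/2013/11/10/mcollective-middleware/
1. Enable the rabbitmq_stomp plugin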
# rabbitmq-plugins enable rabbitmq_stomp
The following plugins have been enabled:
rabbitmq_stomp
Plugin configuration has changed. Restart RabbitMQ for changes to take effect.
2. Add the TCP listener port (or port range)
# vim /etc/rabbitmq/rabbitmq.config
[
{rabbitmq_stomp, [{tcp_listeners, [61613]}]}
].
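Note: see http://www.rabbitmq.com/stomp.html
3. Create accounts and set permissions
If RabbitMQ was configured before, it is best to reset it first. Be aware that rabbitmqctl reset wipes the node's existing users, vhosts and queues.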
# rabbitmqctl stop_app
Stopping node rabbit@linuxmaster1poc ...
...done.
# rabbitmqctl reset
Resetting node rabbit@linuxmaster1poc ...
...done.
# rabbitmqctl start_app
Starting node rabbit@linuxmaster1poc ...
...done.
Delete the default user guest and add three users (web_admin for HTTP access, admin as the administrator, mc_rabbitmq for the MCollective connection)
# rabbitmqctl list_users
Listing users ...
guest
...done.
# rabbitmqctl delete_user guest
Deleting user "guest" ...
...done.
# rabbitmqctl add_user mc_rabbitmq 123.com
Creating user "mc_rabbitmq" ...
...done.
# rabbitmqctl add_user admin 123.com
Creating user "admin" ...
...done.
# rabbitmqctl add_user web_admin 123.com
Creating user "web_admin" ...
...done.
Set the users' roles (tags)
# rabbitmqctl set_user_tags admin administrator
Setting tags for user "admin" to ...
...done.
# rabbitmqctl set_user_tags web_admin monitoring
Setting tags for user "web_admin" to ...
...done.
Create the virtual host
# rabbitmqctl add_vhost /mcollective
Creating vhost "/mcollective" ...
...done.
Grant the users permissions on the virtual host
# rabbitmqctl set_permissions -p "/mcollective" mc_rabbitmq ".*" ".*" ".*"
Setting permissions for user "mc_rabbitmq" in vhost "/mcollective" ...
...done.
# rabbitmqctl set_permissions -p "/mcollective" admin ".*" ".*" ".*"
Setting permissions for user "admin" in vhost "/mcollective" ...
...done.
# rabbitmqctl set_permissions -p "/mcollective" web_admin ".*" ".*" ".*"
Setting permissions for user "web_admin" in vhost "/mcollective" ...
...done.
Restart the rabbitmq-server service
# /etc/init.d/rabbitmq-server restart
Restarting rabbitmq-server: SUCCESS
rabbitmq-server.
Check that the users and their roles were created successfully
# rabbitmqctl list_users
Listing users ...
admin [administrator]
mc_rabbitmq []
web_admin [monitoring]
...done.
List the permissions of all users in the virtual host "/mcollective"
# rabbitmqctl list_permissions -p "/mcollective"
Listing permissions in vhost "/mcollective" ...
admin .* .* .*
mc_rabbitmq .* .* .*
web_admin .* .* .*
...done.
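The account, tag, vhost and permission commands above can be collected into one function so that a second broker gets exactly the same setup. This is only a sketch: the function name setup_mcollective_broker and the RABBITMQCTL override are hypothetical conveniences (not rabbitmqctl features), and the password is passed in as an argument instead of being hard-coded. It uses only the rabbitmqctl subcommands already shown in this step.

```bash
#!/usr/bin/env bash
# Sketch: replay the user/vhost/permission steps from this section.
# RABBITMQCTL is a hypothetical override so the sequence can be previewed
# with RABBITMQCTL=echo before it is run against a real broker.

setup_mcollective_broker() {
    local ctl="${RABBITMQCTL:-rabbitmqctl}"
    local password="$1"        # placeholder: choose your own password
    local vhost="/mcollective"
    local user

    # Remove the default account, then create the three accounts.
    "$ctl" delete_user guest
    for user in mc_rabbitmq admin web_admin; do
        "$ctl" add_user "$user" "$password"
    done

    # Roles (tags) as used in this article.
    "$ctl" set_user_tags admin administrator
    "$ctl" set_user_tags web_admin monitoring

    # Virtual host plus full configure/write/read permissions for each user.
    "$ctl" add_vhost "$vhost"
    for user in mc_rabbitmq admin web_admin; do
        "$ctl" set_permissions -p "$vhost" "$user" ".*" ".*" ".*"
    done
}

# Preview only, nothing is changed on the broker:
#   RABBITMQCTL=echo setup_mcollective_broker 'change-me'
```

Running the preview first prints the ten rabbitmqctl invocations in order, which makes it easy to compare them with the transcript above before applying them to MQ2.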
4. Log in to http://192.168.100.120:15672/ and set up the exchanges for the virtual host "/mcollective"
Default configuration:
# rabbitmqctl list_exchanges -p "/mcollective"
Listing exchanges ...
direct
amq.direct direct
amq.fanout fanout
amq.headers headers
amq.match headers
amq.rabbitmq.trace topic
amq.topic topic
...done.
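Because step 4 is done by hand in the web UI, it is easy to miss an exchange or pick the wrong type. Once the exchanges have been created, the listing should contain mcollective_broadcast (type topic) and mcollective_directed (type direct). A minimal sketch of a check that reads the list_exchanges output on standard input; the function name check_mcollective_exchanges is hypothetical.

```bash
#!/usr/bin/env bash
# Sketch: verify that the two exchanges MCollective uses are present.
# Reads `rabbitmqctl list_exchanges -p /mcollective` output on stdin.

check_mcollective_exchanges() {
    awk '
        $1 == "mcollective_broadcast" && $2 == "topic"  { broadcast = 1 }
        $1 == "mcollective_directed"  && $2 == "direct" { directed = 1 }
        END {
            if (!broadcast) print "missing: mcollective_broadcast (topic)"
            if (!directed)  print "missing: mcollective_directed (direct)"
            # Exit status 0 only when both exchanges were found.
            exit !(broadcast && directed)
        }
    '
}

# Usage on a broker:
#   rabbitmqctl list_exchanges -p /mcollective | check_mcollective_exchanges
```

The function prints nothing and returns 0 when both exchanges exist, so it can be dropped into a provisioning script as a guard.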
Exchange settings: create two exchanges in the "/mcollective" vhost, mcollective_broadcast (type topic) and mcollective_directed (type direct). After that the listing looks like this:
# rabbitmqctl list_exchanges -p "/mcollective"
Listing exchanges ...
 direct
amq.direct direct
amq.fanout fanout
amq.headers headers
amq.match headers
amq.rabbitmq.trace topic
amq.topic topic
mcollective_broadcast topic
mcollective_directed direct
...done.
Note: see the official reference at https://www.rabbitmq.com/man/rabbitmqctl.1.man.html
II. Configure MCollective
1. Configure the mcollective client side
# cat /etc/mcollective/client.cfg
topicprefix = /topic/
main_collective = mcollective
collectives = mcollective
libdir = /usr/libexec/mcollective
logger_type = console
#loglevel = debug
loglevel = warn
# Plugins
securityprovider = psk
plugin.psk = a36cd839414370e10fd281b8a38a4f48
direct_addressing = 1
connector = rabbitmq
# virtual host
plugin.rabbitmq.vhost = /mcollective
# the pool contains two MQ servers
plugin.rabbitmq.pool.size = 2
plugin.rabbitmq.initial_reconnect_delay = 0.01
# maximum delay between reconnect attempts
plugin.rabbitmq.max_reconnect_delay = 30.0
plugin.rabbitmq.use_exponential_back_off = true
plugin.rabbitmq.back_off_multiplier = 2
plugin.rabbitmq.max_reconnect_attempts = 0
plugin.rabbitmq.randomize = false
plugin.rabbitmq.timeout = -1
plugin.rabbitmq.pool.1.host = 192.168.100.120
plugin.rabbitmq.pool.1.port = 61613
plugin.rabbitmq.pool.1.user = mc_rabbitmq
plugin.rabbitmq.pool.1.password = 123.com
plugin.rabbitmq.pool.1.ssl = false
plugin.rabbitmq.pool.2.host = 192.168.100.121
plugin.rabbitmq.pool.2.port = 61613
plugin.rabbitmq.pool.2.user = mc_rabbitmq
plugin.rabbitmq.pool.2.password = 123.com
plugin.rabbitmq.pool.2.ssl = false
# Facts
factsource = yaml
plugin.yaml = /etc/mcollective/facts.yaml
2. Configure the mcollective server side
# cat /etc/mcollective/server.cfg
# --Global--
topicprefix = /topic/
main_collective = mcollective
collectives = mcollective
libdir = /usr/libexec/mcollective
logfile = /var/log/puppet/mcollective.log
loglevel = info
daemonize = 1
# --rabbitmq Plugins--
securityprovider = psk
plugin.psk = a36cd839414370e10fd281b8a38a4f48
direct_addressing = 1
connector = rabbitmq
plugin.rabbitmq.vhost = /mcollective
plugin.rabbitmq.pool.size = 2
plugin.rabbitmq.initial_reconnect_delay = 0.01
plugin.rabbitmq.max_reconnect_delay = 30.0
plugin.rabbitmq.use_exponential_back_off = true
plugin.rabbitmq.back_off_multiplier = 2
plugin.rabbitmq.max_reconnect_attempts = 0
plugin.rabbitmq.randomize = false
plugin.rabbitmq.timeout = -1
plugin.rabbitmq.pool.1.host = 192.168.100.120
plugin.rabbitmq.pool.1.port = 61613
plugin.rabbitmq.pool.1.user = mc_rabbitmq
plugin.rabbitmq.pool.1.password = 123.com
plugin.rabbitmq.pool.1.ssl = false
plugin.rabbitmq.pool.2.host = 192.168.100.121
plugin.rabbitmq.pool.2.port = 61613
plugin.rabbitmq.pool.2.user = mc_rabbitmq
plugin.rabbitmq.pool.2.password = 123.com
plugin.rabbitmq.pool.2.ssl = false
# --Puppet provider specific options--
plugin.service.provider = puppet
plugin.service.puppet.hasstatus = true
plugin.service.puppet.hasrestart = true
plugin.puppet.command = puppet agent
plugin.puppet.splay = true
plugin.puppet.splaylimit = 30
plugin.puppet.config = /etc/puppet/puppet.conf
# --Facts--
factsource = yaml
##factsource = facter
plugin.yaml = /etc/mcollective/facts.yaml
III. High-availability test
Important: the pool entries in each node's server.cfg have a priority, and by default the entry with the lower number is used first. This matters in practice: if all nodes are currently connected to MQ2 and MQ1 is then started, the mco command cannot reach them, because mco connects to MQ1 when it runs while all the nodes are still connected to MQ2.
1. Stop MQ1 and check the failover
1.1 First look at the current connection state of the nodes
List the connected nodes:
# mco ping
linux57poc time=69.46 ms
linux58poc time=70.05 ms
linux64poc time=70.59 ms
---- ping statistics ----
3 replies max: 70.59 min: 69.46 avg: 70.03
Check the connections on port 61613 on all nodes; at this point they are all connected to linuxmaster1poc:
# mco shell "lsof -i:61613"
Do you really want to send this command unfiltered? (y/n): y
Discovering hosts using the mc method for 2 second(s) .... 3
Host: linux64poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 36625 root 6u IPv4 27771 0t0 TCP linux64poc:40493->linuxmaster1poc:61613 (ESTABLISHED)
Host: linux58poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 11060 root 6u IPv4 34046 0t0 TCP linux58poc:36295->linuxmaster1poc:61613 (ESTABLISHED)
Host: linux57poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME
ruby 18076 root 6u IPv4 1351365 TCP linux57poc:24698->linuxmaster1poc:61613 (ESTABLISHED)
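Reading the lsof output node by node gets tedious as the test goes on. The following sketch reduces it to one line per connection, showing the broker and the TCP state. The function names broker_states and active_broker are hypothetical, and the filter assumes only what the output above shows: that the last two columns are the connection ("local->broker:port") and the state in parentheses.

```bash
#!/usr/bin/env bash
# Sketch: summarise `lsof -i:61613` output read from stdin.

# Print "<broker> <state>" for every connection line.
broker_states() {
    awk '
        NF >= 2 && $NF ~ /^\(.*\)$/ && $(NF-1) ~ /->/ {
            split($(NF-1), ends, "->")   # ends[2] is "broker:port"
            split(ends[2], peer, ":")    # peer[1] is the broker host
            state = $NF
            gsub(/[()]/, "", state)      # "(ESTABLISHED)" -> "ESTABLISHED"
            print peer[1], state
        }
    '
}

# Print only the broker(s) with a live connection.
active_broker() {
    broker_states | awk '$2 == "ESTABLISHED" { print $1 }'
}

# Usage on a node:
#   lsof -i:61613 | broker_states
#   lsof -i:61613 | active_broker
```

Header lines and the "Host:" / "Statuscode:" lines printed by mco shell do not match the filter, so the mco output can be piped through it as well.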
Stop RabbitMQ on MQ1 (linuxmaster1poc):
# /etc/init.d/rabbitmq-server stop
Stopping rabbitmq-server: rabbitmq-server.
1.2 Run mco again and check the failover
# mco ping
linux58poc time=73.54 ms
linux64poc time=74.61 ms
linux57poc time=75.39 ms
---- ping statistics ----
3 replies max: 75.39 min: 73.54 avg: 74.51
# mco shell "lsof -i:61613"
Do you really want to send this command unfiltered? (y/n): y
Discovering hosts using the mc method for 2 second(s) .... 3
Host: linux58poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 11060 root 6u IPv4 34046 0t0 TCP linux58poc:36295->linuxmaster1poc:61613 (CLOSE_WAIT)
ruby 11060 root 9u IPv4 34137 0t0 TCP linux58poc:47200->linuxmaster2poc:61613 (ESTABLISHED)
Host: linux64poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 36625 root 6u IPv4 27771 0t0 TCP linux64poc:40493->linuxmaster1poc:61613 (CLOSE_WAIT)
ruby 36625 root 8u IPv4 27877 0t0 TCP linux64poc:37472->linuxmaster2poc:61613 (ESTABLISHED)
Host: linux57poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME
ruby 18076 root 9u IPv4 1351484 TCP linux57poc:9309->linuxmaster2poc:61613 (ESTABLISHED)
Check via the log:
# mco shell "lsof -i:61613"
Do you really want to send this command unfiltered? (y/n): y
Discovering hosts using the mc method for 2 second(s) .... 3
Host: linux58poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 11428 root 6u IPv4 34283 0t0 TCP linux58poc:36300->linuxmaster1poc:61613 (CLOSE_WAIT)
ruby 11428 root 8u IPv4 34338 0t0 TCP linux58poc:47205->linuxmaster2poc:61613 (ESTABLISHED)
Host: linux57poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME
ruby 18447 root 6u IPv4 1351559 TCP linux57poc:59343->linuxmaster1poc:61613 (CLOSE_WAIT)
ruby 18447 root 8u IPv4 1351622 TCP linux57poc:29757->linuxmaster2poc:61613 (ESTABLISHED)
Host: linux64poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 37054 root 4u IPv4 28036 0t0 TCP linux64poc:37476->linuxmaster2poc:61613 (ESTABLISHED)
ruby 37054 root 6u IPv4 27990 0t0 TCP linux64poc:40497->linuxmaster1poc:61613 (CLOSE_WAIT)
Summary: the previous connections to MQ1 have gone to CLOSE_WAIT and new connections to MQ2 have been established.
2. Stop MQ2, start MQ1 and check the failover
Stop RabbitMQ on MQ2 (linuxmaster2poc):
# /etc/init.d/rabbitmq-server stop
Stopping rabbitmq-server: rabbitmq-server.
On each node, both connections are now in CLOSE_WAIT:
# lsof -i:61613
COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME
ruby 18447 root 6u IPv4 1351559 TCP linux57poc:59343->linuxmaster1poc:61613 (CLOSE_WAIT)
ruby 18447 root 8u IPv4 1351622 TCP linux57poc:29757->linuxmaster2poc:61613 (CLOSE_WAIT)
# lsof -i:61613
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 11428 root 6u IPv4 34283 0t0 TCP linux58poc:36300->linuxmaster1poc:61613 (CLOSE_WAIT)
ruby 11428 root 8u IPv4 34338 0t0 TCP linux58poc:47205->linuxmaster2poc:61613 (CLOSE_WAIT)
# lsof -i:61613
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 37054 root 4u IPv4 28036 0t0 TCP linux64poc:37476->linuxmaster2poc:61613 (CLOSE_WAIT)
ruby 37054 root 6u IPv4 27990 0t0 TCP linux64poc:40497->linuxmaster1poc:61613 (CLOSE_WAIT)
Start RabbitMQ on MQ1 (linuxmaster1poc):
# /etc/init.d/rabbitmq-server start
Starting rabbitmq-server: SUCCESS
rabbitmq-server.
Because plugin.rabbitmq.max_reconnect_delay = 30.0, the wait between reconnect attempts is capped at 30 seconds, so the mcollective servers re-establish their connections within at most about 30 seconds:
# tailf /var/log/rabbitmq/rabbit\@linuxmaster1poc.log
=INFO REPORT==== 24-Dec-2013::11:00:45 ===
accepting STOMP connection <0.332.0> (192.168.100.126:36316 -> 192.168.100.120:61613)
=INFO REPORT==== 24-Dec-2013::11:00:45 ===
accepting STOMP connection <0.348.0> (192.168.100.125:18945 -> 192.168.100.120:61613)
=INFO REPORT==== 24-Dec-2013::11:00:45 ===
accepting STOMP connection <0.382.0> (192.168.100.127:40513 -> 192.168.100.120:61613)
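To see where the 30-second figure comes from, the reconnect settings can be worked through by hand. The sketch below assumes a simple model: each retry waits the previous delay multiplied by back_off_multiplier, capped at max_reconnect_delay. That model is an assumption about how the connector applies these settings, not a copy of its implementation, and the function name backoff_delays is hypothetical.

```bash
#!/usr/bin/env bash
# Sketch: print the delay before each reconnect attempt under a simple
# exponential back-off model (assumed, see the text above).
# Arguments: initial delay, multiplier, maximum delay, number of attempts.

backoff_delays() {
    LC_ALL=C awk -v d="$1" -v m="$2" -v max="$3" -v n="$4" 'BEGIN {
        d = d + 0; m = m + 0; max = max + 0   # force numeric values
        for (i = 1; i <= n; i++) {
            if (d > max) d = max              # never wait longer than the cap
            printf "%.2f\n", d
            d = d * m                         # grow the delay for the next try
        }
    }'
}

# Values from server.cfg in this article:
#   initial_reconnect_delay = 0.01, back_off_multiplier = 2,
#   max_reconnect_delay = 30.0
#   backoff_delays 0.01 2 30.0 13
```

With the values used here the delay doubles from 0.01 s up to 20.48 s on the 12th attempt and hits the 30 s cap on the 13th, after which every further attempt waits 30 s. A broker that comes back after a long outage is therefore picked up within roughly 30 seconds under this model.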
# mco ping
linux58poc time=70.60 ms
linux57poc time=71.32 ms
linux64poc time=111.56 ms
---- ping statistics ----
3 replies max: 111.56 min: 70.60 avg: 84.49
# mco shell "lsof -i:61613"
Do you really want to send this command unfiltered? (y/n): y
Discovering hosts using the mc method for 2 second(s) .... 3
Host: linux58poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 11428 root 6u IPv4 34283 0t0 TCP linux58poc:36300->linuxmaster1poc:61613 (CLOSE_WAIT)
ruby 11428 root 8u IPv4 34338 0t0 TCP linux58poc:47205->linuxmaster2poc:61613 (CLOSE_WAIT)
ruby 11428 root 10u IPv4 34444 0t0 TCP linux58poc:36316->linuxmaster1poc:61613 (ESTABLISHED)
Host: linux57poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME
ruby 18447 root 10u IPv4 1351723 TCP linux57poc:18945->linuxmaster1poc:61613 (ESTABLISHED)
Host: linux64poc
Statuscode: 0
Output:
COMMAND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
ruby 37054 root 4u IPv4 28036 0t0 TCP linux64poc:37476->linuxmaster2poc:61613 (CLOSE_WAIT)
ruby 37054 root 6u IPv4 27990 0t0 TCP linux64poc:40497->linuxmaster1poc:61613 (CLOSE_WAIT)
ruby 37054 root 9u IPv4 28206 0t0 TCP linux64poc:40513->linuxmaster1poc:61613 (ESTABLISHED)
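As a final check: since the pool entries are tried in numeric order (see the note at the start of the test section), it can help to confirm that client.cfg and server.cfg list the brokers in the same order. That both files should agree is an inference from the note above, not something the connector enforces. A minimal sketch; the function names pool_hosts and same_pool_order are hypothetical.

```bash
#!/usr/bin/env bash
# Sketch: compare the broker order in two mcollective config files.

# Print "<index> <host>" for every plugin.rabbitmq.pool.N.host line,
# sorted by the pool index N.
pool_hosts() {
    sed -n 's/^[[:space:]]*plugin\.rabbitmq\.pool\.\([0-9][0-9]*\)\.host[[:space:]]*=[[:space:]]*\([^[:space:]#]*\).*$/\1 \2/p' "$1" | sort -n
}

# Succeed only if both files list the same hosts under the same indexes.
same_pool_order() {
    [ "$(pool_hosts "$1")" = "$(pool_hosts "$2")" ]
}

# Usage:
#   pool_hosts /etc/mcollective/server.cfg
#   same_pool_order /etc/mcollective/client.cfg /etc/mcollective/server.cfg \
#       && echo "pool order matches"
```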