CentOS 6.2+Nginx+Nagios,手機短信和qq郵箱提醒php
注:192.168.0.21 服務端 mysql
192.168.0.22 客戶端 linux
環境:兩臺centos6.0 64位系統,都已經搭建好了源碼的lnmp平臺ios
結尾附上所需的軟件包nginx
1.nagios安裝(中文版)c++
tar xvf nagios-cn-3.2.3.tar.bz2 cd nagios-cn-3.2.3 useradd -m -s /bin/bash nagios usermod -a -G nagcmd nagios ./configure --prefix=/usr/local/nagios --with-command-group=nagcmd make make all make install make install-init # 生成init啓動腳本 make install-config # 安裝示例配置文件 make install-commandmode # 設置相應的目錄權限 chmod o+rwx /usr/local/nagios/var/rw
2.nagios-plugins安裝sql
wget http://prdownloads.sourceforge.net/sourceforge/nagiosplug/nagios-plugins tar zxvf nagios-plugins-1.4.16.tar.gz cd nagios-plugins-1.4.16 yum install make apr* autoconf automake curl curl-devel gcc gcc-c++ zlib-devel \ openssl openssl-devel pcre-devel gd gd-devel kernel keyutils patch perl perl-devel \ kernel keyutils kernel-headers compat* mpfr cpp glibc libgomp libstdc++-devel ppl \ cloog-ppl keyutils-libs-devel libcom_err-devel libsepol-devel libselinux-devel \ krb5-devel zlib-devel libXpm* freetype libjpeg* libpng* php-common php-gd ncurses* libtool* libxml2 libxml2-devel patch -y
./configure --prefix=/usr/local/nagios --with-mysql=/home/mysql/ make make install
3.nrpe安裝數據庫
tar xzvf nrpe-2.12.tar.gz cd nrpe-2.12 ./configure make ./configure make all make install-plugin make install-daemon make install-daemon-config \cp src/check_nrpe /usr/local/nagios/libexec/ /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d echo '/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d' >> /etc/rc.local
要重啓nrpe進行就先殺掉進行,而後重啓 kill `ps aux |grep nrpe |grep -v grep |awk '{print $2}'` /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d 本機測試下: /usr/local/nagios/libexec/check_nrpe -H localhost -c check_users
加入系統服務
vim
加入系統服務並設爲開機自動 chkconfig --add nagios chkconfig nagios on chown nagios.nagios /usr/local/nagios/var/rw # 測試配置文件是否正確 /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg
添加別名命令,方便測試配置文件centos
vi ~/.bashrc 在裏面用alias 來自定義一個命令來代替,這裏我用check alias check='/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg' source ~/.bashrc 此時能夠用check命令來檢測配置文件了
修改聯繫人郵箱,修改成用於報警接收的郵件地址
vi /usr/local/nagios/etc/objects/contacts.cfg ############################################################################### # CONTACTS.CFG - SAMPLE CONTACT/CONTACTGROUP DEFINITIONS # # Last Modified: 05-31-2007 # # NOTES: This config file provides you with some example contact and contact # group definitions that you can reference in host and service # definitions. # # You don't need to keep these definitions in a separate file from your # other object definitions. This has been done just to make things # easier to understand. # ############################################################################### ############################################################################### ############################################################################### # # CONTACTS # ############################################################################### ############################################################################### # Just one contact defined by default - the Nagios admin (that's you) # This contact definition inherits a lot of default values from the 'generic-contact' # template which is defined elsewhere. define contact{ contact_name nagiosadmin ; Short name of user use generic-contact ; Inherit default values from generic-contact template (defined above) alias Nagios Admin ; Full name of user email nagios@localhost ; <<***** CHANGE THIS TO YOUR EMAIL ADDRESS ****** } ############################################################################### ############################################################################### # # CONTACT GROUPS # ############################################################################### ############################################################################### # We only have one contact in this simple configuration file, so there is # no need to create more than one contact group. define contactgroup{ contactgroup_name admins alias Nagios Administrators members nagiosadmin } 定義check_nrpe命令 vi /usr/local/nagios/etc/objects/commands.cfg define command{ command_name check_nrpe command_line /usr/local/nagios/libexec/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ }
檢測配置文件是否有誤
check
nginx 配置,Nginx fastcgi perl (pl、cgi)支持 安裝FCGI模塊 cd tar zxvf FCGI-0.70.tar.gz cd FCGI-0.70 perl Makefile.PL make make install cd 安裝 IO 和 IO::ALL模塊 tar zxvf IO-1.25.tar.gz cd IO-1.25 perl Makefile.PL make make install cd tar zxvf IO-All-0.41.tar.gz cd IO-All-0.41 perl Makefile.PL make make install cd unzip perl-fcgi.zip cp perl-fcgi.pl /usr/local/nginx/ chmod 755 /usr/local/nginx/perl-fcgi.pl
vi /usr/local/nginx/start_perl_cgi.sh #!/bin/bash #set -x dir=/usr/local/nginx/ stop () { #pkill -f $dir/perl-fcgi.pl kill $(cat $dir/logs/perl-fcgi.pid) rm $dir/logs/perl-fcgi.pid 2>/dev/null rm $dir/logs/perl-fcgi.sock 2>/dev/null echo "stop perl-fcgi done" } start () { rm $dir/now_start_perl_fcgi.sh 2>/dev/null chown nobody.root $dir/logs echo "$dir/perl-fcgi.pl -l $dir/logs/perl-fcgi.log -pid $dir/logs/perl-fcgi.pid -S $dir/logs/perl-fcgi.sock" >>$dir/now_start_perl_fcgi.sh chown nobody.nobody $dir/now_start_perl_fcgi.sh chmod u+x $dir/now_start_perl_fcgi.sh sudo -u nobody $dir/now_start_perl_fcgi.sh echo "start perl-fcgi done" } case $1 in stop) stop ;; start) start ;; restart) stop start ;; esac
把start_perl_cgi.sh文件中的nobody所有用nagios替換,nginx 目錄上的用戶
sed -i 's@nobody@nagios@g' /usr/local/nginx/start_perl_cgi.sh chmod 755 /usr/local/nginx/start_perl_cgi.sh /usr/local/nginx/start_perl_cgi.sh start
# 取消用戶認證(方便調試) vi /usr/local/nagios/etc/cgi.cfg 找到use_authentication=1並把值改成0 修改聯繫人郵箱,修改成用於報警接收的郵件地址 vi /usr/local/nagios/etc/objects/contacts.cfg
到這一步就是正常的
下面nginx 配置
我把監聽改爲80的了
而後開啓服務
就能夠訪問了,而後繼續安裝客戶端,最後給你們截圖看效果
service nagios start
nagios被控端安裝
yum install openssl-devel -y 1. nagios-plugins安裝 groupadd nagios useradd nagios -M -s /sbin/nologin -g nagios tar xvf nagios-plugins-1.4.16.tar.gz cd nagios-plugins-1.4.16 ./configure --prefix=/usr/local/nagios --with-nagios-user=nagios --with-nagios-gourp=nagios --with-mysql=/usr/local/mysql && make && make install cd 2. nrpe安裝 tar zxvf nrpe-2.13.tar.gz cd nrpe-2.13 ./configure make all make install-plugin make install-daemon make install-daemon-config
啓動nrpe /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d echo '/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d' >> /etc/rc.local
監控服務端本機:本身監控本身不須要配置nrpe,服務端的nrpe只用於獲取客戶端的nrpe傳送過來的數據,在這裏由於中文版的nagios已經默認有些配置,等會兒修改下直接用了
監控客戶端:監控的服務有:mysql、nginx、memory、ip鏈接數、僵死的進程、磁盤空間、磁盤IO、登陸用戶數、進程總數、cpu負載、PING、SSH
unzip libexec.zip \cp libexec/* /usr/local/nagios/libexec chmod -R +x /usr/local/nagios/libexec
裝插件
建立一個空的數據庫nagios,受權nagios這個用戶從任何地方訪問nagios這個數據庫,刷新受權設置,查詢下nagios這個用戶是否建立成功 create database nagios; grant select on nagios.* to nagios @'%' identified by '123456'; flush privileges; select User,Password,Host from mysql.user;
添加mysql庫到系統搜索庫 vim /etc/ld.so.conf /usr/local/mysql/lib ldconfig 要監控磁盤io,還得安裝sysstat這個工具包 yum install sysstat -y 配置客戶端上面的nrpe vim /usr/local/nagios/etc/nrpe.cfg
配置客戶端上面的nrpe vim /usr/local/nagios/etc/nrpe.cfg command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10 command[check_load]=/usr/local/nagios/libexec/check_cpu.sh -w 80% -c 90% command[check_sda1]=/usr/local/nagios/libexec/check_disk -w 20% -c 10% -p /dev/sda1 command[check_sda2]=/usr/local/nagios/libexec/check_disk -w 20% -c 10% -p /dev/sda2 command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200 command[check_swap]=/usr/local/nagios/libexec/check_swap -w 20% -c 10% command[check_iostat]=/usr/local/nagios/libexec/check_iostat.sh -d sda -w 6 -c 10 command[check_mysql]=/usr/local/nagios/libexec/check_mysql -H 192.168.0.22 -u nagios -p 123456 -d nagios command[check_nginx]=/usr/local/nagios/libexec/check_nginx.sh -u 192.168.0.22 -p /status -w 4000 -c 5000 command[check_mem]=/usr/local/nagios/libexec/check_memory.pl -f -w 20 -c 10 command[check_ip_conn]=/usr/local/nagios/libexec/ip_conn.sh 200 250 command[check_ssh]=/usr/local/nagios/libexec/check_tcp -p 22 -w 1.0 -c 10.0 配置完成後,重啓nrpe kill `ps aux |grep nrpe |grep -v grep |awk '{print $2}'` /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d 服務端配置: 監控服務端本機的配置: vim /usr/local/nagios/etc/objects/localhost.cfg 修改裏面的配置,最後修改完成的配置以下 define host{ use linux-server host_name localhost alias localhost address 127.0.0.1 icon_p_w_picpath server.gif statusmap_p_w_picpath server.gd2 2d_coords 500,200 3d_coords 500,200,100 } define hostgroup{ hostgroup_name linux-servers ; The name of the hostgroup alias Linux Servers ; Long name of the group members * ; Comma separated list of hosts that belong to this group } define servicegroup{ servicegroup_name 所有聯通性檢查 alias 聯通性檢查 members localhost,PING,nagios-client,PING } define service{ use local-service ; Name of service template to use host_name * service_description PING check_command check_ping!100.0,20%!500.0,60% } define service{ use local-service ; Name of service template to use host_name localhost service_description 根分區 check_command check_local_disk!20%!10%!/ } define service{ use local-service ; Name of service template to use host_name localhost service_description 登陸用戶數 check_command check_local_users!20!50 } define service{ use local-service ; Name of service template to use host_name localhost service_description 進程總數 check_command check_local_procs!250!400!RSZDT } define service{ use local-service ; Name of service template to use host_name localhost service_description 系統負荷 check_command check_local_load!5.0,4.0,3.0!10.0,6.0,4.0 } define service{ use local-service ; Name of service template to use host_name localhost service_description 交換空間利用率 check_command check_local_swap!20!10 } define service{ use local-service ; Name of service template to use host_name localhost service_description SSH check_command check_tcp!22!1.0!10.0 notifications_enabled 0 } 服務器監控客戶端的配置: 保存退出後複製這個文件一份,做爲nagios-client的監控模版文件 cp /usr/local/nagios/etc/objects/localhost.cfg /usr/local/nagios/etc/objects/nagios-client.cfg vim /usr/local/nagios/etc/objects/nagios-client.cfg 修改完成後的配置以下 define host{ use linux-server host_name nagios-client alias nagios-client address 192.168.0.22 icon_p_w_picpath server.gif statusmap_p_w_picpath server.gd2 2d_coords 500,200 3d_coords 500,200,100 } define service{ use local-service ; Name of service template to use host_name * service_description PING check_command check_ping!100.0,20%!500.0,60% } define service{ use local-service ; Name of service template to use host_name nagios-client service_description boot分區 check_command check_nrpe!check_sda1 } define service{ use local-service ; Name of service template to use host_name nagios-client service_description 根分區 check_command check_nrpe!check_sda2 } define service{ use local-service ; Name of service template to use host_name nagios-client service_description 登陸用戶數 check_command check_nrpe!check_users } define service{ use local-service ; Name of service template to use host_name nagios-client service_description 進總程數 check_command check_nrpe!check_total_procs } define service{ use local-service ; Name of service template to use host_name nagios-client service_description CPU平均負載 check_command check_nrpe!check_load } define service{ use local-service ; Name of service template to use host_name nagios-client service_description 虛擬內存 check_command check_nrpe!check_swap } define service{ use local-service ; Name of service template to use host_name nagios-client service_description SSH check_command check_nrpe!check_ssh notifications_enabled 0 } define service{ use local-service ; Name of service template to use host_name nagios-client service_description 僵死進程數 check_command check_nrpe!check_zombie_procs } define service{ use local-service ; Name of service template to use host_name nagios-client service_description iostat check_command check_nrpe!check_iostat } define service{ use local-service ; Name of service template to use host_name nagios-client service_description mysql check_command check_nrpe!check_mysql } define service{ use local-service ; Name of service template to use host_name nagios-client service_description nginx check_command check_nrpe!check_nginx } define service{ use local-service ; Name of service template to use host_name nagios-client service_description memory check_command check_nrpe!check_mem } define service{ use local-service ; Name of service template to use host_name nagios-client service_description IP鏈接數 check_command check_nrpe!check_ip_conn }
直接把原來的郵件報警的兩條命令中的/bin/mail修改成/usr/bin/mutt便可,以下圖 加快nagios的報警時間設置: 1.修改模版文件: vim /usr/local/nagios/etc/objects/templates.cfg 修改全部normal_check_interval項的值爲1,既發現故障後1分鐘就報警 修改全部check_interval項的值爲1,即正常狀況下每分鐘檢查一次 修改全部notification_interval 的值爲20分鐘 #在主機出現異常後,故障一直沒有解決,nagios再次對使用者發出通知的時間 service nagios restart 重啓nagios
測試告警:
試驗完成!
附上軟件包所需軟件地址
缺的軟件能夠直接找我要!