個人ceph集羣運行了一段時間後,報以下警告: shell
# ceph -s cluster c6e7e7d9-2b91-4550-80b0-6fa46d0644f6 health HEALTH_WARN clock skew detected on mon.c 896 pgs stuck inactive 896 pgs stuck unclean noscrub flag(s) set Monitor clock skew detected monmap e1: 5 mons at {a=101.71.4.11:6789/0,b=101.71.4.12:6789/0,c=101.71.4.13:6789/0,d=101.71.4.14:6789/0,e=101.71.4.15:6789/0} election epoch 28, quorum 0,1,2,3,4 a,b,c,d,e osdmap e1616: 240 osds: 216 up, 216 in flags noscrub pgmap v16891: 4992 pgs, 18 pools, 1093 GB data, 38340 objects 5446 GB used, 361 TB / 386 TB avail 4096 active+clean 896 creating核心信息就是 clock skew detected on mon.c
我採用以下辦法解決問題:
1. 才每一臺MON機器上執行以下命令關閉ntpd服務 rest
service ntpd stop2. 執行ntpupdate命令進行時間信息同步
[root@gnop029-ct-zhejiang_wenzhou-16-14 ~]# ntpdate us.pool.ntp.org 5 Dec 16:27:20 ntpdate[30359]: adjust time server 209.118.204.201 offset 0.000712 sec3. 從新啓動ntpd服務
service ntpd start4.從新啓動ceph -s後,發現集羣再也不報時間問題:
[root@gnop029-ct-zhejiang_wenzhou-16-14 ~]# ceph -s cluster c6e7e7d9-2b91-4550-80b0-6fa46d0644f6 health HEALTH_WARN 896 pgs stuck inactive 896 pgs stuck unclean noscrub flag(s) set monmap e1: 5 mons at {a=101.71.4.11:6789/0,b=101.71.4.12:6789/0,c=101.71.4.13:6789/0,d=101.71.4.14:6789/0,e=101.71.4.15:6789/0} election epoch 28, quorum 0,1,2,3,4 a,b,c,d,e osdmap e1616: 240 osds: 216 up, 216 in flags noscrub pgmap v16891: 4992 pgs, 18 pools, 1093 GB data, 38340 objects 5446 GB used, 361 TB / 386 TB avail 4096 active+clean 896 creating
--------------------------------------------------------------------- code
能夠經過crontab命令將動做放置到週期任務中
crontab -e
增長以下信息 server
10 * * * * /usr/sbin/ntpdate us.pool.ntp.org
service crond restart Stopping crond: [ OK ] Starting crond: [ OK ]