上一章,咱們經過實踐和其餘文章的幫助,在k8s的環境安裝了redis-ha,而且對其進行了一些實驗來驗證他的主從切換是否有效。本篇中將會分析,到底是如何實現了redis-ha的主從切換,以及其與K8S平面進行的交互。node
我曾經覺得是在helm/chart中寫入了腳原本完成這件事,可是仔細看過代碼以後,並未發現明顯的內容,關於搭建redis-ha和主從切換的腳本。git
地址:https://github.com/helm/charts/tree/master/stable/redis-hagithub
後來,經過查看redis鏡像的日誌發現了一些內容,redis
地址:https://quay.io/repository/smile/redis/manifest/sha256:8948a952920d4495859c984546838d4c9b4c71e0036eef86570922d91cacb3df?tab=layersexpress
能夠看到,在這個鏡像構建日誌中,有幾個疑似相關內容的文件,/usr/local/bin目錄下的promte.sh,redis-launcher.sh,label-updater.shapache
進入到pod中,咱們能夠看到redis-launcher是做爲啓動時就運行的腳本,因此我就推測這一切都是這個文件引發的。json
redis-launcher.sh:bash
bash-4.4# cat /usr/local/bin/redis-launcher.sh #!/bin/bash # Copyright 2017 Ismail KABOUBI # # Licensed under the Apache License, Version 2.0 (the "License"); # you may not use this file except in compliance with the License. # You may obtain a copy of the License at # # http://www.apache.org/licenses/LICENSE-2.0 # # Unless required by applicable law or agreed to in writing, software # distributed under the License is distributed on an "AS IS" BASIS, # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. # See the License for the specific language governing permissions and # limitations under the License. # This script determines whether the pod that executes it will be a Redis Sentinel, Master, or Slave # The redis-ha Helm chart signals Sentinel status with environment variables. If they are not set, the newly # launched pod will scan K8S to see if there is an active master. If not, it uses a deterministic means of # sensing whether it should launch as master then writes master or slave to the label called redis-role # appropriately. It's this label that determines which LB a pod can be seen through. # # The redis-role=master pod is the key for the cluster to get started. Sentinels will wait for it to appear # in the LB before they finish launching. All other pods wait for the Sentinels to ID the master. # # Pods also set the labels podIP and runID. RunID is the first few characters of the unique run_id value # generated by each Redis sever. # # During normal operation, there should be only one redis-role=master pod. If it fails, the Sentinels # will nominate a new master and change all the redis-role values appropriately. echo "Starting redis launcher" echo "Setting labels" label-updater.sh & plabeler=$! echo "Selecting proper service to execute" # Define config file locations SENTINEL_CONF=/etc/redis/sentinel.conf MASTER_CONF=/etc/redis/master.conf SLAVE_CONF=/etc/redis/slave.conf # Adapt to dynamically named env vars ENV_VAR_PREFIX=`echo $REDIS_CHART_PREFIX|awk '{print toupper($0)}'|sed 's/-/_/g'` PORTVAR="${ENV_VAR_PREFIX}MASTER_SVC_SERVICE_PORT" HOSTVAR="${ENV_VAR_PREFIX}MASTER_SVC_SERVICE_HOST" MASTER_LB_PORT="${!PORTVAR}" MASTER_LB_HOST="${!HOSTVAR}" QUORUM=${QUORUM:-2} # Only sets AUTH if the ENV var REDIS_PASS is set. REDISAUTH="" [ -n "$REDIS_PASS" ] && REDISAUTH="-a $REDIS_PASS" || REDISAUTH="" # Launch master when `MASTER` environment variable is set function launchmaster() { # If we know we're a master, update the labels right away kubectl label --overwrite pod $HOSTNAME redis-role="master" echo "Using config file $MASTER_CONF" if [[ ! -e /redis-master-data ]]; then echo "Redis master data doesn't exist, data won't be persistent!" mkdir /redis-master-data fi if [ -n "$REDIS_PASS" ]; then sed -i "s/# requirepass/requirepass ${REDIS_PASS} \n#/" $MASTER_CONF fi redis-server $MASTER_CONF --protected-mode no $@ } # Launch sentinel when `SENTINEL` environment variable is set function launchsentinel() { # If we know we're a sentinel, update the labels right away kubectl label --overwrite pod $HOSTNAME redis-role="sentinel" echo "Using config file $SENTINEL_CONF" while true; do # The sentinels must wait for a load-balanced master to appear then ask it for its actual IP. MASTER_IP=$(kubectl get pod -o jsonpath='{range .items[*]}{.metadata.name} {..podIP} {.status.containerStatuses[0].state}{"\n"}{end}' -l redis-role=master|grep running|grep $REDIS_CHART_PREFIX|awk '{print $2}'|xargs) echo "Current master is $MASTER_IP" if [[ -z ${MASTER_IP} ]]; then continue fi timeout -t 3 redis-cli ${REDISAUTH} -h ${MASTER_IP} -p ${MASTER_LB_PORT} INFO if [[ "$?" == "0" ]]; then break fi echo "Connecting to master failed. Waiting..." sleep 10 done echo "sentinel monitor mymaster ${MASTER_IP} ${MASTER_LB_PORT} ${QUORUM}" > ${SENTINEL_CONF} echo "sentinel down-after-milliseconds mymaster 15000" >> ${SENTINEL_CONF} echo "sentinel failover-timeout mymaster 30000" >> ${SENTINEL_CONF} echo "sentinel parallel-syncs mymaster 10" >> ${SENTINEL_CONF} echo "bind 0.0.0.0" >> ${SENTINEL_CONF} echo "sentinel client-reconfig-script mymaster /usr/local/bin/promote.sh" >> ${SENTINEL_CONF} if [ -n "$REDIS_PASS" ]; then echo "sentinel auth-pass mymaster ${REDIS_PASS}" >> ${SENTINEL_CONF} fi redis-sentinel ${SENTINEL_CONF} --protected-mode no $@ } # Launch slave when `SLAVE` environment variable is set function launchslave() { kubectl label --overwrite pod $HOSTNAME redis-role="slave" echo "Using config file $SLAVE_CONF" if [[ ! -e /redis-master-data ]]; then echo "Redis master data doesn't exist, data won't be persistent!" mkdir /redis-master-data fi i=0 while true; do master=${MASTER_LB_HOST} timeout -t 3 redis-cli ${REDISAUTH} -h ${master} -p ${MASTER_LB_PORT} INFO if [[ "$?" == "0" ]]; then break fi i=$((i+1)) if [[ "$i" -gt "30" ]]; then echo "Exiting after too many attempts" kill $plabeler exit 1 fi echo "Connecting to master failed. Waiting..." sleep 1 done if [ -n "$REDIS_PASS" ]; then sed -i "s/# masterauth/masterauth ${REDIS_PASS} \n#/" $SLAVE_CONF sed -i "s/# requirepass/requirepass ${REDIS_PASS} \n#/" $SLAVE_CONF fi sed -i "s/%master-ip%/${MASTER_LB_HOST}/" $SLAVE_CONF sed -i "s/%master-port%/${MASTER_LB_PORT}/" $SLAVE_CONF redis-server $SLAVE_CONF --protected-mode no $@ } #Check if MASTER environment variable is set if [[ "${MASTER}" == "true" ]]; then echo "Launching Redis in Master mode" launchmaster exit 0 fi # Check if SENTINEL environment variable is set if [[ "${SENTINEL}" == "true" ]]; then echo "Launching Redis Sentinel" launchsentinel echo "Launcsentinel action completed" exit 0 fi # Determine whether this should be a master or slave instance echo "Looking for pods running as master" MASTERS=`kubectl get pod -o jsonpath='{range .items[*]}{.metadata.name} {..podIP} {.status.containerStatuses[0].state}{"\n"}{end}' -l redis-role=master|grep running|grep $REDIS_CHART_PREFIX` if [[ "$MASTERS" == "" ]]; then echo "No masters found: \"$MASTERS\" Electing first master..." SLAVE1=`kubectl get pod -o jsonpath='{range .items[*]}{.metadata.creationTimestamp} {.metadata.name} {.status.containerStatuses[0].state} {"\n"} {end}' -l redis-node=true |grep running|sort|awk '{print $2}'|grep $REDIS_CHART_PREFIX|head -n1` if [[ "$SLAVE1" == "$HOSTNAME" ]] || [[ "$SLAVE1" == "" ]]; then echo "Taking master role" launchmaster else echo "Electing $SLAVE1 master" launchslave fi exit 0 else echo "Found $MASTERS" echo "Launching Redis in Slave mode" launchslave exit 0 fi echo "Launching Redis in Slave mode" launchslave echo "Launchslave action completed"
label-updater.shapp
bash-4.4# cat /usr/local/bin/label-updater.sh # Push some helpful vars into labels PODIP=`hostname -i` echo podIP $PODIP kubectl label --overwrite pod $HOSTNAME podIP="$PODIP" if [ "$SENTINEL" ]; then exit fi RUNID="" # Only sets AUTH if the ENV var REDIS_PASS is set. REDISAUTH="" [ -n "$REDIS_PASS" ] && REDISAUTH="-a $REDIS_PASS" || REDISAUTH="" while true; do RUNID=`redis-cli $REDISAUTH info server |grep run_id|awk -F: '{print $2}'|head -c6` if [ -n "$RUNID" ]; then kubectl label --overwrite pod $HOSTNAME runID="$RUNID" break else sleep 1 fi done
promote.shless
bash-4.4# cat /usr/local/bin/promote.sh #!/usr/bin/env bash MASTERIP=$6 # Convert the IP of the promoted pod to a hostname MASTERPOD=`kubectl get pod -o jsonpath='{range .items[*]}{.metadata.name} {..podIP} {.status.containerStatuses[0].state}{"\n"}{end}' -l redis-role=slave --sort-by=.metadata.name|grep running|grep $MASTERIP|awk '{print $1}'` echo "PROMO ARGS: $@" echo "PROMOTING $MASTERPOD ($MASTERIP) TO MASTER" kubectl label --overwrite pod $MASTERPOD redis-role="master" # Demote anyone else who jumped to master kubectl get pod -o jsonpath='{range .items[*]}{.metadata.name} {.status.containerStatuses[0].state}{"\n"}{end}' -l redis-role=master --sort-by=.metadata.name|grep running|awk '{print $1}'|grep $REDIS_CHART_PREFIX|grep -v $MASTERPOD|xargs -n1 -I% kubectl label --overwrite pod % redis-role="slave" echo "OTHER MASTERS $MASTERS"
詳細的內容我尚未開始看,可是能夠講一下大體的原理,就是在每一個pod在啓動的時候都會起這樣的一個redis-launcher的進程,這個就像一個agent同樣,主要完成redis的master,slave和sentinel的配置,同時,他會將各個pod的角色反向經過kubectl命令傳給K8S平面。
若是是發生了主從結構已經起來了,可是中途由於某種緣由掛掉了,則會經過監控sentinel的狀態來觸發更改K8S平面pod的Role的過程。這個設定是在啓動sentinel完成的,代碼在這裏:
echo "sentinel client-reconfig-script mymaster /usr/local/bin/promote.sh" >> ${SENTINEL_CONF}
他會監控,若是對於mymaster這個集羣中的sentinel發生了reconfig的事件的時候,就會去觸發/usr/local/bin/promote.sh這個腳本。
因此能夠看到,是redis的pod在控制,而不是K8S平面在進行控制。以後有時間,我會詳細的讀一下這個腳本,而後加上一些註釋。
更多openstack/trove的文章:http://www.cnblogs.com/S-tec-songjian/
此文章屬博客園用戶S-tec原創做品,受國家《著做權法》保護,未經許可,任何單位及我的不得作營利性使用;若僅作我的學習、交流等非營利性使用,應當指明做者姓名、做品名稱,原文地址,而且不得侵犯做者依法享有的其餘權利。