Mysql 高負載排查思路
發現問題
top命令 查看服務器負載,發現 mysql居然百分之兩百的cpu,引發Mysql 負載這麼高的緣由,估計是索引問題和某些變態SQL語句.
排查思路
1. 肯定高負載的類型,top命令看負載高是CPU仍是IO。
2. mysql 下執行查看當前的鏈接數與執行的sql 語句。
3. 檢查慢查詢日誌,多是慢查詢引發負載高。
4. 檢查硬件問題,是否磁盤故障問題形成的。
5. 檢查監控平臺,對比此機器不一樣時間的負載。
肯定負載類型(top)mysql
- top - 10:14:18 up 23 days, 11:01, 1 user, load average: 124.17, 55.88, 24.70
- Tasks: 138 total, 1 running, 137 sleeping, 0 stopped, 0 zombie
- Cpu(s): 2.4%us, 1.0%sy, 0.0%ni, 95.2%id, 2.0%wa, 0.1%hi, 0.2%si, 0.0%st
- Mem: 3090528k total, 2965772k used, 124756k free, 93332k buffers
- Swap: 4192956k total, 2425132k used, 1767824k free, 756524k cached
- PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
- 30833 mysql 15 0 6250m 2.5g 4076 S 257.1 49.9 529:34.45 mysqld
查看當前的鏈接數與執行的sql 語句linux
- show processlist;
- Id User Host db Command Time State Info
- 192 slave 8.8.8.142:39820 NULL Binlog Dump 58982 Has sent all binlog to slave; waiting for binlog to be updated NULL
- 194 slave 8.8.8.120:41075 NULL Binlog Dump 58982 Has sent all binlog to slave; waiting for binlog to be updated NULL
- 424891 biotherm 8.8.8.46:57861 biotherm Query 493 Sending data SELECT * FROM xxx_list WHERE tid = '1112' AND del = 0 ORDER BY id
- DESC LIMIT 0, 4
- 424917 biotherm 8.8.8.49:50984 biotherm Query 488 Sending data SELECT * FROM xxx_list WHERE tid = '1112' AND del = 0 ORDER BY id
- DESC LIMIT 0, 4
- ..............................................
- 430330 biotherm 8.8.8.42:35982 biotherm Query 487 Sending data SELECT * FROM xxx_list WHERE tid = '1112' AND del = 0
記錄慢查詢
編輯Mysql 配置文件(my.cnf),在[mysqld]字段添加如下幾行:sql
- log_slow_queries = /usr/local/mysql/var/slow_queries.log #慢查詢日誌路徑
- long_query_time = 10 #記錄SQL查詢超過10s的語句
- log-queries-not-using-indexes = 1 #記錄沒有使用索引的sql
查看慢查詢日誌數據庫
- tail /usr/local/mysql/var/slow_queries.log
- # Time: 130305 9:48:13
- # User@Host: biotherm[biotherm] @ [8.8.8.45]
- # Query_time: 1294.881407 Lock_time: 0.000179 Rows_sent: 4 Rows_examined: 1318033
- SET timestamp=1363916893;
- SELECT * FROM xxx_list WHERE tid = '11xx' AND del = 0 ORDER BY id DESC LIMIT 0, 4;
4個參數
Query_time: 0 Lock_time: 0 Rows_sent: 1 Rows_examined: 54
分別意思爲:查詢時間 鎖定時間 查詢結果行數 掃描行數,主要看掃描行數多的語句,而後去數據庫加上對應的索引,再優化下變態的sql 語句。
極端狀況kill sql進程服務器
- 找出佔用cpu時間過長的sql,在mysql 下執行以下命令:
- show processlist;
- 肯定後一條sql處於Query狀態,且Time時間過長,鎖定它的ID,執行以下命令:
- kill QUERY 269815764;
注意:殺死 sql進程,可能致使數據丟失,因此執行前要衡量數據的重要性。
案例:索引致使的問題
結束
更多請:
linux 系統運維 37275208
vmware 虛擬化 166682360運維