獲取堆棧信息
問題線程的定位
負載較低mysql
mysql_pid=4522
pstack $mysql_pid>pstack.infolinux
pt-pmp對堆棧信息排序
pt-pmp pstack.info | lesssql
也能夠直接執行pt-pmp
pt-pmp --pid 4522
如app
10 __io_getevents_0_4(libaio.so.1),LinuxAIOHandler::collect(os0file.cc:2502),LinuxAIOHandler::poll(os0file.cc:2648),os_aio_linux_handler(os0file.cc:2704),os_aio_handler(os0file.cc:2704),fil_aio_wait(fil0fil.cc:5835),io_handler_thread(srv0start.cc:311),start_thread(libpthread.so.0),clone(libc.so.6) 3 pthread_cond_wait,wait(os0event.cc:165),os_event::wait_low(os0event.cc:165),srv_worker_thread(srv0srv.cc:2520),start_thread(libpthread.so.0),clone(libc.so.6) 2 pthread_cond_wait,native_cond_wait(thr_cond.h:140),my_cond_wait(thr_cond.h:140),inline_mysql_cond_wait(thr_cond.h:140),Per_thread_connection_handler::block_until_new_connection(thr_cond.h:140),handle_connection(connection_handler_per_thread.cc:329),pfs_spawn_thread(pfs.cc:2190),start_thread(libpthread.so.0),clone(libc.so.6)
堆棧跟蹤
等待分析
侵入性大less
mysqld_pid 4522
gdb -p $mysqld_pid
(gdb) info thread 顯示運行的全部線程
(gdb) thread 4 切換到某個線程
bt 顯示調用棧
如函數
SELECT B.THREAD_OS_ID AS mysqld_os_thread_id,A.ID as processlist_id,A.USER,A.HOST,A.DB,A.COMMAND,A.TIME,A.STATE,LEFT(A.INFO,150) AS statement FROM information_schema.PROCESSLIST A INNER JOIN performance_schema.threads B on A.ID=B.PROCESSLIST_ID where A.id !=connection_id();
+---------------------+----------------+-------+-----------+------+---------+------+---------------------------------+----------------------------------------+
| mysqld_os_thread_id | processlist_id | USER | HOST | DB | COMMAND | TIME | STATE | statement |
+---------------------+----------------+-------+-----------+------+---------+------+---------------------------------+----------------------------------------+
| 27015 | 4 | admin | localhost | NULL | Sleep | 2597 | | NULL |
| 28252 | 6 | admin | localhost | test | Query | 1166 | Waiting for table metadata lock | alter table test.test_1 drop index idx |
+---------------------+----------------+-------+-----------+------+---------+------+---------------------------------+----------------------------------------+
(gdb) thread 4
[Switching to thread 4 (Thread 0x7f8d107f8700 (LWP 28252))]
#0 0x00007f8d3b147d12 in pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0uibt
........
#4 MDL_wait::timed_wait (this=0x7f8ce4000958, owner=0x7f8ce40008c0, abs_timeout=0x7f8d107f37e0, set_status_on_timeout=false,
wait_state_name=) at /export/home/pb2/build/sb_0-27500212-1520171728.22/mysql-5.7.22/sql/mdl.cc:1861 this
注意spa
- pstack 和gdb均可以獲取堆棧信息,區別是gdb是交互式的(須要手動退出),pstack是快照形式(執行3s自動退出)
- 對mysqld使用gdb後,已經創建的鏈接沒法再執行語句,新鏈接沒法創建
查看進程的系統調用信息
負載較高
查看系統調用
strace -cp $mysqld_pid
查看執行語句
mysqld_pid=4522
strace -f -F -ff -o mysqld-strace -s 1024 -p $mysqld_pid
find ./ -name "mysqld-strace" -type f -print |xargs grep -n "SELECT.FROM"
查看讀寫的文件
strace -o /tmp/strace_output.txt -T -tt -f -e trace=read,open -p 「mysqld_pid」
進程內部函數調用狀況
負載較低
mysqld內部函數總體消耗狀況
perf top -p 4522
如
Samples: 66 of event 'cpu-clock', Event count (approx.): 6520074 Overhead Shared Object Symbol 18.42% mysqld [.] sync_array_print_long_waits_low 10.83% [kernel] [k] _raw_spin_unlock_irqrestore 7.88% libaio.so.1.0.1 [.] 0x0000000000000645 7.30% [kernel] [k] finish_task_switch 6.84% [kernel] [k] system_call_after_swapgs 6.22% [kernel] [k] aio_read_events 6.02% libc-2.17.so [.] __memset_sse2
記錄全部內部函數調用
該命令會輸出記錄到perf.data中
perf record -p 4522
解析並查看perf.data內容
perf script -i perf.data>perf.log && less perf.log
參考
MySQL全部操做hang住問題的故障排查
linux strace追蹤mysql執行語句 (mysqld --debug)
linux 調試利器gdb, strace, pstack, pstree, lsof
Debugging a MySQL Server
What to Do If MySQL Keeps Crashing