SQL Server 2017錯誤日誌中出現「Parallel redo is shutdown for database 'xxx' with worker pool size [2]."淺析

時間 2020-05-22

標籤 sql server 錯誤日誌出現 parallel redo shutdown database worker pool size 淺析欄目 SQL 简体版

原文原文鏈接

在SQL Server 2017的錯誤日誌中出現"Parallel redo is started for database 'xxx' with worker pool size [2]"和「Parallel redo is shutdown for database 'xxx' with worker pool size [2].」這種信息，這意味着什麼呢？以下所示sql

Date 2020/5/16 11:07:38數據庫

Log SQL Server (Current - 2020/5/16 11:08:00)服務器

Source spid33sapp

Message測試

Parallel redo is started for database 'YourSQLDba' with worker pool size [2].spa

Date 2020/5/16 11:07:38命令行

Log SQL Server (Current - 2020/5/16 11:08:00)線程

Source spid33srest

Message日誌

Parallel redo is shutdown for database 'YourSQLDba' with worker pool size [2].

其實這個要涉及parallel redo這個概念，官方文檔有詳細介紹，摘抄部分以下【詳情請見參考資料】：

When availability group was initially released with SQL Server 2012, the transaction log redo was handled by a single redo thread for each database in an AG secondary replica. This redo model is also called as serial redo. In SQL Server 2016, the redo model was enhanced with multiple parallel redo worker threads per database to share the redo workload. In addition, each database has a new helper worker thread for handling the dirty page disk flush IO. This new redo model is called parallel redo. With the new parallel redo model that is the default setting since SQL Server 2016, workloads with highly concurrent small transactions are expected to achieve better redo performance. When the transaction redo operation is CPU intensive, such as when data encryption and/or data compression are enabled, parallel redo has even higher redo throughput (Redone Bytes/sec) compared to serial redo. Moreover, indirect checkpoint allows parallel redo to offload more disk IO (and IO waits for slow disk) to its helper worker thread and frees main redo thread to enumerate more received log records in secondary replica. It further speeds up the redo performance. However parallel redo, which enables multi-threading model, has an associated cost.

其實錯誤日誌中出現這些信息，這是在SQL Server 2017中添加的與可用性組的並行重作（Parallel redo）相關的信息性日誌消息。咱們的SQL Server實例是單實例，並非AG中的一個節點，怎麼會有parallel redo的信息呢？其實數據庫沒有參與AG，因此在數據庫啓動的時候，該數據庫的parallel redo線程啓動，而後數據庫檢查發現並無可用性組。那麼就會關閉parallel redo的線程。

因此在數據庫實例重啓事後，你會在錯誤日誌看到「Parallel redo is started for database 'xxxx' with worker pool size [2].」這樣的輸出信息，而後立馬又會看到「Parallel redo is shutdown for database 'xxxx' with worker pool size [2].」.

其實呢，還有一種狀況，就是你的用戶數據設置開啓了AUTO_CLOSE選項。以下所示，我將數據庫的YourSQLDba的AUTO_CLOSE開啓。

 
  USE [master] 
   
  GO 
   
  ALTER DATABASE [YourSQLDba] SET AUTO_CLOSE ON WITH NO_WAIT 
   
  GO 
   
  SELECT   d.name                        AS database_name  
   
          ,SUSER_SNAME(owner_sid)        AS database_owner 
   
          ,d.create_date                 AS create_date  
   
          ,d.collation_name              AS collcation_name  
   
          ,d.state_desc                  AS state_desc 
   
          ,d.is_auto_close_on            AS is_auto_close_on 
   
  FROM    sys.databases d

以下所示，當會話訪問此數據庫，就會出現大量這樣的日誌信息。此時能夠經過將數據庫AUTO_CLOSE選項關閉，就不會在錯誤日誌中出現大量這樣的信息，可是在SQL Server實例啓動的時候，你仍是仍是會看到這些日誌信息

咱們能夠經過啓用跟蹤標記3459來關閉parallel redo這個功能。注意，這個跟蹤標記（trace flag）僅僅適用於SQL Server 2016/2017或更高的版本。建議在數據庫實例啓動時經過使用 -T 命令行選項來啓用全局跟蹤標誌。這樣可確保跟蹤標誌在服務器從新啓動後保持活動狀態。若要讓跟蹤標誌生效，請重啓 SQL Server。

另外，注意關於parallel redo在特定版本有個Bug：「FIX: Parallel redo does not work after you disable Trace Flag 3459 in an instance of SQL Server」，但願你不在測試過程當中命中了這個Bug，不然會影響測試結果（具體版本信息，請閱讀參考資料的官方連接）

Assume that you use Always On Availability Groups in Microsoft SQL Server. After you switch to serial redo from parallel redo by enabling Trace Flag 3459, serial redo works as expected. However, when you switch back to parallel redo by disabling Trace Flag 3459, parallel redo does not work. If you restart the instance of SQL Server, parallel redo works as expected.

參考資料：

https://docs.microsoft.com/zh-cn/archive/blogs/sql_server_team/sql-server-20162017-availability-group-secondary-replica-redo-model-and-performance

https://dba.stackexchange.com/questions/239181/messages-about-parallel-redo

https://support.microsoft.com/en-us/help/4339858/fix-parallel-redo-does-not-work-after-you-disable-trace-flag-3459-in-a