pg中刪除的頁是否仍被訪問

昨天看到微信羣中,有人提問:pg對於標記爲刪除的頁,是否會掃描到?sql

今天作了一下測試,發現若是刪除的是表的最後連續的幾個頁(根據ctid來肯定數據插入前後,只討論有insert的狀況)中的數據,最後幾個頁通過vacuum後,會被釋放回操做系統,天然不會被掃描到,但若是不是最後的頁會不會掃描到,下面測試一下。微信

1. 測試無索引的狀況:

swrd=# \d t1
                       Table "swrd.t1"
 Column |       Type        | Collation | Nullable | Default
--------+-------------------+-----------+----------+---------
 c1     | integer           |           |          |
 c2     | character varying |           |          |
 c3     | character varying |           |          |
 c4     | character varying |           |          |
swrd=# insert into t1 select i,md5(i::text),md5(i::text),md5(i::text) from generate_series(1,10000) t(i);
INSERT 0 10000
swrd=# select max(ctid) from t1;
   max
----------
 (40,80)
(1 row)

在t1中插入1w條記錄,t1表是一共佔用了41個頁,查看記錄,一個塊中存放248條記錄。oop

##全表掃描,能夠看到buffers數是41個。
swrd=# explain (ANALYZE,VERBOSE,COSTS,BUFFERS,TIMING) select * from t1  ;
                                                 QUERY PLAN
-------------------------------------------------------------------------------------------------------------
 Seq Scan on swrd.t1  (cost=0.00..141.00 rows=10000 width=103) (actual time=0.020..1.696 rows=10000 loops=1)
   Output: c1, c2, c3, c4
   Buffers: shared hit=41
 Planning Time: 0.142 ms
 Execution Time: 2.650 ms
(5 rows)

##刪除幾個頁,看是否有變化,不作vacuum
##刪除第13頁到16頁
swrd=# alter table t1 set(autovacuum_enabled=off);
ALTER TABLE
swrd=# delete from t1 where c1>=2977 and c1<=4216;
DELETE 1240
##仍然讀取的buffer個數是41個,由於頁還沒作清理,因此很正常,下面作一下vacuum
swrd=# explain (ANALYZE,VERBOSE,COSTS,BUFFERS,TIMING) select * from t1  ;
                                                QUERY PLAN
-----------------------------------------------------------------------------------------------------------
 Seq Scan on swrd.t1  (cost=0.00..128.60 rows=8760 width=103) (actual time=0.066..2.961 rows=8760 loops=1)
   Output: c1, c2, c3, c4
   Buffers: shared read=41
 Planning Time: 0.694 ms
 Execution Time: 3.836 ms
(5 rows)
##作一下vacuum
swrd=# vacuum VERBOSE t1;
INFO:  vacuuming "swrd.t1"
INFO:  "t1": removed 0 row versions in 5 pages
INFO:  "t1": found 0 removable, 8760 nonremovable row versions in 41 out of 41 pages
DETAIL:  0 dead row versions cannot be removed yet, oldest xmin: 46060313
There were 0 unused item pointers.
Skipped 0 pages due to buffer pins, 0 frozen pages.
0 pages are entirely empty.
CPU: user: 0.00 s, system: 0.00 s, elapsed: 0.00 s.
INFO:  vacuuming "pg_toast.pg_toast_37066"
INFO:  index "pg_toast_37066_index" now contains 0 row versions in 1 pages
DETAIL:  0 index row versions were removed.
0 index pages have been deleted, 0 are currently reusable.
CPU: user: 0.00 s, system: 0.00 s, elapsed: 0.00 s.
INFO:  "pg_toast_37066": found 0 removable, 0 nonremovable row versions in 0 out of 0 pages
DETAIL:  0 dead row versions cannot be removed yet, oldest xmin: 46060313
There were 0 unused item pointers.
Skipped 0 pages due to buffer pins, 0 frozen pages.
0 pages are entirely empty.
CPU: user: 0.00 s, system: 0.00 s, elapsed: 0.00 s.
VACUUM
swrd=# explain (ANALYZE,VERBOSE,COSTS,BUFFERS,TIMING) select * from t1  ;
                                                QUERY PLAN
-----------------------------------------------------------------------------------------------------------
 Seq Scan on swrd.t1  (cost=0.00..128.60 rows=8760 width=103) (actual time=0.011..1.162 rows=8760 loops=1)
   Output: c1, c2, c3, c4
   Buffers: shared hit=41
 Planning Time: 0.092 ms
 Execution Time: 1.972 ms
(5 rows)

掃描的page仍然是41個。在作vacuum full後,掃描的頁數才減小了5個。測試

swrd=# vacuum FULL t1;
VACUUM
swrd=# explain (ANALYZE,VERBOSE,COSTS,BUFFERS,TIMING) select * from t1  ;
                                                QUERY PLAN
-----------------------------------------------------------------------------------------------------------
 Seq Scan on swrd.t1  (cost=0.00..123.60 rows=8760 width=103) (actual time=0.057..2.040 rows=8760 loops=1)
   Output: c1, c2, c3, c4
   Buffers: shared read=36
 Planning Time: 0.089 ms
 Execution Time: 2.879 ms
(5 rows)

2. 測試使用索引的狀況:

swrd=# create index on t1(c1);
CREATE INDEX
swrd=#   explain (ANALYZE,VERBOSE,COSTS,BUFFERS,TIMING) select * from t1  where c1>=2977 and c1<=4216 ;
                                                         QUERY PLAN
----------------------------------------------------------------------------------------------------------------------------
 Index Scan using t1_c1_idx on swrd.t1  (cost=0.29..42.08 rows=1240 width=103) (actual time=0.098..0.415 rows=1240 loops=1)
   Output: c1, c2, c3, c4
   Index Cond: ((t1.c1 >= 2977) AND (t1.c1 <= 4216))
   Buffers: shared hit=7
 Planning Time: 0.143 ms
 Execution Time: 0.558 ms
(6 rows)

c1值在2977-4216共佔用了5個頁,其餘兩個頁是來自索引的。操作系統

使用插件pageinspect看一下索引的結構:插件

swrd=# select * from bt_metap('t1_c1_idx');
 magic  | version | root | level | fastroot | fastlevel | oldest_xact | last_cleanup_num_tuples
--------+---------+------+-------+----------+-----------+-------------+-------------------------
 340322 |       3 |    3 |     1 |        3 |         1 |           0 |                      -1
(1 row)

看到索引的level是1,也就是該索引是2級結構,包含了meta page,root page,leaf page。另外讀取的buffer是root page和leaf page。code

下面刪掉其中的兩個頁的數據量:索引

swrd=# delete from t1 where c1 >=3225 and c1<=3720;
DELETE 496
swrd=#  explain (ANALYZE,VERBOSE,COSTS,BUFFERS,TIMING) select * from t1  where c1>=2977 and c1<=4216
swrd-# ;
                                                        QUERY PLAN
---------------------------------------------------------------------------------------------------------------------------
 Index Scan using t1_c1_idx on swrd.t1  (cost=0.29..42.08 rows=1240 width=103) (actual time=0.086..0.428 rows=744 loops=1)
   Output: c1, c2, c3, c4
   Index Cond: ((t1.c1 >= 2977) AND (t1.c1 <= 4216))
   Buffers: shared hit=8 dirtied=1
 Planning Time: 0.141 ms
 Execution Time: 0.528 ms
(6 rows)

swrd=#  explain (ANALYZE,VERBOSE,COSTS,BUFFERS,TIMING) select * from t1  where c1>=2977 and c1<=4216
;
                                                        QUERY PLAN
---------------------------------------------------------------------------------------------------------------------------
 Index Scan using t1_c1_idx on swrd.t1  (cost=0.29..42.08 rows=1240 width=103) (actual time=0.064..0.253 rows=744 loops=1)
   Output: c1, c2, c3, c4
   Index Cond: ((t1.c1 >= 2977) AND (t1.c1 <= 4216))
   Buffers: shared hit=5
 Planning Time: 0.125 ms
 Execution Time: 0.348 ms
(6 rows)

經過使用bt_page_items查看索引中的數據,一個索引頁中包含了1473條記錄,並能夠看到刪除的頁均在一個leaf page上面。ip

因爲咱們以前避免vacuum的干擾,將它關閉了,手動執行一下,能夠看到對應的索引記錄也被刪除了。此處刪除的數據是(13,1)到(14,248)的記錄。md5

swrd=# select * from bt_page_items('t1_c1_idx',4);
 itemoffset |   ctid   | itemlen | nulls | vars |          data
------------+----------+---------+-------+------+-------------------------
          1 | (17,201) |      16 | f     | f    | 41 11 00 00 00 00 00 00
          2 | (11,217) |      16 | f     | f    | 81 0b 00 00 00 00 00 00
          3 | (11,218) |      16 | f     | f    | 82 0b 00 00 00 00 00 00
          4 | (11,219) |      16 | f     | f    | 83 0b 00 00 00 00 00 00
          5 | (11,220) |      16 | f     | f    | 84 0b 00 00 00 00 00 00
          6 | (11,221) |      16 | f     | f    | 85 0b 00 00 00 00 00 00
          7 | (11,222) |      16 | f     | f    | 86 0b 00 00 00 00 00 00
          8 | (11,223) |      16 | f     | f    | 87 0b 00 00 00 00 00 00
          9 | (11,224) |      16 | f     | f    | 88 0b 00 00 00 00 00 00
         10 | (11,225) |      16 | f     | f    | 89 0b 00 00 00 00 00 00
         11 | (11,226) |      16 | f     | f    | 8a 0b 00 00 00 00 00 00
         12 | (11,227) |      16 | f     | f    | 8b 0b 00 00 00 00 00 00
         13 | (11,228) |      16 | f     | f    | 8c 0b 00 00 00 00 00 00
         14 | (11,229) |      16 | f     | f    | 8d 0b 00 00 00 00 00 00
         15 | (11,230) |      16 | f     | f    | 8e 0b 00 00 00 00 00 00
         16 | (11,231) |      16 | f     | f    | 8f 0b 00 00 00 00 00 00
         17 | (11,232) |      16 | f     | f    | 90 0b 00 00 00 00 00 00
         18 | (11,233) |      16 | f     | f    | 91 0b 00 00 00 00 00 00
         19 | (11,234) |      16 | f     | f    | 92 0b 00 00 00 00 00 00
         20 | (11,235) |      16 | f     | f    | 93 0b 00 00 00 00 00 00
.....
        277 | (12,244) |      16 | f     | f    | 94 0c 00 00 00 00 00 00
        278 | (12,245) |      16 | f     | f    | 95 0c 00 00 00 00 00 00
        279 | (12,246) |      16 | f     | f    | 96 0c 00 00 00 00 00 00
        280 | (12,247) |      16 | f     | f    | 97 0c 00 00 00 00 00 00
        281 | (12,248) |      16 | f     | f    | 98 0c 00 00 00 00 00 00
        282 | (15,1)   |      16 | f     | f    | 89 0e 00 00 00 00 00 00
        283 | (15,2)   |      16 | f     | f    | 8a 0e 00 00 00 00 00 00
        284 | (15,3)   |      16 | f     | f    | 8b 0e 00 00 00 00 00 00
        285 | (15,4)   |      16 | f     | f    | 8c 0e 00 00 00 00 00 00
....
        769 | (16,240) |      16 | f     | f    | 70 10 00 00 00 00 00 00
        770 | (16,241) |      16 | f     | f    | 71 10 00 00 00 00 00 00
        771 | (16,242) |      16 | f     | f    | 72 10 00 00 00 00 00 00
        772 | (16,243) |      16 | f     | f    | 73 10 00 00 00 00 00 00
        773 | (16,244) |      16 | f     | f    | 74 10 00 00 00 00 00 00
        774 | (16,245) |      16 | f     | f    | 75 10 00 00 00 00 00 00
        775 | (16,246) |      16 | f     | f    | 76 10 00 00 00 00 00 00
        776 | (16,247) |      16 | f     | f    | 77 10 00 00 00 00 00 00
        777 | (16,248) |      16 | f     | f    | 78 10 00 00 00 00 00 00
        778 | (17,1)   |      16 | f     | f    | 79 10 00 00 00 00 00 00
....

看到上面的執行計劃,第一次執行時是有8個buffer,其中一個是髒頁,這裏後面再深刻了解一下,再補充上,第二次時,讀取的buffer數就變成了5個,咱們刪除的兩個頁已經再也不了。也就是使用索引掃描時沒有掃描到刪除的頁,這個也很好理解,由於索引是根據ctid來掃描的,刪除的頁不存在要查詢的數據了,天然就不會在掃了。

總結:

  1. 全表掃描時,讀取數據時可能會掃描到刪除的頁面。(按理說在讀數據時,只須要讀取buftag對應的頁就能夠了,但測試發現膨脹產生的髒頁仍會被讀到。很疑惑。)
  2. 索引掃描時,不會掃描到刪除的頁面,但對於有hot操做的記錄,仍須要使用原記錄的line pointer。
相關文章
相關標籤/搜索