小知識:Linux如何刪除大量小文件

環境:RHEL 6.5 + Oracle 11.2.0.4
需求:使用df -i巡檢發現Inodes使用率太高,須要清理刪除文件來解決。若是Inodes滿,該目錄將不能寫,即便df -h查看還有剩餘空間。node

1.問題現象

Oracle的adump下記錄的是sys的登錄審計信息,特色是小碎文件很是多,常常會遇到使用rm -rf *命令刪除不了,報錯-bash: /bin/rm: Argument list too long
這是由於通配符*在執行時會替換爲具體的文件名,例如rm -rf file1 file2 file3 ...,若是文件數量過多,就容易出現這個錯誤。
好比在下面的環境中,adump目錄下文件已達到114萬+,執行rm -rf *命令時就會報這個錯誤:linux

[oracle@jystdrac2 adump]$ pwd
/opt/app/oracle/admin/crmdb/adump
[oracle@jystdrac2 adump]$ ls|wc -l
1149787
[oracle@jystdrac2 adump]$ rm -rf *
-bash: /bin/rm: Argument list too long
[oracle@jystdrac2 adump]$ du -sh
4.4G

2.解決方案

清楚了問題現象,解決方案就從除去rm -rf *命令的方式以外,還有哪些方法可用,若是經過網絡搜索,可能會找到結合find命令再去執行rm的方式,但其實效率很是差,具體寫法這裏就不列出了,由於咱們一般也不會這樣處理。那麼如何較爲效率的刪除大批小文件呢?結合網絡的經驗,並實測驗證,最終總結了兩種常見的解決方案,效率上也都尚可。shell

方案一:巧用rsync的方式達到刪除目的express

創建一個空文件夾,使用rsync --delete-before -d <空文件夾> <須要清理刪除小文件的目錄>命令最終達到刪除大批小文件的目的。下面演示具體操做:bash

[oracle@jystdrac2 adump]$ mkdir /data/null
[oracle@jystdrac2 adump]$ ls -l /data/null
total 0
[oracle@jystdrac2 ~]$ nohup rsync --delete-before -d /data/null/ /opt/app/oracle/admin/crmdb/adump/ &

使用man rsync查看rsync命令相關的參數說明以下:網絡

-d, --dirs                  transfer directories without recursing
--delete-before         receiver deletes before transfer (default)

方案二:使用find命令的delete參數oracle

使用find <須要清理刪除小文件的目錄> -type f -delete命令直接刪除大批小文件。app

使用man find查看find命令相關的參數說明以下:less

-type c
              File is of type c:

              b      block (buffered) special

              c      character (unbuffered) special

              d      directory

              p      named pipe (FIFO)

              f      regular file

              l      symbolic  link; this is never true if the -L option or the -follow option is in effect, unless the symbolic link is broken.  If you want to
                     search for symbolic links when -L is in effect, use -xtype.

              s      socket

              D      door (Solaris)

       -delete
              Delete files; true if removal succeeded.  If the removal failed, an error message is issued.  If -delete fails, find’s exit status will be nonzero
              (when it eventually exits).  Use of -delete automatically turns on the ‘-depth’ option.

              Warnings:  Don’t forget that the find command line is evaluated as an expression, so putting -delete first will make find try to delete everything
              below the starting points you specified.  When testing a find command line that you later intend to use with -delete, you should explicitly  spec-
              ify -depth in order to avoid later surprises.  Because -delete implies -depth, you cannot usefully use -prune and -delete together.

下面演示具體操做:socket

[oracle@jystdrac1 adump]$ nohup find /opt/app/oracle/admin/crmdb/adump/ -type f -delete &

能夠參考下面的命令來簡單監控刪除過程當中Inodes使用率的變化:

while true; do df -i /; sleep 10; done

好比我這裏節點jystdrac1使用的find方法,節點jystdrac2使用的rsync方法,實際觀察Inodes釋放速度區別並不大:

# 使用的find方法,觀察Inodes釋放速度:
[oracle@jystdrac1 ~]$ while true; do df -i /; sleep 10; done
Filesystem                        Inodes   IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 1519124 287772   85% /
Filesystem                        Inodes   IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 1519015 287881   85% /
Filesystem                        Inodes   IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 1513880 293016   84% /
Filesystem                        Inodes   IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 1511132 295764   84% /
Filesystem                        Inodes   IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 1502434 304462   84% /
Filesystem                        Inodes   IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 1494583 312313   83% /
Filesystem                        Inodes   IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 1489111 317785   83% /
Filesystem                        Inodes   IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 1487629 319267   83% /

# 使用的rsync方法,觀察Inodes釋放速度:
[oracle@jystdrac2 ~]$ while true; do df -i /; sleep 10; done
Filesystem                        Inodes  IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 963029 843867   54% /
Filesystem                        Inodes  IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 955037 851859   53% /
Filesystem                        Inodes  IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 953088 853808   53% /
Filesystem                        Inodes  IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 950523 856373   53% /
Filesystem                        Inodes  IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 948754 858142   53% /
Filesystem                        Inodes  IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 944613 862283   53% /
Filesystem                        Inodes  IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 942619 864277   53% /
Filesystem                        Inodes  IUsed  IFree IUse% Mounted on
/dev/mapper/vg_linuxbase-lv_root 1806896 938510 868386   52% /

既然兩種方式差別不算大,那就根據需求或我的習慣選擇便可。我本身更傾向於使用方案二,由於這樣無需建立空目錄,操做上也更直觀。
最後再總結下刪除大量小文件的方法:

# 方案一:
mkdir <空文件夾>
rsync --delete-before -d <空文件夾> <須要清理刪除小文件的目錄>
# 方案二:
find <須要清理刪除小文件的目錄> -type f -delete

相對來講這兩種方式都比較效率,但因爲總體小文件也是比較多,因此實際能夠選擇nohup放到後臺執行。

相關文章
相關標籤/搜索