文本工具cut、wc、sort、uniq、tr

時間 2019-12-07

標籤文本工具 cut sort uniq 简体版

原文原文鏈接

8.10 shell特殊符號&cut命令

特殊符號

「*」表明零個或多個字符
「？」表明一個字符
「#」註釋符號
「\」脫意符號
「|」管道符
「$」該符號與「!」合用「!$」表示上一條命令中的最後一個變量
「；」分隔符，在一行中運行兩個及兩個以上的命令時使用
「~」用戶的家目錄（root用戶「/root」，普通用戶「/home/username」）
「&」若是想把一條命令直接放到後臺運行的話，能夠在命令行加上這個符號（一般用於運行時間很是長的命令）
「[]」中括號中間爲字符組合，表明中間字符中的任意一個。

cut命令

cut命令用來顯示行中的指定部分，刪除文件中指定字段。cut常常用來顯示文件的內容，相似於下的type命令。
說明：該命令有兩項功能，其一是用來顯示文件的內容，它依次讀取由參數file所指明的文件，將它們的內容輸出到標準輸出上；其二是鏈接兩個或多個文件，如cut fl f2 > f3將把文件fl和幾的內容合併起來，而後經過輸出重定向符「>」的做用，將它們放入文件f3中。linux

語法： cut -d ‘分隔符’ [-cf] n [filename] （這裏n是正整數）
-d：指定分隔符號
-f：指定第幾段
-c：後面只有一個數字表示截取第幾個字符；後面跟一個數字區域，表示截取從幾到幾（該選項不和d，f共同使用）shell

[root@3 tmp]# cut -c1 1.txt |head -n2
r
b
[root@3 tmp]# cut -c1,3 1.txt |head -n2
ro
bn
[root@3 tmp]# cut -f1,3 -d ':' 1.txt |head -n2
root:0
bin:1
[root@3 tmp]# cut -f1-3 -d ':' 1.txt |head -n2
root:x:0
bin:x:1

8.11 sort、wc、uniq命令

sort命令

sort命令是在Linux裏很是有用，它將文件進行排序，並將排序結果標準輸出。sort命令既能夠從特定的文件，也能夠從stdin中獲取輸入。bash

語法： sort [-t 分隔符] [options] [filename]
Options:
-t：指定分隔符
-n：使用純數字排序（系統默認全部字母爲0）
-r：反向排序
-u：=unique 去重複
-kn1,n2：由n1區間排序到n2區間，能夠只寫-kn1，即對n1字段排序（n1 < n2）
sort不加任何選項，則從首字符向後，依次以ASCⅡ碼值進行比較，最後將它們按升續輸出。spa

[root@3 tmp]# head -n3 1.txt
root:x:0:0:root:/root:/bin/bash
bin:x:1:1:bin:/bin:/sbin/nologin
daemon:x:2:2:daemon:/sbin:/sbin/nologin
[root@3 tmp]# head -n3 1.txt |sort
bin:x:1:1:bin:/bin:/sbin/nologin
daemon:x:2:2:daemon:/sbin:/sbin/nologin
root:x:0:0:root:/root:/bin/bash

wc命令

wc命令用來計算數字。利用wc指令咱們能夠計算文件的Byte數、字數或是列數。命令行

語法： wc [options] [filename]
Options:
-l：=line 統計行數
-m：=member 統計字符數
-w：=Word 統計詞數日誌

[root@3 tmp]# wc -l !$
wc -l 2.txt
2 2.txt
[root@3 tmp]# wc -m 2.txt
10 2.txt
[root@3 tmp]# cat !$
cat 2.txt
1234
qwer
[root@3 tmp]# cat -A 2.txt
1234$
qwer$
[root@3 tmp]# wc -w 2.txt
4 2.txt
[root@3 tmp]# cat 2.txt
1234 456 789,10
qwer

說明： wc -m會統計文件內全部字符，包括隱藏的換行符「&」；wc -w是以空格做爲分隔符進行詞組統計的。code

uniq命令（unique）

uniq命令用於報告或忽略文件中的重複行，通常與sort命令結合使用（即：去重複）。排序

語法： uniq [options] [filename]
Options：
-c：=count 在每列旁邊顯示該行重複出現的次數it

[root@3 tmp]# cat !$
cat 2.txt
1234
456 789,10
1234
qwer
456
[root@3 tmp]# uniq -c 2.txt
      1 1234
      1 456 789,10
      1 1234
      1 qwer
      1 456
[root@3 tmp]# sort 2.txt |uniq -c
      2 1234
      1 456
      1 456 789,10
      1 qwer

說明： 直接使用uniq命令，2.txt內容顯示並無變化，使用sort排序後再用uniq命令，重複行被合併，即：在對文件進行去重以前須要先進行排序！io

8.12 tee、tr、split命令

tee命令

tee命令用於將數據重定向到文件,會刪除文件內原有內容，與「>」不一樣的是，tee會把定向的文件內容顯示出來。

語法： tee [options] [filename]
Options：
-a：向文件中重定向時使用追加模式（=「>>」）

[root@3 tmp]# cat 3.txt
00000000000
[root@3 tmp]# sort 2.txt |uniq -c |tee 3.txt
      2 1234
      1 456
      1 456 789,10
      1 qwer
[root@3 tmp]# cat 3.txt
      2 1234
      1 456
      1 456 789,10
      1 qwer
      [root@3 tmp]# sort 2.txt |uniq -c |tee -a 3.txt
      2 1234
      1 456
      1 456 789,10
      1 qwer
[root@3 tmp]# cat 3.txt
      2 1234
      1 456
      1 456 789,10
      1 qwer
      2 1234
      1 456
      1 456 789,10
      1 qwer

tr命令

tr命令能夠對來自標準輸入的字符進行替換、壓縮和刪除，它能夠將一個字符變成另外一個字符，也能夠將一組字符變成另外一組字符。

語法： tr [源字符] [目標字符]

[root@3 tmp]# echo "adailinux" |tr 'a' 'A'
AdAilinux   替換一個字符
[root@3 tmp]# echo "adailinux" |tr '[al]' '[AL]'
AdAiLinux   替換多個字符
[root@3 tmp]# echo "adailinux" |tr '[a-z]' '[A-Z]'
ADAILINUX

split命令

split命令能夠將一個大文件分割成不少個小文件，有時須要將文件分割成更小的片斷，好比爲提升可讀性，生成日誌等。

語法： split [options] [filename]
-b：指定每一輸出檔案的大小，默認單位爲 byte，可自定義單位，如 split -b 100M filename
-l：指定每個輸出檔案的行數多少
eg1： 指定大小

[root@3 tmp]# split -b 100 1.txt
[root@3 tmp]# ls
xaa
xab
xac
xad
[root@3 tmp]# rm -rf x*
[root@3 tmp]# split -b 100 1.txt adai.  
能夠指定文件前綴！
[root@3 tmp]# ls
adai.aa 
adai.ab
adai.ac
adai.ad

eg2： 指定行數

[root@3 tmp]# wc -l 1.txt
20 1.txt
[root@3 tmp]# split -l 5 1.txt
[root@3 tmp]# ls
xaa
xab
xac
xad
[root@3 tmp]# wc -l x*
  5 xaa
  5 xab
  5 xac
  5 xad
 20 總用量

8.13 shell特殊符號（下）

命令鏈接符： 「||」、「&&」、「;」

command1 ; command2 ：無論command1是否執行成功都會執行command2
command1 && command2 ：只有command1執行成功後纔會執行command2
command1 || command2 ：表示command1執行成功後，command2不執行，不然執行command2

相關標籤/搜索

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。