linux awk數組相關操做介紹

時間 2019-12-02

標籤 linux awk 數組相關介紹欄目 Linux 简体版

原文原文鏈接

用awk進行文本處理，少不了就是它的數組處理。那麼awk數組有那些特色，通常常見運算又會怎麼樣呢。咱們先看下如下的一些介紹，結合樣例咱們會解說下它的不一樣之處。在 awk 中數組叫作關聯數組(associative arrays)，因爲下標記可以是數也可以是串。awk 中的數組沒必要提早聲明，也沒必要聲明大小。數組元素用 0 或空串來初始化，這依據上下文而定。好比：數組

1、定義方法函數

1：可以用數值做數組索引(下標)spa

Tarray[1]=「cheng mo」
Tarray[2]=「800927」

2：可以用字符串做數組索引(下標)排序

Tarray[「first」]=「cheng 」
Tarray[「last」]=」mo」
Tarray[「birth」]=」800927」

使用中 print Tarray[1] 將獲得」cheng mo」而 print Tarray[2] 和 print[「birth」] 都將獲得」800927」。索引

2、數組相關函數ci

[chengmo@localhost ~]$ awk --version
GNU Awk 3.1.5字符串

使用版本號是：3.1以上，不一樣版本號如下函數不必定一樣it

獲得數組長度（length方法使用）

[chengmo@localhost ~]$ awk 'BEGIN{info="it is a test";lens=split(info,tA," ");print length(tA),lens;}'
4 4io

length返回字符串以及數組長度，split進行切割字符串爲數組，也會返回切割獲得數組長度。ast

(asort使用）:

[chengmo@localhost ~]$ awk 'BEGIN{info="it is a test";split(info,tA," ");print asort(tA);}'
4

asort對數組進行排序，返回數組長度。

輸出數組內容(無序，有序輸出）：

[chengmo@localhost ~]$ awk 'BEGIN{info="it is a test";split(info,tA," ");for(k in tA){print k,tA[k];}}'
4 test
1 it
2 is
3 a

for…in 輸出，因爲數組是關聯數組，默認是無序的。因此經過for…in 獲得是無序的數組。假設需要獲得有序數組，需要經過下標得到。

[chengmo@localhost ~]$ awk 'BEGIN{info="it is a test";tlen=split(info,tA," ");for(k=1;k<=tlen;k++){print k,tA[k];}}'
1 it
2 is
3 a
4 test

注意：數組下標是從1開始，與c數組不同。

推斷鍵值存在以及刪除鍵值：

一個錯誤的推斷方法：

[chengmo@localhost ~]$ awk 'BEGIN{tB["a"]="a1";tB["b"]="b1";if(tB["c"]!="1"){print "no found";};for(k in tB){print k,tB[k];}}'
no found
a a1
b b1
c

以上出現奇怪問題，tB[「c」]未定義，但是循環時候，發現已經存在該鍵值，它的值爲空，這裏需要注意，awk數組是關聯數組，僅僅要經過數組引用它的key，就會本身主動建立改序列.

正確推斷方法：

[chengmo@localhost ~]$ awk 'BEGIN{tB["a"]="a1";tB["b"]="b1";if( "c" in tB){print "ok";};for(k in tB){print k,tB[k];}}'
a a1
b b1

if(key in array) 經過這個方法推斷數組中是否包括」key」鍵值。

刪除鍵值：

[chengmo@localhost ~]$ awk 'BEGIN{tB["a"]="a1";tB["b"]="b1";delete tB["a"];for(k in tB){print k,tB[k];}}'
b b1

delete array[key]可以刪除，相應數組key的，序列值。

3、二維數組使用(多維數組使用）

awk的多維數組在本質上是一維數組，更確切一點，awk在存儲上並不支持多維數組。awk提供了邏輯上模擬二維數組的訪問方式。例如，array[2,4] = 1這種訪問是贊成的。awk使用一個特殊的字符串SUBSEP (\034)做爲切割字段，在上面的樣例中，關聯數組array存儲的鍵值其實是2\0344。

類似一維數組的成員測試，多維數組可以使用 if ( (i,j) in array)這種語法，但是下標必須放置在圓括號裏。
類似一維數組的循環訪問，多維數組使用 for ( item in array )這種語法遍歷數組。與一維數組不一樣的是，多維數組必須使用split()函數來訪問單獨的下標份量。split ( item, subscr, SUBSEP)

[chengmo@localhost ~]$ awk 'BEGIN{

for(i=1;i<=9;i++)
{
for(j=1;j<=9;j++)
{
tarr[i,j]=i*j;
print i,"*",j,"=",tarr[i,j];
}
}
}'
1 * 1 = 1
1 * 2 = 2
1 * 3 = 3
1 * 4 = 4
1 * 5 = 5
1 * 6 = 6

……

可以經過array[k,k2]引用得到數組內容.

方法二：

[chengmo@localhost ~]$ awk 'BEGIN{
for(i=1;i<=9;i++)
{
for(j=1;j<=9;j++)
{
tarr[i,j]=i*j;
}
}
for(m in tarr)
{

split(m,tarr2,SUBSEP);
print tarr2[1],"*",tarr2[2],"=",tarr[m];
}
}'