linux uniq sort 排重、排序

阿新 • • 發佈：2019-02-07

有如下檔案a.txt

[[email protected]] /ftproot# cat a.txt
ttt|000001
uuu|000002
uuu|000002
uuu|000002
uuu|000002
1
2
3
4
5
6
7
77
8
9
9

=====================================

#cat a.txt | uniq -c -i | sort -k2 -n 排重，排重輸出的第二列正序排列
#cat a.txt | uniq -c -i | sort -k2 -rn 排重，排重輸出的第二列逆序排列

uniq 引數解釋

-c 統計重複數量

     -c      Precede each output line with the count of the number of times
             the line occurred in the input, followed by a single space.

     -d      Only output lines that are repeated in the input.

     -f num Ignore the first num fields in each input line when doing compar-
             isons. A field is a string of non-blank characters separated
             from adjacent fields by blanks. Field numbers are one based,
             i.e., the first field is field one.

     -s chars
             Ignore the first chars characters in each input line when doing
             comparisons. If specified in conjunction with the -f option, the
             first chars characters after the first num fields will be
             ignored. Character numbers are one based, i.e., the first char-
             acter is character one.

     -u      Only output lines that are not repeated in the input.

-i Case insensitive comparison of lines.

=============================================================================

linux關於sort命令的高階用法（按多個列值進行排列）

如果單純地使用sort按行進行排序比較簡單，

但是使用sort按多個列值排列，同時使用tab作為分隔符，而且對於某些列需要進行逆序排列，這樣sort命令寫起來就比較麻煩了

比如下面的檔案內容，使用[TAB]進行分割:

Group-ID   Category-ID   Text        Frequency
----------------------------------------------
200        1000          oranges     10
200        900           bananas     5
200        1000          pears       8
200        1000          lemons      10
200        900           figs        4
190        700           grapes      17

下面使用這些列進行排序（列4在列3之前進行排序，而且列4是逆序排列）

    * Group ID (integer)
    * Category ID (integer)
    * Frequency “sorted in reverse order” (integer)
    * Text (alpha-numeric)

排序後的結果應該為：

Group-ID   Category-ID   Text        Frequency
----------------------------------------------
190        700           grapes      17
200        900           bananas     5
200        900           figs        4
200        1000          lemons      10
200        1000          oranges     10
200        1000          pears       8

可以直接使用sort命令來解決這個問題：

BASH CODE

sort -t $'\t' -k 1n,1 -k 2n,2 -k4rn,4 -k3,3 <my-file>

解釋如下：

-t $'\t'：指定TAB為分隔符
-k 1, 1: 按照第一列的值進行排序，如果只有一個1的話，相當於告訴sort從第一列開始直接到行尾排列
n:代表是數字順序，預設情況下市字典序，如10<2
r: reverse 逆序排列，預設情況下市正序排列

所以最後的命令：sort -t $’\t’ -k 1n,1 -k 2n,2 -k4rn,4 -k3,3 my-file

linux uniq sort 排重、排序

linux uniq sort 排重、排序

elastic search6.2.2 實現用戶搜索記錄查詢（去重、排序）

計算機網路實驗（二）之Wireshark抓包分析獲取URL列表（去重、排序、統計）

Linux Shell -- sort(按照指定列排序)

List去重、排序操作

c++中set的使用：初始化和去重、排序

linux Shell sort按照指定列排序

Python中的列表，元祖，集合，字典，字串進行去重、排序、翻轉操作

Linux命令去重統計排序（awk命令去重，sort, uniq命令去重統計）

Linux中 sort、uniq、wc、cut 隨筆

Linux學習——管道命令、文字提取命令、排序命令、雙向重導向、字元轉換命令、分割命令、引數代換

linux中的統計、排序之sort

sort對輸出行排序排重

Hadoop—MapReduce練習（資料去重、資料排序、平均成績、倒排索引）

Linux awk+uniq+sort 統計檔案中某字串出現次數並排序

ACM:一種排序（操作符過載、vector排重）

Linux學習筆記之管道、重定向與正則表達式

Linux系統中查找、刪除重復文件，釋放磁盤空間。

quick sort / quick select sort 快排 / 快速選擇排序

JAVA實現冒泡、歸併、希爾、堆排、快速、插入、簡單選擇、排序演算法

linux uniq sort 排重、排序

相關推薦