Python標準庫(3.x): itertools庫掃盲

阿新 • • 發佈：2018-02-25

有一個參數 ron repl gpo 位置 AC zip() drop

**itertools functions**
accumulate()	compress()	groupby()	starmap()
chain()	count()	islice()	takewhile()
chain.from_iterable()	cycle()	permutations()	tee()
combinations()	dropwhile()	product()	zip_longest()
combinations_with_replacement()	filterfalse()	repeat()

itertools.accumulate(iterable [, func]

)

　　返回一個叠代序列的累加值序列（沒有func的情況下）。

　　當指定了func（參數必須為兩個）後，將通過func進行累加。

　　註1: 當沒有傳入func時，func相當於 operator.add

　　註2: 返回值為叠代器

>>> data = [1,2,3,4]
>>> a = itertools.accumulate(data)
>>> list(a)
[1, 3, 6, 10]
#[1,2,3,4] --> [1, 1+2, (1+2)+3, ((1+2)+3)+4]

>>> b = itertools.accumulate(data, operator.mul)
 
>>> list(b)
[1, 2, 6, 24]
#[1,2,3,4] --> [1, 1*2, (1*2)*3, ((1*2)*3)*4]

itertools.chain(*iterables)

　　連接多個叠代序列為一個叠代序列，適用於需要連續遍歷多個序列場景。

　　註`: 返回值為叠代器

>>> a = [1,2,3,4,5]
>>> b = [6,7,8,9,10]
>>> c = itertools.chain(a,b)
>>> list(c)
[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

itertools.chain.from_iterable(iterable)

　　通過一個叠代序列來創建 itertools.chain 的對象。

　　類似於將叠代序列中的每一個對象作為 itertools.chain 的參數，因此傳入的叠代序列中的每一個對象應該也是可叠代的。

　　註1: 返回值為叠代器

>>> a = itertools.chain.from_iterable([‘abc‘, ‘def‘, ‘hjk‘])
>>> list(a)
[‘a‘, ‘b‘, ‘c‘, ‘d‘, ‘e‘, ‘f‘, ‘h‘, ‘j‘, ‘k‘]
>>>
>>> b = itertools.chain.from_iterable([1,2,3])
>>> list(b)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: ‘int‘ object is not iterable

itertools.combinations(iterable, r)

　　將叠代序列中的對象進行"不重復的"組合並返回所有組合的元組列表，每個組合的元素個數為r。

　　註1: 這裏的“不重復”是指叠代序列中的對象不會使用多次，但並不代表相同的值不會使用多次。

　　註2: 返回的組合順序依賴傳入的叠代序列中的順序。

　　註3: 返回值為叠代器。

>>> a = itertools.combinations(‘ABC‘,2)
>>> list(a)
[(‘A‘, ‘B‘), (‘A‘, ‘C‘), (‘B‘, ‘C‘)]
>>>
>>> b = itertools.combinations(‘CBA‘,2)
>>> list(b)
[(‘C‘, ‘B‘), (‘C‘, ‘A‘), (‘B‘, ‘A‘)]
>>>
>>> c = itertools.combinations(‘AAC‘,2)
>>> list(c)
[(‘A‘, ‘A‘), (‘A‘, ‘C‘), (‘A‘, ‘C‘)]

itertools.combinations_with_replacement(iterable, r)

　　將叠代序列中的對象進行"可重復的"組合並返回所有組合的元組列表，每個組合的元素個數為r。

　　註1: 與 itertools.combinations 的唯一區別就是元素可以重復使用。

　　註2: 返回的組合順序依賴傳入的叠代序列中的順序。

　　註3: 返回值為叠代器

>>> a = itertools.combinations_with_replacement(‘ABC‘, 2)
>>> list(a)
[(‘A‘, ‘A‘), (‘A‘, ‘B‘), (‘A‘, ‘C‘), (‘B‘, ‘B‘), (‘B‘, ‘C‘), (‘C‘, ‘C‘)]

itertools.compress(data, selectors)

　　對 data 中的數據進行過濾，只保留 selectors 中對應位置為 True 的對象。

　　data和selectors的序列長度可以不等，其中任意一個叠代終結，整個叠代即終結。

　　註1: 返回值為叠代器

>>> a = itertools.compress(‘ABCDE‘, [1,1,0,0,0])
>>> list(a)
[‘A‘, ‘B‘]
>>>
>>> b = itertools.compress(‘ABCDE‘, [1,1])
>>> list(b)
[‘A‘, ‘B‘]
>>>
>>> c = itertools.compress(‘ABC‘, [1,1,0,0,1])
>>> list(c)
[‘A‘, ‘B‘]

itertools.count(start=0, step=1)

　　生成一個計數叠代器，可以指定起始點和步進，但是沒有終點，可以一直叠代下去。

　　一般需要配合其他的叠代器一起使用，例如作為map(), zip()的參數等。

>>> a = itertools.count(start=1, step=2)
>>> next(a)
1
>>> next(a)
3
>>> next(a)
5
>>> next(a)
7
>>> next(a)
9
>>> next(a)
11
>>> next(a)
13
>>> 
>>> b = itertools.count(start=100, step=1)
>>> list(zip(b, ‘ABCDE‘))
[(100, ‘A‘), (101, ‘B‘), (102, ‘C‘), (103, ‘D‘), (104, ‘E‘)]

itertools.cycle(iterable)

　　生成一個循環叠代器，循環遍歷傳入叠代器中的對象，沒有終結。

　　一般需要配合其他叠代器一起使用，例如map(), zip() 的參數等

>>> a = itertools.cycle(‘ABC‘)
>>> next(a)
‘A‘
>>> next(a)
‘B‘
>>> next(a)
‘C‘
>>> next(a)
‘A‘
>>> next(a)
‘B‘
>>> next(a)
‘C‘
>>> next(a)
‘A‘
>>> 
>>> b = itertools.cycle(range(1,4))
>>> list(zip(‘ABCDEFG‘, b))
[(‘A‘, 1), (‘B‘, 2), (‘C‘, 3), (‘D‘, 1), (‘E‘, 2), (‘F‘, 3), (‘G‘, 1)]

itertools.dropwhile(predicate, iterable)

　　對叠代器中的對象按照 predicate 進行斷言，丟棄第一個斷言為False之前的所有對象。

　　也可以理解為從第一個斷言為False的對象開始輸出。

　　註1: 當出現第一個斷言為False的對象後，之後的對象不再進行斷言。

　　註2: predicate 代表的函數只能有一個參數。

　　註3: 返回值為叠代器

>>> a = itertools.dropwhile(lambda x: x<5, [3,4,5,6,5,4,3])
>>> list(a)
[5, 6, 5, 4, 3]

itertools.filterfalse(predicate, iterable)

　　過濾掉叠代器中按照 predicate 斷言為 True 的對象。

　　如果 predicate 傳入None, 則過濾掉值為 True 的對象。

　　註1: 返回值為叠代器

>>> a = itertools.filterfalse(lambda x: x%2==0, range(10))
>>> list(a)
[1, 3, 5, 7, 9]
>>> 
>>> b = itertools.filterfalse(None, [1,0,1,0,1,0])
>>> list(b)
[0, 0, 0]

itertools.groupby(iterable, key=None)

　　對叠代序列中的對象按照key進行分組，如果key為None則按照對象本身的值進行分組。

　　註1: 如果叠代序列中key值相等的對象中間間隔了其他的key值，則不會分在一個組。

　　註2: 返回值為一個叠代器且返回的是一個有兩個元素的元組，第一個元素為key值，第二個元素為分組對象的叠代器

>>> data = [‘abc-0‘, ‘def-0‘, ‘xyz-1‘, ‘tty-1‘, ‘kkk-2‘]
>>> a = itertools.groupby(data, lambda x:x[-1])
>>> [(k, list(g)) for k, g in a]
[(‘0‘, [‘abc-0‘, ‘def-0‘]), (‘1‘, [‘xyz-1‘, ‘tty-1‘]), (‘2‘, [‘kkk-2‘])]
>>> 
>>> 
>>> b = itertools.groupby(‘AAABBBCC‘)
>>> [(k, list(g)) for k, g in b]
[(‘A‘, [‘A‘, ‘A‘, ‘A‘]), (‘B‘, [‘B‘, ‘B‘, ‘B‘]), (‘C‘, [‘C‘, ‘C‘])]

itertools.islice(iterable, stop) itertools.islice(iterable, start, stop [, step])

　　對叠代序列進行分片，類似 slice()，但是本函數中 start, stop, step 都不能為負數。

　　參數 start 如果為 None, 則 start = 0

　　參數 stop 如果為 None, 則叠代到最後一個

　　參數 step 如果為 None, 則 step = 1

　　註1: 返回值為一個叠代器

>>> data = ‘ABCDEFG‘
>>> a = itertools.islice(data, 3)
>>> list(a)
[‘A‘, ‘B‘, ‘C‘]
>>> 
>>> b = itertools.islice(data, 1, 5, 2)
>>> list(b)
[‘B‘, ‘D‘]
>>> 
>>> c = itertools.islice(data, None, 3)
>>> list(c)
[‘A‘, ‘B‘, ‘C‘]
>>> 
>>> d = itertools.islice(data, 3, None)
>>> list(d)
[‘D‘, ‘E‘, ‘F‘, ‘G‘]

itertools.permutations(iterable, r=None)

　　將叠代序列中的對象進行"不重復的"排列組合並返回所有組合的元組列表，每個組合的元素個數為r。

　　如果r為None，則長度為叠代序列的長度。

　　註1: 這裏的“不重復”是指叠代序列中的對象不會使用多次，但並不代表相同的值不會使用多次。

　　註2: 返回的組合順序依賴傳入的叠代序列中的順序。

　　註3: 返回值為叠代器。

>>> a = itertools.permutations(‘ABC‘, 2)
>>> list(a)
[(‘A‘, ‘B‘), (‘A‘, ‘C‘), (‘B‘, ‘A‘), (‘B‘, ‘C‘), (‘C‘, ‘A‘), (‘C‘, ‘B‘)]
>>> 
>>> b = itertools.permutations(‘ABC‘)
>>> list(b)
[(‘A‘, ‘B‘, ‘C‘), (‘A‘, ‘C‘, ‘B‘), (‘B‘, ‘A‘, ‘C‘), (‘B‘, ‘C‘, ‘A‘), (‘C‘, ‘A‘, ‘B‘), (‘C‘, ‘B‘, ‘A‘)]

itertools.product(*iterables, repeat=1)

　　返回多個叠代序列的笛卡爾乘積，repeat值相當於把傳入的叠代器參數重復的次數。

　　註1: 返回值是一個叠代器

>>> a = itertools.product(‘ABCD‘, ‘xy‘)
>>> list(a)
[(‘A‘, ‘x‘), (‘A‘, ‘y‘), (‘B‘, ‘x‘), (‘B‘, ‘y‘), (‘C‘, ‘x‘), (‘C‘, ‘y‘), (‘D‘, ‘x‘), (‘D‘, ‘y‘)]
>>> 
>>> b = itertools.product(range(2), repeat=3)
>>> list(b)
[(0, 0, 0), (0, 0, 1), (0, 1, 0), (0, 1, 1), (1, 0, 0), (1, 0, 1), (1, 1, 0), (1, 1, 1)]
# 相當於 itertools.product(range(2), range(2), range(2))

itertools.repeat(object [, times])

　　返回一個叠代器，重復傳入的對象。重復的次數為 times 。

　　如果沒有傳入times參數，則無限重復。

>>> a = itertools.repeat(‘hello‘, 3)
>>> list(a)
[‘hello‘, ‘hello‘, ‘hello‘]
>>> 
>>> b = itertools.repeat(‘test‘)
>>> list(map(lambda x, y: x + y, b, ‘ABCD‘))
[‘testA‘, ‘testB‘, ‘testC‘, ‘testD‘]

itertools.starmap(function, iterable)

　　和 map() 類似。但是這裏 function 的參數封裝在叠代器中的每一個對象中。

　　註1: 叠代器中的每一個對象也必須是可叠代的，哪怕函數只有一個參數。

>>> a = itertools.starmap(lambda x,y: x**y, [(2,1), (2,2), (2,3)])
>>> list(a)
[2, 4, 8]
>>> 
>>> b = itertools.starmap(lambda x: x*x, [(1,),(2,),(3,)])
>>> list(b)
[1, 4, 9]

itertools.takewhile(predicate, iterable)

　　與 dropwhile() 相反，對叠代器中的對象按照 predicate 進行斷言，輸出第一個斷言為False之前的所有對象。

　　註1: 當出現第一個斷言為False的對象後，叠代即終止。

　　註2: predicate 代表的函數只能有一個參數。

　　註3: 返回值為叠代器

>>> a = itertools.takewhile(lambda x: x<5, [3,4,5,6,5,4,3])
>>> list(a)
[3, 4]

itertools.tee(iterable, n=2)

　　將一個叠代器復制n次，返回一個有n個叠代器的元組。n默認為2

>>> a = itertools.tee(‘ABC‘)
>>> [list(x) for x in a]
[[‘A‘, ‘B‘, ‘C‘], [‘A‘, ‘B‘, ‘C‘]]
>>> 
>>> b = itertools.tee(range(5), 3)
>>> [list(x) for x in b]
[[0, 1, 2, 3, 4], [0, 1, 2, 3, 4], [0, 1, 2, 3, 4]]

itertools.zip_longest(*iterables, fillvalue=None)

　　類似於 zip()。但是這裏按照最長的叠代序列進行打包，缺少的元素用 fillvalue 的值進行填充。

　　註1: fillvalue 默認為None, 並且如果是None，填充的就是None

>>> a = itertools.zip_longest(‘ABC‘, ‘xy‘, fillvalue=‘*‘)
>>> list(a)
[(‘A‘, ‘x‘), (‘B‘, ‘y‘), (‘C‘, ‘*‘)]
>>>
>>> b = itertools.zip_longest(‘ABC‘, ‘xy‘)
>>> list(b)
[(‘A‘, ‘x‘), (‘B‘, ‘y‘), (‘C‘, None)]

Python標準庫(3.x): itertools庫掃盲

有一個參數 ron repl gpo 位置 AC zip() drop itertools functions accumulate() compress() groupby() starmap() chain() count() islice() takewhi

python爬蟲系列(3.2-lxml庫的使用)

一、基本介紹 1、lxml 是一個HTML/XML的解析器，主要的功能是如何解析和提取 HTML/XML 資料。 2、lxml和正則一樣，也是用 C 實現的，是一款高效能的 Python HTML/XML 解析器，我們可

python學習（3）Urllib庫的基本使用

Urllib是Python內建的HTTP請求庫 urllib.request 請求模組 urllib.error 異常處理模組 urllib.parse url解析模組 urllib.robotparser

CentOS 7.x下升級Python版本到3.x系列(新老版本共存)

由於python官方已宣佈2.x系列即將停止支援，為了向前看，我們升級系統的python版本為3.x系列伺服器系統為當前最新的CentOS 7.4 1.安裝前檢視當前系統下的python版本號 # python -V 2.獲取python3.x的官方軟體包 # wget https:/

CentOS 7下升級Python版本到3.x系列

由於python官方已宣佈2.x系列即將停止支援，為了向前看，我們升級系統的python版本為3.x系列伺服器系統為當前最新的CentOS 7.4 1.安裝前檢視當前系統下的python版本號 # python -V 2.獲取python3.x的官方軟體包 # wget https://www.python

安裝pip3 以及將Linux下的Python更改為3.x

上次切換了Python2和Python3。但是Python3並沒有pip，所有在Python3下不能安裝包。更改Python的版本：將Linux系統預設的Python2.x 更改為Python3.x 首先在終

Python 3.x標準模組庫目錄

文字 1. string：通用字串操作 2. re：正則表示式操作 3. difflib：差異計算工具 4. textwrap：文字填充 5. unicodedata：Unicode字元資料庫 6. stringprep：網際網路字串準備工具 7. readli

Python標準庫--itertools模塊

end col map class 條件停止 -- rtm 共享 itertools模塊：處理可叠代對象 chain()和islice()、tee() chain：合並叠代器 islice：切割叠代器，start，end，step tee：復制叠代器，新叠代器共享輸入叠

Python標準庫筆記(10) — itertools模塊

構造 values tools multi 生成 TE product and map() itertools 用於更高效地創建叠代器的函數工具。 itertools 提供的功能受Clojure，Haskell，APL和SML等函數式編程語言的類似功能的啟發。它們的目的

python標準庫----itertools

python標準庫系列教程(一)——itertools 01 宣告 functools, itertools, operator是Python標準庫為我們提供的支援函數語言程式設計的三大模組，合理的使用這三個模組，我們可以寫出更加簡潔可讀的Pythonic程式碼，本次的系列文章將介紹並使

Python標準庫—Itertools

Itertools模組官方描述： Functional tools for creating and using iterators.即用於建立高效迭代器的函式。 itertools用於高效迴圈的迭代函式集合。迭代器迭代器（生成器）在Python中是一種很常用也很好用的資料結構

Python標準庫內建函式hex x

本函式是轉換一個整數物件為十六進位制的字串表示，比如像0x的格式。如果物件不是一個整數，應定義一個方法___index__()返回整數。如果想把本函式的結果轉換為整數型別，需要int()函式，並且使用基數為16的方式轉換。另浮點數轉換為十六進位制表示需要使用float.hex()來轉換，而不能使用本函式

Python標準庫內置函數hex x

幽默 cnblogs 進制 mil div 人工智能 times family href 本函數是轉換一個整數對象為十六進制的字符串表示，比如像0x的格式。如果對象不是一個整數，應定義一個方法___index__()返回整數。如果想把本函數的結果轉換為整數類型，需要int

Python標準庫--Scope

sda1 模塊簡介你一定在很多計算機科學課程上聽說過作用域。它很重要，如果你不理解它的工作原理，那麽就會出現一些令人困惑的錯誤。作用域最基本的功能就是告訴編譯器一個變量什麽時候是可見的。也就是說，作用域定義了你使用變量的時間和範圍。當你嘗試使用一些不在當前作用域的變量時，你就會得到NameError。Pyth

Python標準庫--string模塊

err 分隔 xca provide python 變量 dog upper miss string中包含了處理文本的常量和模板常量 print(string.whitespace) print(string.ascii_lowercase) print(string.

Python標準庫--textwrap模塊

給定 fix rip 調整 wrap 標準庫 wrapper dede 換行符 textwrap通過調整換行符的位置來格式化文本 __all__ = [‘TextWrapper‘, ‘wrap‘, ‘fill‘, ‘dedent‘, ‘indent‘, ‘shorten‘

Python標準庫--re模塊

spa 編程斜杠不能當前對象需要 sum pri re:正則表達式 __all__ = [ "match", "fullmatch", "search", "sub", "subn", "split", "findall", "finditer"

python標準庫之【socket】

yun lock .cn 函數返回 targe ddr 是個進程間的通信 log socket通常也稱作”套接字“。網絡上的兩個程序通過一個雙向的通信連接實現數據的交換，這個連接的一端稱為一個socket。socket 是網絡連接端點。例如當你的W

[python標準庫]Logging模塊

post 日誌信息 tin 方式 asc dha event 如果 bytes 1.模塊簡介　　logging模塊是Python內置的標準模塊，主要用於輸出運行日誌，可以設置輸出日誌的等級、日誌保存路徑、日誌文件回滾等；相比print，具備如下優點：可以通過設置不同的

Python標準庫：內置函數all(iterable)

blog ack div class pos true pop 使用實現假設可叠代的對象的所有元素所有非空（或者空叠代對象），就返回True。這個函數主要用來推斷列表、元組、字典等對象是否有空元素。比方有10000個元素的列表，假設沒有提供此函數，須要使用循環來實現

Python標準庫(3.x): itertools庫掃盲

itertools.accumulate(iterable [, func] )

itertools.chain(*iterables)

itertools.chain.from_iterable(iterable)

itertools.combinations(iterable, r)

itertools.combinations_with_replacement(iterable, r)

itertools.compress(data, selectors)

itertools.count(start=0, step=1)

itertools.cycle(iterable)

itertools.dropwhile(predicate, iterable)

itertools.filterfalse(predicate, iterable)

itertools.groupby(iterable, key=None)

itertools.islice(iterable, stop) itertools.islice(iterable, start, stop [, step])

itertools.permutations(iterable, r=None)

itertools.product(*iterables, repeat=1)

itertools.repeat(object [, times])

itertools.starmap(function, iterable)

itertools.takewhile(predicate, iterable)

itertools.tee(iterable, n=2)

itertools.zip_longest(*iterables, fillvalue=None)

相關推薦

itertools.accumulate(iterable [, func]

)