python 正則表示式語法學習筆記

阿新 • • 發佈：2020-02-27

正則表示式(regular expression)描述了一種字串匹配的模式（pattern），可以用來檢查一個串是否含有某種子串、將匹配的子串替換或者從某個串中取出符合某個條件的子串等。

Python 自1.5版本起增加了re 模組，它提供 Perl 風格的正則表示式模式。

re 模組使 Python 語言擁有全部的正則表示式功能。

compile 函式根據一個模式字串和可選的標誌引數生成一個正則表示式物件。該物件擁有一系列方法用於正則表示式匹配和替換。

本文重點給大家介紹python 正則表示式語法。

The special characters are:
"." Matches any character except a newline.

"^" Matches the start of the string.
"$" Matches the end of the string or just before the newline at
the end of the string.
"*" Matches 0 or more (greedy) repetitions of the preceding RE.
Greedy means that it will match as many repetitions as possible.
"+" Matches 1 or more (greedy) repetitions of the preceding RE.

"?" Matches 0 or 1 (greedy) of the preceding RE.
*?,+?,?? Non-greedy versions of the previous three special characters.
{m,n} Matches from m to n repetitions of the preceding RE.
{m,n}? Non-greedy version of the above.
"\\" Either escapes special characters or signals a special sequence.

[] Indicates a set of characters.
A "^" as the first character indicates a complementing set.
"|" A|B,creates an RE that will match either A or B.
(...) Matches the RE inside the parentheses.
The contents can be retrieved or matched later in the string.
(?aiLmsux) Set the A,I,L,M,S,U,or X flag for the RE (see below).
(?:...) Non-grouping version of regular parentheses.
(?P<name>...) The substring matched by the group is accessible by name.
(?P=name) Matches the text matched earlier by the group named name.
(?#...) A comment; ignored.
(?=...) Matches if ... matches next,but doesn't consume the string.
(?!...) Matches if ... doesn't match next.
(?<=...) Matches if preceded by ... (must be fixed length).
(?<!...) Matches if not preceded by ... (must be fixed length).
(?(id/name)yes|no) Matches yes pattern if the group with id/name matched,
the (optional) no pattern otherwise.

The special sequences consist of "\\" and a character from the list
below. If the ordinary character is not on the list,then the
resulting RE will match the second character.
\number Matches the contents of the group of the same number.
\A Matches only at the start of the string.
\Z Matches only at the end of the string.
\b Matches the empty string,but only at the start or end of a word.
\B Matches the empty string,but not at the start or end of a word.
\d Matches any decimal digit; equivalent to the set [0-9] in
bytes patterns or string patterns with the ASCII flag.
In string patterns without the ASCII flag,it will match the whole
range of Unicode digits.
\D Matches any non-digit character; equivalent to [^\d].
\s Matches any whitespace character; equivalent to [ \t\n\r\f\v] in
bytes patterns or string patterns with the ASCII flag.
In string patterns without the ASCII flag,it will match the whole
range of Unicode whitespace characters.
\S Matches any non-whitespace character; equivalent to [^\s].
\w Matches any alphanumeric character; equivalent to [a-zA-Z0-9_]
in bytes patterns or string patterns with the ASCII flag.
In string patterns without the ASCII flag,it will match the
range of Unicode alphanumeric characters (letters plus digits
plus underscore).
With LOCALE,it will match the set [0-9_] plus characters defined
as letters for the current locale.
\W Matches the complement of \w.
\\ Matches a literal backslash.

This module exports the following functions:
match Match a regular expression pattern to the beginning of a string.
fullmatch Match a regular expression pattern to all of a string.
search Search a string for the presence of a pattern.
sub Substitute occurrences of a pattern found in a string.
subn Same as sub,but also return the number of substitutions made.
split Split a string by the occurrences of a pattern.
findall Find all occurrences of a pattern in a string.
finditer Return an iterator yielding a match object for each match.
compile Compile a pattern into a RegexObject.
purge Clear the regular expression cache.
escape Backslash all non-alphanumerics in a string.

Some of the functions in this module takes flags as optional parameters:
A ASCII For string patterns,make \w,\W,\b,\B,\d,\D
match the corresponding ASCII character categories
(rather than the whole Unicode categories,which is the
default).
For bytes patterns,this flag is the only available
behaviour and needn't be specified.
I IGNORECASE Perform case-insensitive matching.
L LOCALE Make \w,dependent on the current locale.
M MULTILINE "^" matches the beginning of lines (after a newline)
as well as the string.
"$" matches the end of lines (before a newline) as well
as the end of the string.
S DOTALL "." matches any character at all,including the newline.
X VERBOSE Ignore whitespace and comments for nicer looking RE's.
U UNICODE For compatibility only. Ignored for string patterns (it
is the default),and forbidden for bytes patterns.

python 正則表示式語法學習筆記

下面看下正則表示式匹配的流程：

python 正則表示式語法學習筆記

正則表示式的大致匹配過程是：依次拿出表示式和文字中的字元比較，如果每一個字元都能匹配，則匹配成功；一旦有匹配不成功的字元則匹配失敗。如果表示式中有量詞或邊界，這個過程會稍微有一些不同，但也是很好理解的，自己多使用幾次就能明白。

總結

到此這篇關於python 正則表示式語法記錄的文章就介紹到這了,更多相關python 正則表示式語法記錄內容請搜尋我們以前的文章或繼續瀏覽下面的相關文章希望大家以後多多支援我們！

python 正則表示式語法學習筆記

正則不能輸入特殊字元_正則表示式語法學習和線上練習

技術標籤：正則不能輸入特殊字元標題: 正則表示式語法學習和線上練習作者: 夢幻之心星[email protected]標籤: [#正則表示式,#語法,#學習,#練習]目錄: [語法]日期: 2021-01-26

【自動化測試學習筆記】Python正則表示式

前言正則表示式是一個特殊的字元序列，它能幫助你方便的檢查一個字串是否與某種模式匹配。

python正則表示式學習筆記

技術標籤：學習筆記正則表示式python python正則表示式學習筆記 import re包 match方法：

Python正則表示式學習小例子

正則表示式是處理字串的強大工具。作為一個概念而言，正則表示式對於Python來說並不是獨有的。但是，Python中的正則表示式在實際使用過程中還是有一些細小的差別。

python正則表示式筆記——匹配多行多段

技術標籤：python自動化運維工具正則表示式python運維linux 場景分析使用python正則表示式提取某段中多行內容，例如： ‘’’ aaaa bbbb cccc xx abcdefg abcdefg abc yy abc abde adf dadfeljgslka lkdsjgls

Python正則表示式匹配字串中的數字

1.使用“\\d+”匹配全數字程式碼： import re zen = \"Arizona 479,501,870. Carlifornia 209,213,650.\"

python 正則表示式引數替換例項詳解

正則表示式是一個特殊的字元序列，它能幫助你方便的檢查一個字串是否與某種模式匹配。

python入門之基礎語法學習筆記

Python 中文編碼 Python 檔案中如果未指定編碼，在執行過程會出現報錯： Python中預設的編碼格式是 ASCII 格式，在沒修改編碼格式時無法正確列印漢字，所以在讀取中文時會報錯。

python正則表示式例項程式碼

re 模組使 Python 語言擁有全部的正則表示式功能。會用到的語法正則字元釋義

學會Python正則表示式，就看這20個例子(指令碼之家修正版)

一文秒懂python正則表示式常用函式

導讀：正則表示式是處理字串型別的\"核武器\"，不僅速度快，而且功能強大。本文不過多展開正則表示式相關語法，僅簡要介紹 python中正則表示式常用函式及其使用方法，以作快速查詢瀏覽。

SQL Anywhere正則表示式語法與示例

正則表示式語法通過 SIMILAR TO 和 REGEXP 搜尋條件以及 REGEXP_SUBSTR 函式支援正則表示式。對於 SIMILAR TO，正則表示式語法符合 ANSI/ISO SQL 標準。對於 REGEXP 和 REGEXP_SUBSTR，正則表示式的語法和支援符合

Python正則表示式如何匹配中文

用 \'[\\u4e00-\\u9fa5]‘ 匹配中文在字串中匹配中文示例：匹配字串中的第一個中文字元

Python正則表示式高階使用方法彙總

正則表示式是一個以簡單直觀的方式匹配指定文字資訊從而達到查詢、替換等操作的目的。正則表示式以其簡單而高效的特點使得其在資料分析和資料驗證方面應用廣泛。

python正則表示式的懶惰匹配和貪婪匹配說明

第一次碰到這個問題的時候，確實不知道該怎麼辦，後來請教了一個大神，加上自己的理解，才瞭解是什麼意思，這個東西寫python的會經常用到，而且會特別頻繁，在此寫一篇部落格，希望可以幫到一些朋友。

RE正則表示式-語法

一文搞定Python正則表示式

本文對正則表示式和Python中的re模組進行詳細講解很多人學習python，不知道從何學起。很多人學習python，掌握了基本語法過後，不知道在哪裡尋找案例上手。很多已經做案例的人，卻不知道如何去學習更加高深的知識。那

python正則表示式

python正則表示式 1. 正則表示式基礎 1.1. 簡單介紹正則表示式並不是Python的一部分。正則表示式是用於處理字串的強大工具，擁有自己獨特的語法以及一個獨立的處理引擎，效率上可能不如str自帶的方法，但功能十分

python 正則表示式用法

做資料清洗時，經常需要用到正則表示式替換等等首先介紹一下元字元（匹配規則）

python 正則表示式語法學習筆記

相關推薦