python模式匹配與正則表示式

阿新 • • 發佈：2020-12-13

正則表示式使用方法：

1> 用import re匯入正則表示式模組

2> 用re.compile()函式建立一個正則表示式物件(記得使用原始字串)

3> 向Regex物件的search()方法傳入想查詢的字串。它返回一個Match物件

4> 呼叫Match物件的group()方法，返回實際匹配文字的字串

import re
phoneNumRegex = re.compile(r'\d\d\d-\d\d\d-\d\d\d\d')
mo = phoneNumRegex.search('my number is 425-444-3467')
print('phone number found ' + mo.group())
======================================================
result：
phone number found 425-444-3467

利用括號分組：

import re
phoneNumRegex = re.compile(r'(\d\d\d)-(\d\d\d-\d\d\d\d)')
mo = phoneNumRegex.search('my number is 425-444-3467')
print(mo.group())
print(mo.group(1))
print(mo.group(2))
========================================================
result:
425-444-3467
425
444-3467

用管道匹配多個分組：希望匹配多個表示式中的一個時，可以使用管道

import re
heroRegex = re.compile(r'Batman|Tina Fey')
mo = heroRegex.search('Batman and Tina Fey')
print(mo.group())
mo = heroRegex.search('Tina Fey and Batman')
print(mo.group())
=============================================
result:
Batman
Tina Fey

import re
batRegex = re.compile(r'Bat(man|mobile|copter|bat)')
mo = batRegex.search('Batcopter lost a wheel')
print(mo.group())
print(mo.group(1))
====================================================
result:
Batcopter
copter

用問號實現可選匹配：？表示出現0次或一次

import re
batRegex = re.compile(r'Bat(wo)?man')
mo = batRegex.search('Batman lost a wheel')
print(mo.group())
mo = batRegex.search('Batwoman lost a wheel')
print(mo.group())
=============================================
result:
Batman
Batwoman

用星號匹配零次或多次：

import re
batRegex = re.compile(r'Bat(wo)*man')
mo = batRegex.search('Batman lost a wheel')
print(mo.group())
mo = batRegex.search('Batwoman lost a wheel')
print(mo.group())
mo = batRegex.search('Batwowowowoman lost a wheel')
print(mo.group())
===================================================
result:
Batman
Batwoman
Batwowowowoman

用加號匹配一次或多次：

import re
batRegex = re.compile(r'Bat(wo)+man')
mo = batRegex.search('Batman lost a wheel')
print(mo)
mo = batRegex.search('Batwoman lost a wheel')
print(mo.group())
mo = batRegex.search('Batwowowowoman lost a wheel')
print(mo.group())
==================================================
result:
None
Batwoman
Batwowowowoman

用花括號匹配特定次數：

import re
haRegex = re.compile(r'(Ha){2,3}')
mo = haRegex.search('Ha')
print(mo)
mo = haRegex.search('HaHa')
print(mo.group())
mo = haRegex.search('HaHaHa')
print(mo.group())
mo = haRegex.search('HaHaHaHa')
print(mo.group())
===============================
result:
None
HaHa
HaHaHa
HaHaHa

貪心和非貪心匹配：python預設是貪心匹配，eg:(Ha){3,5}預設以匹配更多的例項為準，可在{3,5}後加?表示使用非貪心匹配

import re
haRegex = re.compile(r'(Ha){2,3}?')
mo = haRegex.search('Ha')
print(mo)
mo = haRegex.search('HaHa')
print(mo.group())
mo = haRegex.search('HaHaHa')
print(mo.group())
mo = haRegex.search('HaHaHaHa')
print(mo.group())
==================================
result:
None
HaHa
HaHa
HaHa

findall方法：

import re
phoneNumRegex = re.compile(r'(\d\d\d)-(\d\d\d-\d\d\d\d)')
mo = phoneNumRegex.search('person01: 425-444-3467, person02: 425-678-4678')
print(mo.group())
mo = phoneNumRegex.findall('person01: 425-444-3467, person02: 425-678-4678')
print(mo)
============================================================================
result:
425-444-3467
[('425', '444-3467'), ('425', '678-4678')]

字元分類：

import re
# 至少一個數字+空格+一個字母
phoneNumRegex = re.compile(r'\d+\s\w+')
mo = phoneNumRegex.findall('8 c, 7 rrr, qqg, 19 tt')
print(mo)
====================================================
result:
['8 c', '7 rrr', '19 tt']

建立自己的字元分類：在縮寫的\d \s \w太寬泛的情況下可以自定義字符集

import re
phoneNumRegex = re.compile(r'[aeiou]')
mo = phoneNumRegex.findall('Hello world')
print(mo)
=========================================
result:
['e', 'o', 'o']

插入字元和美元字元：

1> 插入字元^表示以字串開頭的匹配

2> 美元字元$表示以字串結束的匹配

3> 同時使用^$字元表示整個子串必須匹配模式，如r'^\d+$'表示全是數字

import re
beginRegex = re.compile(r'^Hello')
mo = beginRegex.findall('Hello world')
print(mo)
endRegex = re.compile(r'\d$')
mo = endRegex.findall('Hello world')
print(mo)
mo = endRegex.findall('Hello world4')
print(mo)
beginEndRegex = re.compile(r'^\d+$')
mo = beginEndRegex.findall('45y889')
print(mo)
mo = beginEndRegex.findall('45889')
print(mo)
====================================
result:
['Hello']
[]
['4']
[]
['45889']

通配字元：.表示匹配除了換行以外的任意字元

import re
beginRegex = re.compile(r'.at')
mo = beginRegex.findall('The cat in the hat sat on the first mat')
print(mo)
==================================================================
result:
['cat', 'hat', 'sat', 'mat']

用.*匹配所有字元：

import re
beginRegex = re.compile(r'First Name:(.*) Last Name:(.*)')
mo = beginRegex.search('First Name:Broad Last Name:Cast')
print(mo.group(1))
print(mo.group(2))
==========================================================
result:
Broad
Cast

正則的第二個引數：

1> DOTALL：全部字元，包括換行

2> IGNORECASE：忽略大小寫

3> VERBOSE：忽略空白符和註釋

import re
beginRegex = re.compile(r'(.*)last', re.DOTALL | re.IGNORECASE | re.VERBOSE)
mo = beginRegex.search('First Name:Broad Last Name:Cast'
                       'sdf  sdf fs dffsdf dfdfasdf ')
print(mo.group())
===========================================================================
result:
First Name:Broad Last

正則表示式做替換：sub()函式有兩個引數，第一個引數用於取代發現的匹配字串，第二個引數是匹配的內容

import re
agentRegex = re.compile(r'Agent \w+')
mo = agentRegex.sub('Agent xx', 'Agent Alice gave the secret documents to Agent Bob')
print(mo)
====================================================================================
result:
Agent xx gave the secret documents to Agent xx

python模式匹配與正則表示式

技術標籤：python正則表示式模式匹配正則替換查詢正則表示式使用方法： 1> 用import re匯入正則表示式模組

Python字串與正則表示式詳細介紹

目錄一、字串相關操作二、正則表示式相關操作一、字串相關操作 1.統計所輸入字串中單詞的個數,單詞之間用空格分隔。其執行效果如下圖所示。

Python程式設計快速上手——正則表示式查詢功能案例分析

本文例項講述了Python正則表示式查詢功能。分享給大家供大家參考，具體如下：

linux grep與正則表示式使用介紹

grep （縮寫來自Globally search a Regular Expression and Print）是一種強大的文字搜尋工具，它能使用特定模式匹配（包括正則表示式）搜尋文字，並預設輸出匹配行。Unix的grep家族包括grep、egrep和fgrep。Windows

C++與正則表示式入門

什麼是正則表示式? 正則表示式是一組由字母和符號組成的特殊文字, 當你想要判斷許多字串是否符合某個特定格式；當你想在一大段文字中查找出所有的日期和時間；當你想要修改大量日誌中所有的時間格式，在這些情況下，

九齒耙(Ninerake)資料採集大資料深度學習智慧分析Python爬蟲軟體的正則表示式規則簡介

正則表示式易於使用，功能強大，可用於複雜的搜尋和替換以及基於模板的文字檢查。這對於輸入形式的使用者輸入驗證特別有用-驗證電子郵件地址等。您還可以從網頁或文件中提取電話號碼，郵政編碼等，在日誌檔案中搜索複

Python 學習筆記之——正則表示式

0. 常用匹配規則 ^ 匹配字串的開頭 $ 匹配字串的結尾 [...] 匹配一組字元，比如 [abc] 表示匹配小寫字母 a 或者 b 或者 c，[a-z] 表示匹配所有的小寫字母，[0-3] 表示匹配數字 0,1,2,3

SQL中常見的模糊查詢like與正則表示式

1.普通的模糊查詢　SELECT 欄位 FROM 表名 WHERE 欄位 LIKE　條件　　關於條件又可以分為四種匹配模式：

深入淺出grep與正則表示式

一、什麼是正則表示式很可能我們經常會聽到一些有經驗的系統管理員告訴我們說：正則表示式非常重要。為什麼說正則表示式非常重要呢？因為我們在使用文字編輯的時候或者編寫shell指令碼的時候經常會使用到

python基礎知識--8正則表示式

1.正則表示式 # 正則表示式# 通俗而言，就是通過某種規則，來匹配符合條件的字元序列。# 適用場景：# 快速地查詢、替換或匹配具有特殊格式的字元；# 如：#文字替換；#匹配電子郵箱、電話號碼、IP地址等； #匹配爬蟲程

python學習23天----正則表示式、字元組、元字元、量詞

引：所有的模組中的所有方法是記不住的，但是哪個模組能做哪些事情是可以記住的；所以關鍵在於掌握模組用法、掌握常見的方法、其他的應該整理筆記，然後記住

jmeter設定全域性變數與正則表示式提取器過程圖解

介面測試中，很多介面都要帶上登入後的token才能正常傳送請求，這裡記錄一下登入獲取token設定為全域性變數供其他介面使用

python基礎-re模組(正則表示式）

正則表示式：就是為了能夠模糊匹配。str的find方法都是精準匹配。 1.普通字元——就是精準匹配

php正則替換%3cbr%3e_Python的re模組與正則表示式小結

技術標籤：php正則替換%3cbr%3e 一、Python模組之RE模組一些可選值： - re.I(全拼：ignorecase)：忽略大小寫

python re模組和正則表示式

一、re模組和正則表示式先來看一個例子：https://reg.jd.com/reg/person?ReturnUrl=https%3A//www.jd.com/

python的re（正則表示式）

python re import re s = \'\'\'bottle\\nbag\\nbig\\napple\'\'\' for i,c in enumerate(s, 1): print((i-1, c), end=\"\\n\" if i%8==0 else \' \')

Linux/Unix工具與正則表示式的POSIX規範(轉載)

對正則表示式有基本瞭解的讀者，一定不會陌生『\\d』、『[a-z]+』之類的表示式，前者匹配一個數字字元，後者匹配一個以上的小寫英文字母。但是如果你用過vi、grep、awk、sed之類Linux/Unix下的工具或許會發現，這些

Blog.039 Shell 程式設計 grep 與正則表示式

本章目錄 1. 正則表示式概述　　1.1 基礎正則表示式　　1.2 元字元型別2. grep 概述　　2.1 grep 的基本用法和格式　　2.2 grep 中的正則表示式（操作例項）

8.grep命令與正則表示式

Linux上文字處理三劍客：　　grep：文字過濾工具（模式：pattern）工具；　　sed：stream editor，流編輯器；文字編輯工具；

Python核心程式設計：正則表示式

沒有一個特別熟練的語言是要吃大虧的，以前python只停留在表面，現在得深入研究一下python的各個特性和用法。按照書《python核心程式設計》來做筆記，如有讀者，歡迎指正

python模式匹配與正則表示式

相關推薦