python difflib模組講解示例

阿新 • • 發佈：2018-11-21

difflib模組提供的類和方法用來進行序列的差異化比較，它能夠比對檔案並生成差異結果文字或者html格式的差異化比較頁面，如果需要比較目錄的不同，可以使用filecmp模組。

class difflib.SequenceMatcher

此類提供了比較任意可雜湊型別序列對方法。此方法將尋找沒有包含‘垃圾’元素的最大連續匹配序列。通過對演算法的複雜度比較，它由於原始的完形匹配演算法，在最壞情況下有n的平方次運算，在最好情況下，具有線性的效率。它具有自動垃圾啟發式，可以將重複超過片段1%或者重複200次的字元作為垃圾來處理。可以通過將autojunk設定為false關閉該功能。 

   
    1
    2
    3
    4
    5

class difflib.Differ

 此類比較的是文字行的差異並且產生適合人類閱讀的差異結果或者增量結果，結果中各部分的表示如下：
   
    1

這裡寫圖片描述
class difflib.HtmlDiff

 此類可以被用來建立HTML表格 (或者說包含表格的html檔案) ，兩邊對應展示或者行對行的展示比對差異結果。 make_file(fromlines, tolines [, fromdesc][, todesc][, context][, numlines])make_table(fromlines, tolines [, fromdesc][, todesc][, context][, numlines]) 

   
    1
    2
    3
    4
    5

以上兩個方法都可以用來生成包含一個內容為比對結果的表格的html檔案，並且部分內容會高亮顯示。

difflib.context_diff(a, b[, fromfile][, tofile][, fromfiledate][, tofiledate][, n][, lineterm])

比較a與b(字串列表)，並且返回一個差異文字行的生成器
   
    1

示例：

>>> s1 = ['bacon\n', 'eggs\n', 'ham\n', 'guido\n']>>> s2 = ['python\n', 'eggy\n', 'hamster\n', 'guido\n']>>> for line in context_diff(s1, s2, fromfile='before.py', tofile='after.py'):...     sys.stdout.write(line)  *** before.py--- after.py****************** 1,4 ****! bacon! eggs! ham  guido--- 1,4 ----! python! eggy! hamster  guido 

   
    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17

difflib.get_close_matches(word, possibilities[, n][, cutoff])

返回最大匹配結果的列表
   
    1

示例：

>>> get_close_matches('appel', ['ape', 'apple', 'peach', 'puppy'])['apple', 'ape']>>> import keyword>>> get_close_matches('wheel', keyword.kwlist)['while']>>> get_close_matches('apple', keyword.kwlist)[]>>> get_close_matches('accept', keyword.kwlist)['except']
   
    1
    2
    3
    4
    5
    6
    7
    8
    9

difflib.ndiff(a, b[, linejunk][, charjunk])

比較a與b(字串列表)，返回一個Differ-style 的差異結果
   
    1

示例：

>>> diff = ndiff('one\ntwo\nthree\n'.splitlines(1),...              'ore\ntree\nemu\n'.splitlines(1))>>> print ''.join(diff),- one?  ^+ ore?  ^- two- three?  -+ tree+ emu
   
    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12

difflib.restore(sequence, which)

返回一個由兩個比對序列產生的結果
   
    1

示例

>>> diff = ndiff('one\ntwo\nthree\n'.splitlines(1),...              'ore\ntree\nemu\n'.splitlines(1))>>> diff = list(diff) # materialize the generated delta into a list>>> print ''.join(restore(diff, 1)),onetwothree>>> print ''.join(restore(diff, 2)),oretreeemu
   
    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11

difflib.unified_diff(a, b[, fromfile][, tofile][, fromfiledate][, tofiledate][, n][, lineterm])

比較a與b(字串列表)，返回一個unified diff格式的差異結果.
   
    1

示例：

>>> s1 = ['bacon\n', 'eggs\n', 'ham\n', 'guido\n']>>> s2 = ['python\n', 'eggy\n', 'hamster\n', 'guido\n']>>> for line in unified_diff(s1, s2, fromfile='before.py', tofile='after.py'):...     sys.stdout.write(line)   --- before.py+++ [email protected]@ -1,4 +1,4 @@-bacon-eggs-ham+python+eggy+hamster guido
   
    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14

實際應用示例

比對兩個檔案，然後生成一個展示差異結果的HTML檔案

#coding:utf-8'''file:difflibeg.pydate:2017/9/9 10:33author:lockeyemail:[email protected]:diffle module learning and practising '''import difflibhd = difflib.HtmlDiff()loads = ''with open('G:/python/note/day09/0907code/hostinfo/cpu.py','r') as load:    loads = load.readlines()    load.close()mems = ''with open('G:/python/note/day09/0907code/hostinfo/mem.py', 'r') as mem:    mems = mem.readlines()    mem.close()with open('htmlout.html','a+') as fo:    fo.write(hd.make_file(loads,mems))    fo.close()
   
    1
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23

執行結果：

這裡寫圖片描述

生成的html檔案比對結果：
這裡寫圖片描述

再分享一下我老師大神的人工智慧教程吧。零基礎！通俗易懂！風趣幽默！希望你也加入到我們人工智慧的隊伍中來！http://www.captainbed.net

python difflib模組講解示例

difflib模

python itertools 模組講解

1、介紹itertools 是python的迭代器模組，itertools提供的工具相當高效且節省記憶體。使用這些工具，你將能夠建立自己定製的迭代器用於高效率的迴圈。 - 無限迭代器　itertools包自帶了三個可以無限迭代的迭代器。這意味著，當你使用他們時，你要知道要的到底是最終會停止的迭代器，還是需

python operator模組講解

這些函式屬於執行物件比較，邏輯運算，數學運算，序列運算和抽象型別測試的類別。 operator.lt(a, b) #等價於a<b operator.le(a, b) #等價於a<=b operator.eq(a, b) #等價於a==b operator.ne(a, b) &n

python logging模組講解

日誌欄位資訊與日誌格式一條日誌資訊對應的是一個事件的發生，而一個事件通常需要包括以下幾個內容：事件發生時間事件發生位置事件的嚴重程度--日誌級別事件內容上面這些都是一條日誌記錄中可能包含的欄位資訊，當然還可以包括一些其他資訊，如程

python collections模組講解

collection模組是python內建的一個模組namedtuple tuple表示不變的集合,即一個點可以由二維座標可以表示: from collections import namedtuple point=namedtuple('name',['X','Y']) p=point(1,2)

python logging模組程式碼示例：實現日誌輸出到控制檯，並且寫入日誌檔案中

import logging class Logger(object): def __init__(self, log_file_name, log_level, logger_name):

Python logger模組應用示例

同時輸出到檔案和終端，並設定不同的輸出級別 #! /usr/bin/env python # coding: utf-8 import os import logging class MyLogger(logging.Logger): def __init__(

python強大的繪圖模組matplotlib示例講解

Matplotlib 是 Python 的繪相簿。作為程式設計師，經常需要進行繪圖，在我自己的工作中，如果需要繪圖，一般都是將資料匯入到excel中，然後通過excel生成圖表，這樣操作起來還是比較繁瑣的，所以最近學習了一下Matplotlib模組，將該模組的常用的繪圖手段和大家分享一下，提高大家在工作中的效

Python中模組的搜尋路徑例項講解

2018年3月1日13:26:09 最近在工作的時候遇到一個問題，我首先是拿到別人現成的程式碼，程式碼如下： import os,sys,re import datetime import threading import subprocess import configparser imp

python jieba模組基本命令講解

1、分詞精確模式: import jieba s="fdsfdsfsdfds" s_cut_jq=jieba.cut(s) #可見分詞結果返回的是一個生成器,可實現拼接 cut_jq=','.join(s_cut_jq)全模式: s_cut_qms=jieba.cut(s,cut_all=True)

python-requests資料驅動延伸 python-requests模組的講解和應用

在 python-requests模組的講解和應用基礎上進行資料驅動的延伸 task_01_requests.py #-*- coding:utf-8 -*- #task_01_requests.py # 1：利用requests模組，編寫一個可以完成http

python的記憶體回收機制即gc模組講解

最後容易造成記憶體問題的通常就是全域性單例、全域性快取、長期存活的物件引用計數(主要), 標記清除, 分代收集(輔助) 引用計數為0則會被gc回收。標記刪除可以解決迴圈引用的問題。分代：0代--年輕代；1代--中年代；2代--老年代，存活越久被回收的頻率越低。通過gc機制基本解決記憶體回收的問題。

python threading模組、Timer類講解

16.2.7. Timer Objects # The timer class was contributed by Itamar Shtull-Trauring def Timer(*args, *

Python之difflib模組

difflib 模組包含一些用來計算和處理序列之間差異的工具。它對於比較文字尤其有用，其中包含的函式可以使用多種常用差異格式生成報告。 import difflib text1 = '1234' text2 = '2234' d = difflib.HtmlDiff() with o

Python使用difflib模組比較兩個檔案內容異同，同時輸出html易瀏覽

因工作需求，需要對比連個檔案異同，並輸出html格式來對比。 #!/usr/bin/python # -*- coding: utf-8 -*- import sys import difflib def read_file(filename): try: with open(f

Python datetime模組詳解、示例

一、datetime模組介紹（一）、datetime模組中包含如下類：類名功能說明 date 日期物件,常用的屬性有year, month, day time 時間物件 datetime 日期時間

python difflib模塊實現兩個文件差異對比，並輸出html格式。

python difflib difflib 模塊包含一些用來計算和處理序列之間差異的工具。它對於比較文本尤其有用，其中包含的函數可以使用多種常用差異格式生成報告。實現了三個類： SequenceMatcher 任意類型序列的比較 (可以比較字符串)Differ 對字符串進行比較HtmlDiff

第五章：Python 之 RabbitMQ 基本示例

rabbitmq#send 端import pikacredentials = pika.PlainCredentials(‘root‘, ‘Password1‘)connection = pika.BlockingConnection(pika.ConnectionParameters(‘10.3.151.

python使用IP代理示例及出錯解決方法

python 代理ip requests模塊一、代碼示例# -*- coding:utf-8 -*- import requests header = { 'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; Win64; x64

Python 裝飾器簡單示例

裝飾器簡單裝飾器示例： def servlet(func): print("into servlet")#1 print(servlet)#2 def foo(): print("into foo")#7 print(func)#8，真正的bar函數

python difflib模組講解示例

實際應用示例

相關推薦