OCR with pytesseract and the PIL library

阿新 • Published: 2020-12-23

1. Capture the image to be recognized

import os
import time

import pytesseract
from PIL import Image

def screenshots_picture(driver, locator):
    '''
    Capture the image that will be passed to OCR.
    :param driver: browser driver
    :param locator: locator tuple for the target element (unpacked into find_element)
    :return: path of the cropped image file, or None on failure
    Note:
        # On a Retina screen the crop box must be scaled by the device pixel
        # ratio, otherwise the wrong region of the screenshot is cropped.
        # dpr = driver.execute_script('return window.devicePixelRatio')
        # im = Image.open(picture_name1)
        # img = im.crop((left * dpr, top * dpr, right * dpr, bottom * dpr))
    '''
    try:
        # Take a screenshot of the current page, which contains the CAPTCHA
        name = f'{time.time()}.png'
        fileName = filePictruePath(name)
        driver.save_screenshot(fileName)
        # Locate the CAPTCHA element
        imgelement = driver.find_element(*locator)
        # Get the x and y coordinates of the CAPTCHA
        location = imgelement.location
        x = int(location['x'])
        y = int(location['y'])
        # Get the width and height of the CAPTCHA
        size = imgelement.size
        width = int(size['width'])
        height = int(size['height'])
        # Scale by the device pixel ratio so the crop is correct on high-DPI screens
        dpr = driver.execute_script('return window.devicePixelRatio')
        # Crop box defined by two corners: (left, top, right, bottom)
        rangle = (x*dpr, y*dpr, (x+width)*dpr, (y+height)*dpr)
        # Open the full-page screenshot
        open_fileName = Image.open(fileName)
        # Use Image.crop to cut the region we need out of the screenshot
        screenshots = open_fileName.crop(rangle)
        # Save the cropped CAPTCHA image
        ocr_name = f'{time.time()}ocr.png'
        screenshots_fileName = filePictruePath(ocr_name)
        screenshots.save(screenshots_fileName)

        return screenshots_fileName
    except Exception:
        return None
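
The crop-box arithmetic above can be checked in isolation, without a browser. The following is a minimal sketch: `scaled_crop_box` is a hypothetical helper (not part of the original code), and the element position and size are made-up values chosen only to show how the device pixel ratio changes the box.

```python
def scaled_crop_box(location, size, dpr=1.0):
    """Return a (left, top, right, bottom) crop box in screenshot pixels.

    location and size mirror the dicts Selenium returns for an element
    (in CSS pixels); dpr is the value of window.devicePixelRatio.
    """
    left = location['x'] * dpr
    top = location['y'] * dpr
    right = (location['x'] + size['width']) * dpr
    bottom = (location['y'] + size['height']) * dpr
    # Round to whole pixels so the box can be passed straight to Image.crop.
    return tuple(int(round(v)) for v in (left, top, right, bottom))


# Hypothetical element at (100, 50), 80x30 CSS pixels.
location = {'x': 100, 'y': 50}
size = {'width': 80, 'height': 30}

box_standard = scaled_crop_box(location, size, dpr=1)
box_retina = scaled_crop_box(location, size, dpr=2)
print(box_standard)  # (100, 50, 180, 80)
print(box_retina)    # (200, 100, 360, 160)
```

On a screen with a device pixel ratio of 2, every coordinate doubles, which is why cropping with the unscaled box returns the wrong region.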

2. Recognize the image with OCR

def ocr_code(screenshots_fileName):
    '''
    Run OCR on an image.
    :param screenshots_fileName: path of the image file to recognize
    :return: recognized text
    '''
    # Open the saved image
    open_stream = Image.open(screenshots_fileName)
    # Use pytesseract's image_to_string to read the CAPTCHA text
    identify_text = pytesseract.image_to_string(open_stream).strip()
    print(identify_text)
    # Filter out characters that would interfere with the result
    identify_text = filter_str(identify_text)
    return identify_text
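
`image_to_string` also accepts a `config` string that is passed through to Tesseract. For a short, single-line CAPTCHA, it may help to set the page segmentation mode and restrict the character set. The sketch below only builds that string. `build_tesseract_config` is a hypothetical helper, and the defaults (`--psm 7`, digits and ASCII letters) are assumptions to adjust for your images.

```python
import string


def build_tesseract_config(psm=7, whitelist=string.digits + string.ascii_letters):
    """Build a config string for pytesseract.image_to_string(..., config=...).

    psm=7 asks Tesseract to treat the image as a single line of text.
    whitelist limits the characters Tesseract may output; pass '' to skip it.
    """
    parts = [f'--psm {psm}']
    if whitelist:
        parts.append(f'-c tessedit_char_whitelist={whitelist}')
    return ' '.join(parts)


config = build_tesseract_config()
print(config)

# Intended use (requires Tesseract and pytesseract to be installed):
#   text = pytesseract.image_to_string(Image.open(path), config=config)
```

Depending on the Tesseract version and engine, the whitelist may not be honored, so the `filter_str` step is still worth keeping as a safety net.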

Helper functions used:

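def filePictruePath(name):
    '''
    Build the path of a file inside the screenshot directory.
    :param name: file name
    :return: full path of the file
    '''
    # Resolve to an absolute path first so a relative __file__ still works
    base_dir = os.path.dirname(os.path.dirname(os.path.abspath(__file__)))
    file_dir = os.path.join(base_dir, 'screenshot')
    # Create the directory if it does not exist yet
    os.makedirs(file_dir, exist_ok=True)
    return os.path.join(file_dir, name)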

def filter_str(args):
    '''
    Remove invalid characters from a string.
    :param args: input value; only ASCII digits and letters are kept
    :return: the filtered string
    '''
    # isascii() excludes characters such as 'ß', whose upper() form ('SS')
    # would otherwise pass an 'A'..'Z' range comparison
    return ''.join(ch for ch in str(args) if ch.isascii() and ch.isalnum())
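
The same filtering can be written as a single regular expression, which makes the behavior easy to try on sample OCR output. This is a sketch only: `filter_alnum` is a hypothetical name, and the sample strings are invented rather than real Tesseract results.

```python
import re


def filter_alnum(text):
    """Keep only ASCII digits and letters, dropping everything else."""
    return re.sub(r'[^0-9A-Za-z]', '', str(text))


# Invented examples of noisy OCR output: whitespace, punctuation, non-ASCII text.
samples = [' a B-1 2\n', '7x,K9.', '驗證碼: Q4zt']
cleaned = [filter_alnum(s) for s in samples]
print(cleaned)  # ['aB12', '7xK9', 'Q4zt']
```

Whitespace, punctuation and non-ASCII characters are all dropped, leaving only the characters a typical alphanumeric CAPTCHA can contain.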

Running this code may raise an error.

For the fix, see: https://blog.csdn.net/qq_31362767/article/details/107891185
