python圖片驗證碼識別最新模組muggle_ocr的示例程式碼

阿新 • • 發佈：2020-07-05

一.官方文件

https://pypi.org/project/muggle-ocr/

二模組安裝

pip install muggle-ocr
# 因模組過新，阿里/清華等第三方源可能尚未更新映象，因此手動指定使用境外源，為了提高依賴的安裝速度，可預先自行安裝依賴：tensorflow/numpy/opencv-python/pillow/pyyaml

三.使用程式碼

# 匯入包
import muggle_ocr

# 初始化；model_type 包含了 ModelType.OCR/ModelType.Captcha 兩種
sdk = muggle_ocr.SDK(model_type=muggle_ocr.ModelType.OCR)
# ModelType.OCR 可識別光學印刷文字 這裡個人覺得應該是官方文件寫錯了 官方文件是ModelType.Captcha 可識別光學印刷文字
with open(r"test1.png","rb") as f:
 b = f.read()
text = sdk.predict(image_bytes=b)
print(text)

# ModelType.Captcha 可識別4-6位驗證碼
sdk = muggle_ocr.SDK(model_type=muggle_ocr.ModelType.Captcha)
with open(r"test1.png","rb") as f:
 b = f.read()
text = sdk.predict(image_bytes=b)
print(text)

PS：下面看下 Python 實現全自動登入(真正的全自動，自動識別驗證碼)

你沒有看錯，全自動驗證~~~

黑科技？還是黑程式碼？
我感覺這個看在你用啥，對不對？反正我用來（* * * * ）你懂得

好了，先說一下用到的東西

selenium (本意是用來全自動測試)
Phantomjs (一種沒有介面的瀏覽器)
** 驗證碼識別器（一塊錢可用100次的這種）

關門放程式碼

from selenium import webdriver
from PIL import Image
if __name__ == '__main__':
 wbe = webdriver.PhantomJS()
 wbe.get("https://www.某個網站的登入頁面.com/login/index.html")//你可以拿知乎，百度，等等測試
 element = wbe.find_element_by_xpath('//*[@id="entry_name"]/p[3]/img')//驗證碼所在的xpath路徑
 left = element.location['x']
 top = element.location['y']
 right = element.location['x'] + element.size['width']
 bottom = element.location['y'] + element.size['height']
 im = Image.open(r'登入頁.png')//全頁面截圖
 im = im.crop((left,top,right,bottom))
 im.save('驗證碼.png')

#!/usr/bin/env python
# coding:utf-8
import requests
from hashlib import md5
class RClient(object):
 def __init__(self,username,password,soft_id,soft_key):
  self.username = username
  self.password = md5(password).hexdigest()
  self.soft_id = soft_id
  self.soft_key = soft_key
  self.base_params = {
   'username': self.username,'password': self.password,'softid': self.soft_id,'softkey': self.soft_key,}
  self.headers = {
   'Connection': 'Keep-Alive','Expect': '100-continue','User-Agent': 'ben',}
 def rk_create(self,im,im_type,timeout=60):
  """
  im: 圖片位元組
  im_type: 題目型別
  """
  params = {
   'typeid': im_type,'timeout': timeout,}
  params.update(self.base_params)
  files = {'image': ('a.png',im)}
  r = requests.post('http://api.ruokuai.com/create.json',data=params,files=files,headers=self.headers)
  return r.json()
 def rk_report_error(self,im_id):
  """
  im_id:報錯題目的ID
  """
  params = {
   'id': im_id,}
  params.update(self.base_params)
  r = requests.post('http://api.ruokuai.com/reporterror.json',headers=self.headers)
  return r.json()
def get_code():
 rc = RClient('使用者名稱','密碼','94522','62c235939b7240879453f31603733fd6')//想拿下測試的留言我，教你拿到測試賬號
 im = open('a.png','rb').read()
 print rc.rk_create(im,3040)

完整程式碼

#!/usr/bin/env python
# coding:utf-8
from selenium import webdriver
from PIL import Image
import requests
from hashlib import md5
import time
class RClient(object):
 def __init__(self,soft_key):
  self.username = username
  self.password = md5(password.encode("utf-8")).hexdigest()
  self.soft_id = soft_id
  self.soft_key = soft_key
  self.base_params = {
   'username': self.username,headers=self.headers)
  return r.json()
def get_code(im_file):
 rc = RClient('賬號','62c235939b7240879453f31603733fd6')
 im_source = open(im_file,"rb").read()
 print(rc.rk_create(im_source,3040))
if __name__ == '__main__':
 wbe = webdriver.PhantomJS()
 wbe.get("https://www.dajiang365.com/login/index.html")
 time.sleep(2)
 wbe.save_screenshot("das.png")
 element = wbe.find_element_by_xpath('//*[@id="entry_name"]/p[3]/img')
 left = element.location['x']
 top = element.location['y']
 right = element.location['x'] + element.size['width']
 bottom = element.location['y'] + element.size['height']
 im = Image.open(r'das.png')
 im = im.crop((left,bottom))
 im.save('a.png')
 time.sleep(2)
 get_code("a.png")

總結

到此這篇關於python圖片驗證碼識別最新模組muggle_ocr的示例程式碼的文章就介紹到這了,更多相關python 驗證碼識別模組muggle_ocr內容請搜尋我們以前的文章或繼續瀏覽下面的相關文章希望大家以後多多支援我們！

python圖片驗證碼識別最新模組muggle_ocr的示例程式碼

一.官方文件 https://pypi.org/project/muggle-ocr/ 二模組安裝 pip install muggle-ocr # 因模組過新，阿里/清華等第三方源可能尚未更新映象，因此手動指定使用境外源，為了提高依賴的安裝速度，可預先自行安裝依

python對驗證碼降噪的實現示例程式碼

前言：最近寫爬蟲會經常遇到一些驗證碼識別的問題，現如今的驗證碼已經是五花八門，剛開始的驗證碼就是簡單的對生成的驗證碼圖片進行一些干擾，但是隨著計算機視覺庫的發展壯大，可以輕鬆解決簡單的驗證碼識別問題

Python基於內建庫pytesseract實現圖片驗證碼識別功能

這篇文章主要介紹了Python基於內建庫pytesseract實現圖片驗證碼識別功能,文中通過示例程式碼介紹的非常詳細，對大家的學習或者工作具有一定的參考學習價值,需要的朋友可以參考下

Python實現驗證碼識別

大致介紹　　在python爬蟲爬取某些網站的驗證碼的時候可能會遇到驗證碼識別的問題，現在的驗證碼大多分為四類：

Python pytesseract驗證碼識別庫用法解析

環境 centos7 python3 pytesseract只是tesseract-ocr的一種實現介面。所以要先安裝tesseract-ocr（大名鼎鼎的開源的OCR識別引擎）。

Python 圖形驗證碼識別與利用

有一段時間沒更新部落格了，今天正好碰到公司的一個上線系統需要做安全檢查同時有圖形驗證碼較弱的問題，這裡就拿它來做例子記錄下，拿到系統首先看了登入介面，發現開發還是有一定的安全意識的，圖形驗證碼已經加入

圖片驗證碼識別技術——Tesseraact

將圖片翻譯成文字一般被稱為光學文字識別（Optical Character Recognition），簡稱為OCR。

圖片驗證碼識別

技術標籤：pythonpython影象識別 # -*- coding: utf8 -*- from re import findall import os import requests

pytesseract圖片驗證碼識別,四位數字

技術標籤：影象識別Pythonpython影象識別 try: import Image except ImportError: from PIL import Image

python爬蟲-驗證碼識別

為什麼需要識別驗證碼驗證碼是網站的一種反措施，有些時候我們需要登陸使用者才可以獲取到我們想要的資料，所以驗證碼識別是必要的。驗證碼識別操作：

Python 識別12306圖片驗證碼物品的實現示例

1、PIL介紹以及圖片分割 Python 3 安裝: pip3 install Pillow 1.1 image 模組 Image模組是在Python PIL影象處理中常見的模組，主要是用於對這個影象的基本處理，它配合open、save、convert、show…等功能使用。

文字識別還能這樣用？通過Python做文字識別到破解圖片驗證碼

前期準備 1. 安裝包，直接在終端上輸入pip指令即可： # 傳送瀏覽器請求 pip3 install requests

Python爬蟲對於圖片驗證碼自動識別的實現及模擬會話登陸！

一、圖片驗證碼識別驗證碼識別所使用的api為為快速圖片識別平臺，網頁地址為http://fast.95man.com/auth/main.html，在這個平臺中我們需要先依據使用者名稱和密碼獲取到token

Python驗證碼識別安裝Pillow、tesseract-ocr與pytesseract模組的安裝以及錯誤解決

1. 安裝tesseract tesseract下載地址：https://digi.bib.uni-mannheim.de/tesseract/ 下載完成後雙擊，此時會出現如下圖所示的頁面。

用Python模擬識別圖片驗證碼併發送手機驗證碼

1、導語大家好，好久不見。又到每日分享Python小技能的時候了。最近因為疫情影響，所以更新內容比較慢…今天週一，就來更新一波，心血來潮，是時候上線經營了。其實也沒想到有啥好分享的，不如分享一些乾貨給大家

python cv2在驗證碼識別中應用例項解析

這篇文章主要介紹了python cv2在驗證碼識別中應用例項解析,文中通過示例程式碼介紹的非常詳細，對大家的學習或者工作具有一定的參考學習價值,需要的朋友可以參考下

使用python 對驗證碼圖片進行降噪處理

首先貼一張驗證碼上來做案例：第一步先通過二值化處理把干擾線去掉： from PIL import Image

python自動化實現登入獲取圖片驗證碼功能

主要記錄一下：圖片驗證碼 1.獲取登入介面的圖片 2.獲取驗證碼位置 3.在登入頁面擷取驗證碼儲存

Python +Selenium解決圖片驗證碼登入或註冊問題(推薦)

1. 解決思路首先要獲得這張驗證碼的圖片，但是該圖片一般都是用的js寫的，不能夠通過url進行下載。

寫給程式設計師的機器學習入門 (八) - 卷積神經網路 (CNN) - 圖片分類和驗證碼識別

這一篇將會介紹卷積神經網路 (CNN)，CNN 模型非常適合用來進行圖片相關的學習，例如圖片分類和驗證碼識別，也可以配合其他模型實現 OCR。

python圖片驗證碼識別最新模組muggle_ocr的示例程式碼

相關推薦