A First Look at Keras (Part 2): Recognizing CAPTCHAs

阿新 • Published: 2018-12-16

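Following the previous post's first look at Keras, I will walk through an example of using Keras for image classification. Today we start with the problem of recognizing CAPTCHAs.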

I. Topics Covered

1. Data source

2. Building the model

3. Optimization

II. Data Source

In this post I want to recognize CAPTCHAs. There is a Python package, captcha, that can generate them. Before using it, install it and import the required packages.

sudo pip3 install captcha

import cv2
import numpy as np
from captcha.image import ImageCaptcha

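Here the CAPTCHA size is set to 28*28 with a font size of 24. Take the two images below as examples: the first is a 5 and the second is a 6. The interference is fairly heavy.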

The complete code is given below.

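import os

import cv2
import numpy as np
from captcha.image import ImageCaptcha

def generate_captcha(text):
    # render `text` as a 28x28 CAPTCHA and return it as a uint8 array
    capt = ImageCaptcha(width=28, height=28, font_sizes=[24])
    image = capt.generate_image(text)
    image = np.array(image, dtype=np.uint8)
    return image

if __name__ == '__main__':
    output_dir = './datasets/images/'
    # cv2.imwrite does not create missing directories, so create it first
    os.makedirs(output_dir, exist_ok=True)
    for i in range(5000):
        label = np.random.randint(0, 10)   # random digit 0-9
        image = generate_captcha(str(label))
        # the label goes after the '_' so it can be recovered when loading
        image_name = 'image{}_{}.jpg'.format(i + 1, label)
        output_path = output_dir + image_name
        cv2.imwrite(output_path, image)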

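Save the file as gendata.py; running it generates 5000 CAPTCHA images. This is only an experiment, so the number of images is small. When you run your own experiments, you may want to generate more.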
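
Because the labels are drawn at random, the ten digits will not be perfectly balanced. Here is a minimal sketch for checking the distribution from the file names alone, assuming the image{index}_{label}.jpg naming used above. The helper names label_from_filename and label_counts are my own for illustration and are not part of gendata.py.

```python
import os
from collections import Counter

def label_from_filename(path):
    """Return the digit stored after the last '_' in the file name."""
    stem = os.path.splitext(os.path.basename(path))[0]   # e.g. 'image12_7'
    return int(stem.rsplit('_', 1)[1])

def label_counts(paths):
    """Count how many files carry each label."""
    return Counter(label_from_filename(p) for p in paths)

# hypothetical file names following the 'image{index}_{label}.jpg' pattern
example = ['./datasets/images/image1_5.jpg',
           './datasets/images/image2_6.jpg',
           './datasets/images/image3_5.jpg']
print(label_counts(example))
```

On a real run you could pass in the result of glob.glob('./datasets/images/*.jpg') instead of the example list.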

III. Building the Model

To start, we can build a very simple LeNet for validation and testing. Save the following file as lenet.py.

# import the necessary packages
from keras.models import Sequential
from keras.layers import Conv2D
from keras.layers import MaxPooling2D
from keras.layers import Activation
from keras.layers import Flatten
from keras.layers import Dense
from keras import backend as K
 
class LeNet:
    @staticmethod
    def build(width, height, depth, classes):
        # initialize the model
        model = Sequential()
        inputShape = (height, width, depth)
        # if we are using "channels first", update the input shape
        if K.image_data_format() == "channels_first":   # not the TensorFlow default, which is "channels_last"
            inputShape = (depth, height, width)
        # first set of CONV => RELU => POOL layers
        model.add(Conv2D(20, (5, 5),padding="same",input_shape=inputShape))
        model.add(Activation("relu"))
        model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))
        #second set of CONV => RELU => POOL layers
        model.add(Conv2D(50, (5, 5), padding="same"))
        model.add(Activation("relu"))
        model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))
        # first (and only) set of FC => RELU layers
        model.add(Flatten())
        model.add(Dense(500))
        model.add(Activation("relu"))

        # softmax classifier
        model.add(Dense(classes))
        model.add(Activation("softmax"))

        # return the constructed network architecture
        return model
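
To get a feel for the size of this network, the layer shapes and parameter counts can be worked out by hand. The sketch below does the arithmetic in plain Python, assuming a 32x32x3 input (the norm_size used in train.py later) and 10 classes. The helper functions are hypothetical and only mirror what the Keras layers above compute.

```python
def conv_same(shape, filters, kernel):
    """'same'-padded, stride-1 conv: height/width unchanged, channels become `filters`."""
    h, w, c = shape
    params = kernel * kernel * c * filters + filters   # weights + biases
    return (h, w, filters), params

def max_pool_2x2(shape):
    """2x2 pooling with stride 2 halves height and width and has no parameters."""
    h, w, c = shape
    return (h // 2, w // 2, c)

def dense(n_in, n_out):
    """Fully connected layer: n_in * n_out weights plus n_out biases."""
    return n_out, n_in * n_out + n_out

def lenet_shapes(height=32, width=32, depth=3, classes=10):
    total = 0
    shape = (height, width, depth)
    shape, p = conv_same(shape, 20, 5)      # Conv2D(20, (5, 5))
    total += p
    shape = max_pool_2x2(shape)
    shape, p = conv_same(shape, 50, 5)      # Conv2D(50, (5, 5))
    total += p
    shape = max_pool_2x2(shape)
    flat = shape[0] * shape[1] * shape[2]   # Flatten()
    units, p = dense(flat, 500)             # Dense(500)
    total += p
    units, p = dense(units, classes)        # Dense(classes)
    total += p
    return flat, total

flat, total = lenet_shapes()
print("flattened features:", flat)       # 3200
print("trainable parameters:", total)    # 1632080
```

Almost all of the parameters sit in the Dense(500) layer, which is worth keeping in mind when deciding where to apply regularization.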

Next we load the data. The digit each image shows is stored in the file name, after the '_'.

def get_data(images_path):
    if not os.path.exists(images_path):
        raise ValueError('images_path does not exist.')

    images = []
    labels = []
    images_path = os.path.join(images_path,'*.jpg')
    count = 0
    for image_file in glob.glob(images_path):
        count +=1
        if count % 100 == 0:
            print('Loaded {} images.'.format(count))
        image = cv2.imread(image_file)
        image = cv2.cvtColor(image,cv2.COLOR_BGR2RGB)
        image = cv2.resize(image, (norm_size, norm_size))
        label = int(image_file.split('_')[-1].split('.')[0])
        images.append(image)
        labels.append(label)
    images = np.array(images)
    labels = np.array(labels)

    (trainX, testX, trainY, testY) = train_test_split(images,
            labels, test_size=0.25, random_state=42)

    # convert the labels from integers to vectors
    trainY = to_categorical(trainY, num_classes=CLASS_NUM)
    testY = to_categorical(testY, num_classes=CLASS_NUM)   
    return trainX,trainY,testX,testY
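
The last two steps of get_data are worth spelling out. Below is a rough NumPy sketch of what the one-hot conversion produces and how large the two splits are for 5000 images with test_size=0.25. The function one_hot is my own stand-in for illustration; the real code calls to_categorical.

```python
import math
import numpy as np

def one_hot(labels, num_classes):
    """Turn integer labels into one-hot rows, e.g. 3 -> [0, 0, 0, 1, 0, ...]."""
    labels = np.asarray(labels, dtype=int)
    encoded = np.zeros((labels.size, num_classes), dtype=np.float32)
    encoded[np.arange(labels.size), labels] = 1.0
    return encoded

labels = np.array([5, 6, 0])
encoded = one_hot(labels, 10)
print(encoded)

# split sizes for the 5000 generated images with test_size=0.25
n_total = 5000
n_test = math.ceil(n_total * 0.25)
n_train = n_total - n_test
print(n_train, n_test)   # 3750 1250
```

Each row contains a single 1 at the index of its digit, which is the target format categorical_crossentropy expects.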

After this processing we have a training set and a test set. Here is the complete train.py first; afterwards we will modify it. Run it as follows:

python3 train.py -d images/ -m my.model

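Here images/ is the directory holding the CAPTCHA images (gendata.py above writes them to ./datasets/images/), and my.model is where the model is saved.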

import matplotlib
matplotlib.use("Agg")
 
# import the necessary packages
import glob
from keras.preprocessing.image import ImageDataGenerator
from keras.optimizers import Adam
from sklearn.model_selection import train_test_split
from keras.preprocessing.image import img_to_array
from keras.utils import to_categorical
#from imutils import paths
import matplotlib.pyplot as plt
import numpy as np
import argparse
import random
import cv2
import os
import sys
sys.path.append('..')
from lenet import LeNet



def args_parse():
    # construct the argument parse and parse the arguments
    ap = argparse.ArgumentParser()
    ap.add_argument("-d", "--dataset", required=True,
        help="path to input dataset")
    ap.add_argument("-m", "--model", required=True,
        help="path to output model")
    ap.add_argument("-p", "--plot", type=str, default="plot.png",
        help="path to output accuracy/loss plot")
    args = vars(ap.parse_args()) 
    return args


args = args_parse()

# initialize the number of epochs to train for, initial learning rate,
# and batch size
EPOCHS = 200
INIT_LR = 1e-2
BS = 128
CLASS_NUM = 10
norm_size = 32
# initialize the data and labels

def get_data(images_path):
    if not os.path.exists(images_path):
        raise ValueError('images_path does not exist.')

    images = []
    labels = []
    images_path = os.path.join(images_path,'*.jpg')
    count = 0
    for image_file in glob.glob(images_path):
        count +=1
        if count % 100 == 0:
            print('Loaded {} images.'.format(count))
        image = cv2.imread(image_file)
        image = cv2.cvtColor(image,cv2.COLOR_BGR2RGB)
        image = cv2.resize(image, (norm_size, norm_size))
        label = int(image_file.split('_')[-1].split('.')[0])
        images.append(image)
        labels.append(label)
    images = np.array(images)
    labels = np.array(labels)

    (trainX, testX, trainY, testY) = train_test_split(images,
            labels, test_size=0.25, random_state=42)

    # convert the labels from integers to vectors
    trainY = to_categorical(trainY, num_classes=CLASS_NUM)
    testY = to_categorical(testY, num_classes=CLASS_NUM)   
    return trainX,trainY,testX,testY

def train(aug,trainX,trainY,testX,testY,args):
    # initialize the model
    print("[INFO] compiling model...")
    model = LeNet.build(width=norm_size, height=norm_size, depth=3, classes=CLASS_NUM)
    opt = Adam(lr=INIT_LR, decay=INIT_LR / EPOCHS)
#    opt = Adam(lr=INIT_LR)
    model.compile(loss="categorical_crossentropy", optimizer=opt,
        metrics=["accuracy"])

    # train the network
    print("[INFO] training network...")
    H = model.fit_generator(aug.flow(trainX, trainY, batch_size=BS),
        validation_data=(testX, testY), steps_per_epoch=len(trainX) // BS,
        epochs=EPOCHS, verbose=1)

    # save the model to disk
    print("[INFO] serializing network...")
    model.save(args["model"])
    
    # plot the training loss and accuracy
    plt.style.use("ggplot")
    plt.figure()
    N = EPOCHS
    plt.plot(np.arange(0, N), H.history["loss"], label="train_loss")
    plt.plot(np.arange(0, N), H.history["val_loss"], label="val_loss")
    plt.plot(np.arange(0, N), H.history["acc"], label="train_acc")
    plt.plot(np.arange(0, N), H.history["val_acc"], label="val_acc")
    plt.title("Training Loss and Accuracy on CAPTCHA classifier")
    plt.xlabel("Epoch #")
    plt.ylabel("Loss/Accuracy")
    plt.legend(loc="lower left")
    plt.savefig(args["plot"])
    
#python3 train.py --dataset ./datasets/images/ --model my.model
if __name__=='__main__':
    args = args_parse()
    file_path = args["dataset"]
    trainX,trainY,testX,testY = get_data(file_path)
    # construct the image generator for data augmentation
    aug = ImageDataGenerator(rotation_range=30, width_shift_range=0.1,
        height_shift_range=0.1, shear_range=0.2, zoom_range=0.2,
        horizontal_flip=True, fill_mode="nearest")
    train(aug,trainX,trainY,testX,testY,args)
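
One practical note on the plotting code: depending on the Keras version, the accuracy series in H.history is stored under "acc"/"val_acc" (as above) or under "accuracy"/"val_accuracy" in newer releases. A small helper like the following hypothetical pick_metric lets the plotting code work with either. This is only a sketch, not part of the original train.py.

```python
def pick_metric(history, *names):
    """Return the first metric series found under any of the given names."""
    for name in names:
        if name in history:
            return history[name]
    raise KeyError("none of {} found in history".format(names))

# hypothetical history dict, shaped like H.history after two epochs
history = {"loss": [1.9, 1.2], "accuracy": [0.31, 0.58]}
train_acc = pick_metric(history, "acc", "accuracy")
print(train_acc)
```

In train.py this would be used as pick_metric(H.history, "acc", "accuracy") and pick_metric(H.history, "val_acc", "val_accuracy").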

IV. Optimizing the Model

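We can judge training progress from the plot generated on each run. The plot here is the result after several rounds of modification, with accuracy at roughly 0.80. As the figure below shows, val_loss still fluctuates quite a lot, for two reasons: first, the initial learning rate is fairly large; second, I used dropout in this example and set the dropout rate too high (0.25). So we need to make some changes.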

4.1 Using BatchNormalization

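BatchNormalization() is really handy: placed between the convolution layer and the pooling layer (here, right after Conv2D and before the activation), it improves performance very effectively.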

        # requires: from keras.layers import BatchNormalization
        model.add(Conv2D(30, (2, 2), padding="same"))
        model.add(BatchNormalization())
        model.add(Activation("relu"))
        model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))
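
To make clear what the layer does during training, here is a rough NumPy sketch of the normalization step. It assumes channels-last tensors and an epsilon of 1e-3. The function batch_norm_train is my own illustration: it leaves out the moving averages that Keras keeps for inference, and gamma and beta would be learned rather than fixed.

```python
import numpy as np

def batch_norm_train(x, gamma, beta, eps=1e-3):
    """Normalize a (batch, height, width, channels) tensor per channel."""
    mean = x.mean(axis=(0, 1, 2), keepdims=True)
    var = x.var(axis=(0, 1, 2), keepdims=True)
    x_hat = (x - mean) / np.sqrt(var + eps)     # zero mean, unit variance per channel
    return gamma * x_hat + beta                 # learned scale and shift

rng = np.random.default_rng(0)
# hypothetical conv output: batch of 128, 16x16 feature maps, 30 channels
x = rng.normal(loc=5.0, scale=3.0, size=(128, 16, 16, 30))
y = batch_norm_train(x, gamma=np.ones(30), beta=np.zeros(30))
print(y.mean(axis=(0, 1, 2))[:3])
print(y.std(axis=(0, 1, 2))[:3])
```

Whatever scale the convolution outputs had, the activation that follows sees inputs with roughly zero mean and unit variance per channel, which tends to make training less sensitive to the learning rate.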

4.2 Learning-Rate Decay Strategy

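In fact, there is no need to set a learning-rate decay strategy when first experimenting. We can simply try larger or smaller learning rates and observe the results. Afterwards we can let the learning rate decay over the epochs, for a fine-tuning effect.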

    opt = Adam(lr=INIT_LR, decay=INIT_LR / EPOCHS)
#    opt = Adam(lr=INIT_LR)
    model.compile(loss="categorical_crossentropy", optimizer=opt,
        metrics=["accuracy"])
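
To see how strong this decay is, the schedule can be tabulated by hand. As far as I understand the decay argument of the Keras 2.x optimizers, it is time-based and applied per batch update, as lr / (1 + decay * iterations). The sketch below assumes that formula and the 3750 training images left after the 75/25 split; the function decayed_lr is hypothetical.

```python
INIT_LR = 1e-2
EPOCHS = 200
BS = 128
N_TRAIN = 3750                     # 75% of the 5000 generated images

def decayed_lr(iteration, lr=INIT_LR, decay=INIT_LR / EPOCHS):
    """Time-based decay (assumed formula): lr / (1 + decay * iteration)."""
    return lr / (1.0 + decay * iteration)

steps_per_epoch = N_TRAIN // BS    # same expression as in fit_generator
for epoch in (0, 50, 100, 200):
    it = epoch * steps_per_epoch
    print("epoch {:3d}  iteration {:5d}  lr {:.5f}".format(epoch, it, decayed_lr(it)))
```

Under these assumptions the learning rate only falls from 0.01 to about 0.0078 over the whole run, so the decay is quite gentle. A smaller INIT_LR probably matters more for calming val_loss than the decay itself.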

4.3 dropout

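Once batch normalization is in place, adding dropout may not make much visible difference. We can apply dropout at the final fully connected layer; using dropout between the convolutional layers can lead to unpredictable results.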

        model.add(Dense(200))
        model.add(Dropout(droprate))   # droprate: the dropout rate; 0.25 turned out too high here
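
For readers who have not looked inside the layer, here is a minimal NumPy sketch of what dropout does at training time: so-called inverted dropout, where the surviving units are rescaled so the expected activation stays the same. The function dropout_train is my own illustration, not Keras code, and at inference time the layer simply passes its input through.

```python
import numpy as np

def dropout_train(x, rate, rng):
    """Inverted dropout: zero a fraction `rate` of units, rescale the rest."""
    if rate == 0.0:
        return x
    keep = rng.random(x.shape) >= rate          # True where the unit survives
    return x * keep / (1.0 - rate)

rng = np.random.default_rng(0)
activations = np.ones((1000, 200))              # e.g. outputs of Dense(200)
dropped = dropout_train(activations, rate=0.25, rng=rng)
print("fraction zeroed:", np.mean(dropped == 0))
print("mean activation:", dropped.mean())
```

With rate=0.25 about a quarter of the units are silenced on every batch, which is one reason the validation curve can look noisy when the rate is set too high for a small network.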

4.4 Data Augmentation

We warp and distort the images to enlarge the data set.

aug = ImageDataGenerator(rotation_range=30, width_shift_range=0.1,
        height_shift_range=0.1, shear_range=0.2, zoom_range=0.2,
        horizontal_flip=True, fill_mode="nearest")
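
To illustrate one of these transformations, here is a simplified NumPy sketch of a random width shift of up to 10% with edge replication, which is roughly what width_shift_range=0.1 combined with fill_mode="nearest" aims for. It is only an approximation under my own assumptions: it shifts by whole pixels, whereas ImageDataGenerator applies a continuous transform. Both function names are hypothetical.

```python
import numpy as np

def shift_width(image, dx):
    """Shift an (H, W, C) image by dx pixels along the width, replicating the edge."""
    if dx == 0:
        return image.copy()
    w = image.shape[1]
    if dx > 0:   # shift right: pad on the left, crop on the right
        padded = np.pad(image, ((0, 0), (dx, 0), (0, 0)), mode="edge")
        return padded[:, :w]
    # shift left: pad on the right, crop on the left
    padded = np.pad(image, ((0, 0), (0, -dx), (0, 0)), mode="edge")
    return padded[:, -dx:]

def random_width_shift(image, max_fraction, rng):
    """Shift by a random whole number of pixels within +/- max_fraction of the width."""
    max_px = int(image.shape[1] * max_fraction)
    dx = int(rng.integers(-max_px, max_px + 1))
    return shift_width(image, dx)

rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(32, 32, 3), dtype=np.uint8)   # stand-in for a CAPTCHA
augmented = random_width_shift(image, 0.1, rng)
print(augmented.shape)
```

Each epoch therefore sees slightly different versions of the same 3750 training images, which is how augmentation stretches a small data set.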

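The current result is shown in the figure below. Because the number of training epochs (200) is not especially large, the result is still not great: accuracy is around 85%. Interested readers can build on this and modify it further. For the complete code, see code.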
