Python Numpy gradient原始碼解析

阿新 • • 發佈：2018-12-21

複習影象梯度，發現Numpy有一個梯度計算函式，解析它的原始碼和需要注意的問題，最後自定義一個梯度函式

用法解析

Numpy提供了陣列梯度計算函式

gradient(f, *varargs, **kwargs)

輸入

必選引數：類N維陣列（列表/元組/陣列）
可選引數：標量列表或陣列列表，用於計算差分時的間隔空間
- 單個標量：為所有軸指定間隔
- N個標量：為每一軸指定了間隔，比如dx, dx, dz...
- N個數組：為f的每一軸都指定了取樣值的下標，每個陣列的長度必須匹配相對應那一維的長度
- N個標量或陣列的組合，其作用等同於2和3
- 注意：如果關鍵字引數axis
  
  被指定，那麼這個可選引數的長度必須和要計算的軸數一致
關鍵字引數有2個：
- edge_order：在邊界使用第N-階差分公式，預設是1-階
- axis：None或者是整數又或者是整數的元組，表示沿著指定的軸計算梯度，預設為None，表示計算所有軸的梯度。如果輸入為負數，表示從後向前計算軸的梯度

從它的介紹中，我唯一沒理解的就是可選引數的用法，看了程式碼後也沒理解，因為它和一階差分沒有關係，所以在後面的內容就忽略對可選引數的解析

輸出

Return the gradient of an N-dimensional array.

The returned gradient hence has the same shape as the input array.

如果輸入是一維陣列，返回同樣大小的梯度陣列

如果是多維陣列，並要求計算多維，那麼返回一個列表，包含計算的每一維的梯度，其大小和輸入陣列一致

計算方式

The gradient is computed using second order accurate central differences
in the interior points and either first or second order accurate one-sides
(forward or backwards) differences at the boundaries.

在它的介紹中，中間元素使用**2階中心差分**，但是檢視原始碼發現，它預設使用的其實是一階中心差分

，除非輸入可選引數的3和4選項，但是程式碼中的二階差分公式也沒看懂？，應該是使用泰勒級數的近似公式，所以跳過對程式碼中二階差分的介紹

兩邊使用一階前向或後向差分，也可通過關鍵字引數edge_order修改計算方式

差分公式

一階前向差分：

$\bigtriangleup f(x_{k})=f(x_{k+1})-f(x_{k})$

一階後向差分：

$\bigtriangledown f(x_{k})=f(x_{k})-f(x_{k-1})$

一階中心差分：

$\delta f(x_{k})=\frac{f(x_{k+1})-f(x_{k-1})}{2}$

二階中心差分：

${f}''(x_{k})={f}'(x_{k})-{f}'(x_{k-1})=f(x_{k+1})-f(x_{k})-(f(x_{k})-f(x_{k-1}))=f(x_{k+1})+f(x_{k-1})-2\cdot f(x_{k})$

示例和問題

計算一維或二維陣列的梯度，以及計算影象梯度

一維陣列

>>> import numpy as np
>>> arr1 = np.array(range(5))
>>> arr1
array([0, 1, 2, 3, 4])
>>> np.gradient(arr1)
array([1., 1., 1., 1., 1.])
>>> np.gradient(arr1, 2) # 增加可選引數
array([0.5, 0.5, 0.5, 0.5, 0.5])

問題：沒理解可選引數的適用場景？

二維陣列

>>> arr2 = np.random.randint(1, 9, (3,4))
>>> arr2
array([[6, 3, 5, 1],
    [3, 8, 6, 3],
    [7, 3, 3, 4]])
>>> np.gradient(arr2)
[array([[-3. ,  5. ,  1. ,  2. ],
    [ 0.5,  0. , -1. ,  1.5],
    [ 4. , -5. , -3. ,  1. ]]), array([[-3. , -0.5, -1. , -4. ],
    [ 5. ,  1.5, -2.5, -3. ],
    [-4. , -2. ,  0.5,  1. ]])]
>>> np.gradient(arr2, axis=0) # 對行求導
array([[-3. ,  5. ,  1. ,  2. ],
    [ 0.5,  0. , -1. ,  1.5],
    [ 4. , -5. , -3. ,  1. ]])
>>> np.gradient(arr2, axis=1) # 對列求導
array([[-3. , -0.5, -1. , -4. ],
    [ 5. ,  1.5, -2.5, -3. ],
    [-4. , -2. ,  0.5,  1. ]])

axis=0表示對行求導，axis=1表示對列求導

影象梯度計算需要注意的問題

>>> import cv2 as cv
>>> img = cv.imread("C:\\python\\ConvolveAndGradient\\lena.jpg", 0)
>>> arr3 = img[:3, :4]
>>> arr3
array([[163, 162, 161, 160],
    [162, 162, 162, 161],
    [162, 162, 163, 161]], dtype=uint8)
>>> np.gradient(arr3, axis=0)
array([[255. ,   0. ,   1. ,   1. ],
    [127.5,   0. ,   1. ,   0.5],
    [  0. ,   0. ,   1. ,   0. ]])
>>> np.gradient(arr3, axis=1)
array([[255. , 127. , 127. , 255. ],
    [  0. ,   0. , 127.5, 255. ],
    [  0. ,   0.5, 127.5, 254. ]])

輸入影象計算梯度時，發現數組邊界的梯度計算有誤，後來看原始碼的時候突然發覺是因為數值型別的關係，影象型別是uint8，得到負數會溢位，所以需要先轉換成更高精度的型別

>>> arr4 = np.asarray(img[:3, :4], dtype=np.double)
>>> arr4
array([[163., 162., 161., 160.],
    [162., 162., 162., 161.],
    [162., 162., 163., 161.]])
>>> np.gradient(arr4, axis=0)
array([[-1. ,  0. ,  1. ,  1. ],
    [-0.5,  0. ,  1. ,  0.5],
    [ 0. ,  0. ,  1. ,  0. ]])
>>> np.gradient(arr4, axis=1)
array([[-1. , -1. , -1. , -1. ],
    [ 0. ,  0. , -0.5, -1. ],
    [ 0. ,  0.5, -0.5, -2. ]])

原始碼解析

def gradient(f, *varargs, **kwargs):
    # 將類陣列轉換成陣列
    f = np.asanyarray(f)
    # 獲取維數
    N = f.ndim  # number of dimensions

    # 有沒有指定計算的軸數，預設為空
    axes = kwargs.pop('axis', None)
    if axes is None:
        # 如果為空，那麼計算每一軸的梯度
        axes = tuple(range(N))
    else:
        # 如果不為空，將其轉換成非負整數軸數列表，比如，axis=(-2,-1) -> (0, 1)
        axes = _nx.normalize_axis_tuple(axes, N)

    # 待計算的軸數
    len_axes = len(axes)
    # 可選引數長度，不解析，預設為0
    n = len(varargs)
    if n == 0:
        # no spacing argument - use 1 in all axes
        dx = [1.0] * len_axes
    elif n == 1 and np.ndim(varargs[0]) == 0:
        # single scalar for all axes
        dx = varargs * len_axes
    elif n == len_axes:
        # scalar or 1d array for each axis
        dx = list(varargs)
        for i, distances in enumerate(dx):
            if np.ndim(distances) == 0:
                continue
            elif np.ndim(distances) != 1:
                raise ValueError("distances must be either scalars or 1d")
            if len(distances) != f.shape[axes[i]]:
                raise ValueError("when 1d, distances must match "
                                "the length of the corresponding dimension")
            diffx = np.diff(distances)
            # if distances are constant reduce to the scalar case
            # since it brings a consistent speedup
            if (diffx == diffx[0]).all():
                diffx = diffx[0]
            dx[i] = diffx
    else:
        raise TypeError("invalid number of arguments")

    # 邊界求導方式，預設一階，最多2階
    edge_order = kwargs.pop('edge_order', 1)
    if kwargs:
        raise TypeError('"{}" are not valid keyword arguments.'.format(
                                                '", "'.join(kwargs.keys())))
    if edge_order > 2:
        raise ValueError("'edge_order' greater than 2 not supported")

    # use central differences on interior and one-sided differences on the
    # endpoints. This preserves second order-accuracy over the full domain.

    outvals = []

    # create slice objects --- initially all are [:, :, ..., :]
    # 為每一軸設定切片函式
    slice1 = [slice(None)]*N
    slice2 = [slice(None)]*N
    slice3 = [slice(None)]*N
    slice4 = [slice(None)]*N

    otype = f.dtype
    # 定義輸出數值型別，先解析輸入陣列數值型別
    # 除特殊型別（np.datetime64/np.timedelta64/np.inexact）外，均轉換成np.double
    if otype.type is np.datetime64:
        # the timedelta dtype with the same unit information
        otype = np.dtype(otype.name.replace('datetime', 'timedelta'))
        # view as timedelta to allow addition
        f = f.view(otype)
    elif otype.type is np.timedelta64:
        pass
    elif np.issubdtype(otype, np.inexact):
        pass
    else:
        # all other types convert to floating point
        otype = np.double
    
    # 求導每一軸
    # 按axes定義的軸順序來求導
    # 為啥要弄個dx？，它和可選引數有關，不解析可選引數，所以預設為1.0，不影響計算
    for axis, ax_dx in zip(axes, dx):
        if f.shape[axis] < edge_order + 1:
            # 每一軸的長度必須符合邊界求導規則，比如一階求導，必須有2個數值；2階求導，必須有3個數值
            raise ValueError(
                "Shape of array too small to calculate a numerical gradient, "
                "at least (edge_order + 1) elements are required.")
        # result allocation
        # 建立未初始化陣列
        out = np.empty_like(f, dtype=otype)

        # spacing for the current axis
        # 因為不接戲可選引數，所以為True
        uniform_spacing = np.ndim(ax_dx) == 0

        # Numerical differentiation: 2nd order interior
        # 切片1：取中間值
        # 切片2：取前n-2個
        # 切片3：取中間值
        # 切片4：取後n-2個
        slice1[axis] = slice(1, -1)
        slice2[axis] = slice(None, -2)
        slice3[axis] = slice(1, -1)
        slice4[axis] = slice(2, None)

        if uniform_spacing:
            # 中間數值使用中心差分，有計算可知，是一階差分公式
            out[tuple(slice1)] = (f[tuple(slice4)] - f[tuple(slice2)]) / (2. * ax_dx)
        else:
            dx1 = ax_dx[0:-1]
            dx2 = ax_dx[1:]
            a = -(dx2)/(dx1 * (dx1 + dx2))
            b = (dx2 - dx1) / (dx1 * dx2)
            c = dx1 / (dx2 * (dx1 + dx2))
            # fix the shape for broadcasting
            shape = np.ones(N, dtype=int)
            shape[axis] = -1
            a.shape = b.shape = c.shape = shape
            # 1D equivalent -- out[1:-1] = a * f[:-2] + b * f[1:-1] + c * f[2:]
            out[tuple(slice1)] = a * f[tuple(slice2)] + b * f[tuple(slice3)] + c * f[tuple(slice4)]

        # Numerical differentiation: 1st order edges
        if edge_order == 1:
            # 前向差分
            slice1[axis] = 0
            slice2[axis] = 1
            slice3[axis] = 0
            dx_0 = ax_dx if uniform_spacing else ax_dx[0]
            # 1D equivalent -- out[0] = (f[1] - f[0]) / (x[1] - x[0])
            out[tuple(slice1)] = (f[tuple(slice2)] - f[tuple(slice3)]) / dx_0
            
            # 後向差分
            slice1[axis] = -1
            slice2[axis] = -1
            slice3[axis] = -2
            dx_n = ax_dx if uniform_spacing else ax_dx[-1]
            # 1D equivalent -- out[-1] = (f[-1] - f[-2]) / (x[-1] - x[-2])
            out[tuple(slice1)] = (f[tuple(slice2)] - f[tuple(slice3)]) / dx_n

        # Numerical differentiation: 2nd order edges
        else:
            slice1[axis] = 0
            slice2[axis] = 0
            slice3[axis] = 1
            slice4[axis] = 2
            if uniform_spacing:
                a = -1.5 / ax_dx
                b = 2. / ax_dx
                c = -0.5 / ax_dx
            else:
                dx1 = ax_dx[0]
                dx2 = ax_dx[1]
                a = -(2. * dx1 + dx2)/(dx1 * (dx1 + dx2))
                b = (dx1 + dx2) / (dx1 * dx2)
                c = - dx1 / (dx2 * (dx1 + dx2))
            # 1D equivalent -- out[0] = a * f[0] + b * f[1] + c * f[2]
            out[tuple(slice1)] = a * f[tuple(slice2)] + b * f[tuple(slice3)] + c * f[tuple(slice4)]

            slice1[axis] = -1
            slice2[axis] = -3
            slice3[axis] = -2
            slice4[axis] = -1
            if uniform_spacing:
                a = 0.5 / ax_dx
                b = -2. / ax_dx
                c = 1.5 / ax_dx
            else:
                dx1 = ax_dx[-2]
                dx2 = ax_dx[-1]
                a = (dx2) / (dx1 * (dx1 + dx2))
                b = - (dx2 + dx1) / (dx1 * dx2)
                c = (2. * dx2 + dx1) / (dx2 * (dx1 + dx2))
            # 1D equivalent -- out[-1] = a * f[-3] + b * f[-2] + c * f[-1]
            out[tuple(slice1)] = a * f[tuple(slice2)] + b * f[tuple(slice3)] + c * f[tuple(slice4)]

        # 在結果陣列中新增該軸計算得到的梯度陣列
        outvals.append(out)

        # reset the slice object in this dimension to ":"
        slice1[axis] = slice(None)
        slice2[axis] = slice(None)
        slice3[axis] = slice(None)
        slice4[axis] = slice(None)

# 返回
if len_axes == 1:
    return outvals[0]
else:
    return outvals

自定義

下面自定義一個影象梯度計算函式

要求：

對輸入影象數值型別進行轉換，避免溢位
對中間畫素進行1階中心差分，對邊界進行1階前向或後向差分
計算過程不改變輸入影象畫素值
返回一個列表，包含每一軸的梯度陣列
可以指定要計算哪一軸的梯度，如果僅計算一軸梯度，返回該陣列即可

實現如下：

def image_gradient(f, **kwargs):
    """
    影象梯度求導
    :param f: 類陣列
    :return: 陣列或陣列列表，dtype=np.double
    可選引數：axis - 空或整數或整數列表
    """
    f = np.asanyarray(f, dtype=np.double)
    N = f.ndim  # number of dimensions

    axes = kwargs.pop('axis', None)
    if axes is None:
        axes = tuple(range(N))
    else:
        axes = _nx.normalize_axis_tuple(axes, N)
    len_axes = len(axes)

    outvals = []

    # create slice objects --- initially all are [:, :, ..., :]
    slice1 = [slice(None)] * N
    slice2 = [slice(None)] * N
    slice3 = [slice(None)] * N
    slice4 = [slice(None)] * N

    otype = f.dtype

    for axis in axes:
        if f.shape[axis] < 2:
            raise ValueError(
                "Shape of array too small to calculate a numerical gradient, "
                "at least 2 elements are required.")
        # result allocation
        out = np.empty_like(f, dtype=otype)

        # Numerical differentiation: 2nd order interior
        slice1[axis] = slice(1, -1)
        slice2[axis] = slice(None, -2)
        slice3[axis] = slice(1, -1)
        slice4[axis] = slice(2, None)
        # 1D equivalent -- out[0] = (f[1] - f[-1]) / 2
        out[tuple(slice1)] = (f[tuple(slice4)] - f[tuple(slice2)]) / 2

        slice1[axis] = 0
        slice2[axis] = 1
        slice3[axis] = 0
        # 1D equivalent -- out[0] = (f[1] - f[0]) / (x[1] - x[0])
        out[tuple(slice1)] = (f[tuple(slice2)] - f[tuple(slice3)])

        slice1[axis] = -1
        slice2[axis] = -1
        slice3[axis] = -2
        # 1D equivalent -- out[-1] = (f[-1] - f[-2]) / (x[-1] - x[-2])
        out[tuple(slice1)] = (f[tuple(slice2)] - f[tuple(slice3)])

        outvals.append(out)

        # reset the slice object in this dimension to ":"
        slice1[axis] = slice(None)
        slice2[axis] = slice(None)
        slice3[axis] = slice(None)
        slice4[axis] = slice(None)

    if len_axes == 1:
        return outvals[0]
    else:
        return outvals

輸出陣列或陣列列表的數值型別是np.double，計算影象梯度如下：

import cv2 as cv
import numpy as np
import numpy.core.numeric as _nx
import matplotlib.pyplot as plt

__author__ = 'zj'

image_name = 'lena.jpg'

def get_image():
    return cv.imread(image_name, 0)

def show_image(grad_x, grad_y, grad):
    plt.figure(figsize=(10, 5))  # 設定視窗大小
    plt.suptitle('image_gradient')  # 圖片名稱
    plt.subplot(1, 3, 1)
    plt.title('grad_x')
    plt.imshow(grad_x, cmap='gray'), plt.axis('off')
    plt.subplot(1, 3, 2)
    plt.title('gray_y')
    plt.imshow(grad_y, cmap='gray'), plt.axis('off')
    plt.subplot(1, 3, 3)
    plt.title('grad')
    plt.imshow(grad, cmap='gray'), plt.axis('off')
    plt.savefig('./grad.png') # 儲存影象
    plt.show()

if __name__ == '__main__':
    img = get_image()
    grad_x = image_gradient(img, axis=1)
    grad_y = image_gradient(img, axis=0)
    grad = np.sqrt(grad_x ** 2 + grad_y ** 2)
    abs_x = np.array(np.abs(grad_x), dtype=np.uint8)
    abs_y = np.array(np.abs(grad_y), dtype=np.uint8)
    abs_grad = np.array(np.abs(grad), dtype=np.uint8)
    # cv.imshow("x", abs_x)
    # cv.imshow("y", abs_y)
    # cv.imshow("grad", abs_grad)
    # cv.waitKey()
    show_image(abs_x, abs_y, abs_grad)

在這裡插入圖片描述

Python Numpy gradient原始碼解析

複習影象梯度，發現Numpy有一個梯度計算函式，解析它的原始碼和需要注意的問題，最後自定義一個梯度函式目錄用法解析示例和問題原始碼解析自定義用法解析 Numpy提供了陣列梯度計算函式 gradient(f, *varargs, **kwar

Python 列舉類原始碼解析

1、氣泡排序 def sortport(): for i in range(len(lis)-1): for j in range(len(lis)-1-i): if lis[j]>lis[j+1]:

Python 列表原始碼解析

列表作為python 最常用的一種資料型別, 一直很好奇其是如何實現動態的擴充套件的，於是上github看了原始碼（https://github.com/python/cpython/blob/master/Include/listobject.h， https://github.com/p

從原始碼解析Python的Flask框架中request物件的用法

from flask import requestFlask 是一個人氣非常高的Python Web框架，筆者也拿它寫過一些大大小小的專案，Flask 有一個特性我非常的喜歡，就是無論在什麼地方，如果你想要獲取當前的request物件，只要簡單的：從當前request

python 原始碼解析（一）

為了看懂 python 原始碼，特地學了 c++ ，依然看不懂，看了個大概，先留個坑，慢慢填。先從 python dict 物件開始看起。 python dict 物件是鍵值對的一種結構，類似於 java 的hashmap 物件。 dictd 物件，每個鍵值對，

跳一跳python輔助軟體思路及原始碼解析

跳一跳python輔助軟體思路及影象識別原始碼解析本文將梳理github上最火的wechat_jump_game的實現思路，並解析其影象處理部分原始碼首先廢話少說先看效果核心思想獲取棋子到下一個方塊的中心點的距離計算觸控式螢幕幕的時間

【Python測試】unittest原始碼解析一----測試用例是如何被執行的

在Python的單元測試中，有各種不同方式來執行使用者的測試用例，在接下來的篇幅中，我們會詳細敘述每種方式的具體執行流程。先來看下unittest中的__init__.py中提供的一個測試用例案例: import unittest class In

python原始碼解析（二）

一：PyObject 首先，先來看PyObject在object.h中的定義。typedef struct _object { _PyObject_HEAD_EXTRA Py_ssize_t ob_refcnt; struct _typeobjec

python unittest原始碼解析四----關於原始碼中的__unittest的用處

讀python的unittest原始碼的時候，發現每個檔案的開頭部分都有這樣一句話: __unittest = True我們知道，下劃線在python中含有特殊的意義，而雙下線(double underline scope)就表示私有的，那麼這裡在每個檔案中定義一個私

python dict 原始碼解析

雜湊表和雜湊衝突概念 python的字典是一種雜湊表，是根據關鍵碼值(Key value)而直接進行訪問的資料結構。也就是說，它通過把關鍵碼值對映到表中一個位置來訪問記錄，以加快查詢的速度。這個對映函式叫做雜湊函式(雜湊函式)，存放記錄的陣列叫做散

Python核心原始碼解析與C/CPP-API拓展程式設計（一）PyObject

開發十年，就只剩下這套架構體系了！ >>>

Python資料分析之numpy陣列全解析

1 什麼是numpy numpy是一個在Python中做科學計算的基礎庫，重在數值計算，也是大部分Python科學計算庫的基礎庫，多用於大型、多維資料上執行數值計算。在NumPy 中，最重要的物件是稱為 ndarray 的N維陣列型別，它是描述相同型別的元素集合，numpy所有功能幾乎都以ndarray

Python記憶體管理機制-《原始碼解析》

### Python 記憶體管理分層架構 ```c++ /* An object allocator for Python. Here is an introduction to the layers of the Python memory architecture, showing whe

Python優秀開源專案Rich原始碼解析

這篇文章對優秀的開源專案Rich的原始碼進行解析，OMG，盤他。為什麼建議閱讀原始碼，有兩個原因，第一，單純學語言很難在實踐中靈活應用，通過閱讀原始碼可以看到每個知識點的運用場景，印象會更深，以後寫程式碼的時候就能應用起來；第二，通過閱讀優秀的開原始碼，可以學習比人的程式碼規範、設計思路；第三，參與到開源社群

python簡單工廠模式解析

用戶維護 clas 只需要耦合分析計算根據 while 下面考慮《大話設計模式》中的一個例子：題目：用任意一種面向對象語言實現一個計算器控制臺程序。要求輸入兩個數和運算符號，得到結果。題目分析：程序應該做到：（1）可維護；（2）可復用；（3）可擴展；（4）靈

Python--Numpy安裝

安裝 ads ges 自己的 col 上網 style 3.6 什麽 1.Numpy簡介 NumPy系統是Python的一種開源的數值計算擴展，一個用python實現的科學計算包。 2.下載下載地址：https://pypi.python.org/pypi/numpy#d

python numpy數組中的復制問題

方便 highlight 復制比較 cto light 等於不變 brush vector = numpy.array([5, 10, 15, 20]) equal_to_ten_or_five = (vector == 10) | (vector == 5) vec

Python numpy.ZIP 安裝問題

appdata 最好 sta 直接 nump setup.py 簡單 numpy 問題　　　　今天在python上安裝numpy，按照網上教程，安裝pip,然後命令行直接:pip install numpy 。但是一直因為資源問題下載失敗。後來下載了一個numpy-1

python: numpy--函數 shape用法

[1] 舉例 pytho 簡寫 rom board num sse mod http://blog.csdn.net/u010758410/article/details/71554224 shape函數是numpy.core.fromnumeric中的函數，它的功能是查

python爬蟲知識點三--解析豆瓣top250數據

www request 10.8 blog 分享 encode uid gb2 on() 一。利用cookie訪問import requests headers = {‘User-Agent‘: ‘Mozilla/5.0 (Windows NT 6.3; WOW64)

Python Numpy gradient原始碼解析

輸入

輸出

計算方式

差分公式

示例和問題

一維陣列

二維陣列

影象梯度計算需要注意的問題

相關推薦