12、TensorFlow 影象處理

阿新 • • 發佈：2019-01-24

一、影象編碼與解碼

影象在儲存時並不是直接記錄這些矩陣中的數字，而是記錄經過壓縮編碼之後的結果。所以要將一張影象還原成一個三維矩陣，需要解碼的過程。OpenCV 中的 imread 和 imwrite 就是一個解碼和編碼的過程。TensorFLow 中提供了相應的編碼和解碼的函式。

# 影象解碼函式
tf.image.decode_image(
    contents,
    channels=None,
    name=None
)

# 引數
contents: 0-D string. The encoded image bytes.
channels: An optional int. Defaults to 0. 
 Number of color channels for the decoded image.

# 返回值
Tensor with type uint8 with shape [height, width, num_channels] for BMP, JPEG, and PNG images and shape [num_frames, height, width, 3] for GIF images. 


# 影象編碼函式
tf.image.encode_jpeg()
tf.image.encode_png()

二、影象大小調整

# 1、縮放
tf.image.resize_images(
    images,
    size,
    method=ResizeMethod.BILINEAR,
    align_corners=False 

)

# 引數
images: 4-D Tensor of shape [batch, height, width, channels] or 3-D Tensor of shape [height, width, channels].

size: A 1-D int32 Tensor of 2 elements: new_height, new_width. The new size for the images.

method can be one of:
ResizeMethod.BILINEAR: 雙線性插值法，預設
ResizeMethod.NEAREST_NEIGHBOR: 最近鄰法
ResizeMethod.BICUBIC: 雙三線性插值法
ResizeMethod.AREA: 面積插值法

# 返回值(float) 

If images was 4-D, a 4-D float Tensor of shape [batch, new_height, new_width, channels]. If images was 3-D, a 3-D float Tensor of shape [new_height, new_width, channels].



# 2、裁剪(居中)或補零(四周均勻)
tf.image.resize_image_with_crop_or_pad(
    image,
    target_height,
    target_width
)

# 引數
image: 4-D Tensor of shape [batch, height, width, channels] or 3-D Tensor of shape [height, width, channels].

# 返回值
Cropped and/or padded image. If images was 4-D, a 4-D float Tensor of shape [batch, new_height, new_width, channels]. If images was 3-D, a 3-D float Tensor of shape [new_height, new_width, channels]



# 3、按比例居中裁剪
tf.image.central_crop(
    image,
    central_fraction
)



# 4、對輸入影象做剪裁併通過插值方法調整尺寸
tf.image.crop_and_resizecrop_and_resize(
    image,
    boxes,
    box_ind,
    crop_size,
    method='bilinear',
    extrapolation_value=0,
    name=None
)



# 5、沿著給定的 bbox 座標進行裁剪
tf.image.crop_to_bounding_box(
    image,
    offset_height,
    offset_width,
    target_height,
    target_width
)

# 引數
image: 4-D Tensor of shape [batch, height, width, channels] or 3-D Tensor of shape [height, width, channels].

bbox: the top-left corner of the returned image is at offset_height, offset_width in image, and its lower-right corner is at offset_height + target_height, offset_width + target_width.

# 返回值
If image was 4-D, a 4-D float Tensor of shape [batch, target_height, target_width, channels] If image was 3-D, a 3-D float Tensor of shape [target_height, target_width, channels]



# 6、沿著原影象補零到指定高度(target_height)和寬度(target_width)
tf.image.pad_to_bounding_boxpad_to_bounding_box(
    image,
    offset_height,
    offset_width,
    target_height,
    target_width
)

# 工作原理
Adds offset_height rows of zeros on top, offset_width columns of zeros on the left, and then pads the image on the bottom and right with zeros until it has dimensions target_height, target_width.

# 引數
image: 4-D Tensor of shape [batch, height, width, channels] or 3-D Tensor of shape [height, width, channels].

offset_height: Number of rows of zeros to add on top.
offset_width: Number of columns of zeros to add on the left.

target_height: Height of output image.
target_width: Width of output image.

# 返回值
If image was 4-D, a 4-D float Tensor of shape [batch, target_height, target_width, channels] If image was 3-D, a 3-D float Tensor of shape [target_height, target_width, channels]

三、影象翻轉、旋轉

# 1、(隨機)上下翻轉
tf.image.flip_up_down(image)
tf.image.random_flip_up_down(image，seed=None)


# 2、(隨機)左右翻轉
tf.image.flip_left_right(image)
tf.image.random_flip_left_right(image，seed=None)


# 3、沿對角線翻轉：交換影象的第一維和第二維
tf.image.transpose_image(image)
# 引數
image: 3-D tensor of shape [height, width, channels]

# 返回值
A 3-D tensor of the same type and shape as image


# 4、將影象逆時針旋轉 90*k 度
tf.image.rot90(image, k=1)
# 引數
image: A 3-D tensor of shape [height, width, channels].
k: A scalar integer. The number of times the image is rotated by 90 degrees.
name: A name for this operation (optional).

# 返回值
A rotated 3-D tensor of the same type and shape as image.


# 5、Rotate image(s) by the passed angle(s) in radians(弧度)
tf.contrib.image.rotate(
    images,
    angles,
    interpolation='NEAREST'
)
# 引數
images: A tensor of shape (num_images, num_rows, num_columns, num_channels) (NHWC), (num_rows, num_columns, num_channels) (HWC), or (num_rows, num_columns) (HW).

angles: A scalar angle to rotate all images by, or (if images has rank 4) a vector of length num_images, with an angle for each image in the batch.

interpolation: Interpolation mode. Supported values: "NEAREST", "BILINEAR".

# 返回值
Image(s) with the same type and shape as images, rotated by the given angle(s). Empty space due to the rotation will be filled with zeros.

四、影象色彩調整

# 1、調整 RGB 影象或灰度圖的亮度
# delta is the amount to add to the pixel values, should be in [0,1)
tf.image.adjust_brightness(
    image,
    delta
)


# 2、調整 RGB 影象的色相， delta must be in the interval [-1, 1]
tf.image.adjust_hue(
    image,
    delta,
    name=None
)


# 3、調整 RGB 影象或灰度圖的對比度
tf.image.adjust_contrast(
    images,
    contrast_factor
)


# 4、調整 RGB 影象的飽和度
tf.image.adjust_saturation(
    image,
    saturation_factor,
    name=None
)


# 5、在輸入影象上執行伽馬校正
tf.image.adjust_gamma(
    image,
    gamma=1,
    gain=1
)


# 6、在[-max_delta, max_delta]的範圍內隨機調整影象的亮度，0 的時候就是原始影象
tf.image.random_brightness(
    image,
    max_delta,
    seed=None
)


# 7、在[-max_delta, max_delta]的範圍內隨機調整影象的色相
# max_delta must be in the interval [0, 0.5]
tf.image.random_hue(
    image,
    max_delta,
    seed=None
)


# 8、在[lower, upper] 的範圍隨機調整影象的對比度
tf.image.random_contrast(
    image,
    lower,
    upper,
    seed=None
)


# 9、在[lower, upper] 的範圍隨機調整影象的飽和度
tf.image.random_saturation(
    image,
    lower,
    upper,
    seed=None
)

# 10、影象色彩空間轉換
tf.image.rgb_to_grayscale()
tf.image.grayscale_to_rgb()
tf.image.hsv_to_rgb()
tf.image.rgb_to_hsv()  # 必須先轉換為實數(float32)影象


# 11、影象資料型別轉換，eg: 轉成 uint8-->float32, 除 255 轉成 [0,1)
tf.image.convert_image_dtype(
    image,
    dtype,
    saturate=False,
    name=None
)


# 12、影象標準化處理(均值為0，方差為1)
tf.image.per_image_standardization(image)

五、處理標註框(bounding_box)

# 1、Draw bounding boxes on a batch of images
draw_bounding_boxes(
    images,
    boxes,
    name=None
)
# 引數
images: A Tensor. Must be one of the following types: float32, half. 4-D with shape [batch, height, width, depth]. A batch of images.

boxes: A Tensor of type float32. 3-D with shape [batch, num_bounding_boxes, 4] containing bounding boxes.

# 返回值
A Tensor. Has the same type as images. 4-D with the same shape as images. The batch of input images with bounding boxes drawn on the images.

# 資料型別和維度注意事項
images 要求為實數，所以需要先將影象矩陣轉化為實數型別，並增加一個 batch 維度 1，eg:
batched = tf.expand_dims(
    tf.image.convert_image_dtype(images, tf.float32),
    axis=0
)

# 座標系順序和相對座標注意事項
The coordinates of the each bounding box in boxes are encoded as [y_min, x_min, y_max, x_max]. The bounding box coordinates are floats in [0.0, 1.0] relative to the width and height of the underlying image.

For example, if an image is 100 x 200 pixels and the bounding box is [0.1, 0.2, 0.5, 0.9], the bottom left and upper right coordinates of the bounding box will be (10, 40) to (50, 180).



# 2、非極大值抑制
tf.image.non_max_suppression(
    boxes,
    scores,
    max_output_size,
    iou_threshold=0.5,
    name=None
)



# 3、Generate a single randomly distorted bounding box for an image
tf.image.sample_distorted_bounding_box(
    image_size,
    bounding_boxes,
    seed=None,
    seed2=None,
    min_object_covered=None,
    aspect_ratio_range=None,
    area_range=None,
    max_attempts=None,
    use_image_if_no_bounding_boxes=None,
    name=None
)

六、參考資料

12、TensorFlow 影象處理

一、影象編碼與解碼影象在儲存時並不是直接記錄這些矩陣中的數字，而是記錄經過壓縮編碼之後的結果。所以要將一張影象還原成一個三維矩陣，需要解碼的過程。OpenCV 中的 imread 和 imwrite 就是一個解碼和編碼的過程。TensorFLow 中提

OpenCV、Skimage、PIL影象處理的細節差異

在進行影象處理時一點要注意各個庫之間的細微差異，還有要注意影象放縮時插值方法的選擇，而且即使是相同的插值方法，各個庫的實現也不同，結果也會有些許差異 PIL(RGB) 首先介紹PIL(Python Imaging Library)這個庫，這是Python中最基礎的影象處理庫，主要注意對圖片進行處理時w，

【Shader特效8】著色器濾鏡、影象卷積與濾波、數字影象處理

##說在開頭： PhotoShop和特效相機中有許多特效的濾鏡。片元著色器時基於片元為單位執行的，完全可以實現特殊的濾鏡效果。要想實現這些濾鏡效果還需要簡單的瞭解《數字影象處理》中的影象卷積與濾波的一些

Tensorflow影象處理相關操作

#對影象的處理 import matplotlib.pyplot as plt import tensorflow as tf #讀取影象的原始資料 image_raw_data=tf.gfile.FastGFile("./path/to/picture/timg.j

【資訊科技】【2010.12】利用影象處理實現實時事故檢測系統的有效步驟

本文為印度安娜大學（作者：LOGESHVASU）的電子與通訊工程學士論文，共145頁。隨著現代CPU處理器運算能力的提高，許多複雜的實時應用已經成為可能，並在世界範圍內的各個領域得以實現。其中廣泛的實時應用之一是視訊監控系統。視訊監控系統已被用於安全監控、異

TensorFlow學習－－tensorflow影象處理--影象讀取/格式轉換1

一張RGB格式的彩色影象可以看成是一個三維矩陣，矩陣中的每一個數代表影象不同的位置上不同的顏色的亮度．但是影象儲存時並不是直接儲存這些三維矩陣，而是要先對其進行壓縮編碼再儲存．因此讀取影象的過程其實是先讀取其壓縮編碼後的結果，然後將其解碼的過程．讀取影象&轉換格

視訊、圖形影象處理之Opencv技術記錄（五）、Opencv教程之影象處理（imgproc模組）之平滑影象

目標在本教程中，您將學習如何使用OpenCV函式應用各種線性濾鏡來平滑影象，例如：理論注意下面的解釋屬於Richard Szeliski和LearningOpenCV的計算機視覺：演算法和應用一書平滑，也稱為模糊，是一種簡單且經常使用的影象處理操作。

TensorFlow學習－－tensorflow影象處理--影象讀取/格式轉換

tensorflow影象處理一張RGB格式的彩色影象可以看成是一個三維矩陣，矩陣中的每一個數代表影象不同的位置上不同的顏色的亮度．但是影象儲存時並不是直接儲存這些三維矩陣，而是要先對其進行壓縮編碼再

視訊、圖形影象處理之Opencv技術記錄（六）、均衡直方圖

目標在本教程中，您將學習：什麼是影象直方圖以及為什麼它有用理論什麼是影象直方圖？它是影象強度分佈的圖形表示。它量化了所考慮的每個強度值的畫素數。什麼是直方圖均衡？

1-2、數字影象處理基礎

數學建模題目中有時會涉及到與數字影象有關的操作。在這類題目中，往往不會涉及到太多與數字影象處理相關的專業知識，但是要求程式設計師瞭解影象儲存格式與常用基礎操作等。一、數字影象常用儲存格式。數字影象在計算機中以矩陣形式儲存，通過一個或多個數字表示每個點

第52章、Bitmap影象處理（從零開始學Android）

1、Drawable → Bitmap public static Bitmap drawableToBitmap(Drawable drawable) { Bitmap bitmap = Bitmap .createBitmap( drawable.getIntrinsicWidth(), drawabl

Python 影象處理 OpenCV （12）： Roberts 運算元、 Prewitt 運算元、 Sobel 運算元和 Laplacian 運算元邊緣檢測技術

![](https://cdn.geekdigging.com/opencv/opencv_header.png) 前文傳送門： [「Python 影象處理 OpenCV （1）：入門」](https://www.geekdigging.com/2020/05/17/5513454552/) [「Pyt

影象處理與計算機視覺基礎、經典以及最近發展

******************************************************************************************************************************************************

[Python影象處理] 九.形態學之影象開運算、閉運算、梯度運算

該系列文章是講解Python OpenCV影象處理知識，前期主要講解影象入門、OpenCV基礎用法，中期講解影象處理的各種演算法，包括影象銳化運算元、影象增強技術、影象分割等，後期結合深度學習研究影象識別、影象分類應用。希望文章對您有所幫助，如果有不足之處，還請海涵~ 同時推薦作者的

第十九節、基於傳統影象處理的目標檢測與識別(詞袋模型BOW+SVM附程式碼)

在上一節、我們已經介紹了使用HOG和SVM實現目標檢測和識別，這一節我們將介紹使用詞袋模型BOW和SVM實現目標檢測和識別。一詞袋介紹詞袋模型(Bag-Of-Word)的概念最初不是針對計算機視覺的，但計算機視覺會使用該概念的升級。詞袋最早出現在神經語言程式學(NLP)和資訊檢索(IR)領域，該模型

影象處理之影象基本變化（平移、縮放、旋轉）（Octave實現）

在模式識別及計算機視覺中，要經常進行影象的變化。例如：在識別手寫數字中，我們可能在廣泛應用中要求所有的圖片都是20*20這麼好的規格。所以，我們就需要進行縮放來達到目的。今天來總結下學到的影象的基本變換。首先我們計 (w,v) (w,v)為源影象的

膨脹、腐蝕、開、閉運算——數字影象處理中的形態學

轉自：https://blog.csdn.net/welcome_xu/article/details/6694985 膨脹、腐蝕、開、閉運算是數學形態學最基本的變換。本文主要針對二值影象的形態學膨脹：把二值影象各1畫素連線成分的邊界擴大一層（填充邊緣或0畫素內部的孔）；腐蝕：把二

形態學影象處理:開運算、閉運算、形態學梯度、頂帽、黑帽合輯

說明開運算：先腐蝕後膨脹的過程，可以用來消除小物體、在纖細點處分離物體、平滑較大物體的邊界的同時並不明顯改變其面積。閉運算：先膨脹後腐蝕的過程，能夠排除小型黑洞(黑色區域)。形態學梯度：膨脹圖與腐蝕圖之差，對二值影象進行這一操作可以將團塊（blob）的邊緣突出出來。可以用形態學

影象處理（十一）影象分割(3)泛函能量LevelSet、snake分割

一、level set相關理論基於水平集的影象分割演算法是一種進化版的Snake演算法，也是需要給定初始的輪廓曲線，然後根據泛函能量最小化，進行曲線演化。水平集的方法，用的是一種隱式函式的方法，這個演算法比較難理解，我一年前開始搞這個演算法的時候，雖然知道程式碼怎麼寫，但是它的原理推

影象處理中飽和度、色調、對比度的定義

目錄飽和度色調對比度轉自這裡影象處理(image processing)，用計算機對影象進行分析，以達到所需結果的技術。又稱影像處理。影象處理一般指數字影象處理。數字影象是指用工業相機、攝像機、掃描器等裝置經

12、TensorFlow 影象處理

一、影象編碼與解碼

二、影象大小調整

三、影象翻轉、旋轉

四、影象色彩調整

五、處理標註框(bounding_box)

六、參考資料

相關推薦