watershed演算法和影象分割

阿新 • • 發佈：2018-11-16

影象分割

學習opencv是為了工程應用，只學習不應用，等於白學習。下面分析一個影象分割的例子，以加強學習。

目標

學習使用cv::filter2D執行一些laplacian濾波來銳化影象
學習使用cv::distanceTransform來獲得二進位制影象的匯出表示，其中每個畫素的值被替換為最近的背景畫素的距離
學習使用cv::watershed從背景中隔離物體

程式碼

#include <opencv2/opencv.hpp>
#include <iostream>
using namespace std;
using 
 namespace cv;
int main(int, char** argv)
{
    // Load the image
    Mat src = imread(argv[1]);
    // Check if everything was fine
    if (!src.data)
        return -1;
    // Show source image
    imshow("Source Image", src);
    // Change the background from white to black, since that will help later to extract 

    // better results during the use of Distance Transform
    for( int x = 0; x < src.rows; x++ ) {
      for( int y = 0; y < src.cols; y++ ) {
          if ( src.at<Vec3b>(x, y) == Vec3b(255,255,255) ) {
            src.at<Vec3b>(x, y)[0] = 0;
            src.at<Vec3b>(x, y)[1] = 0 
;
            src.at<Vec3b>(x, y)[2] = 0;
          }
        }
    }
    // Show output image
    imshow("Black Background Image", src);
    // Create a kernel that we will use for accuting/sharpening our image
    Mat kernel = (Mat_<float>(3,3) <<
            1,  1, 1,
            1, -8, 1,
            1,  1, 1); // an approximation of second derivative, a quite strong kernel
    // do the laplacian filtering as it is
    // well, we need to convert everything in something more deeper then CV_8U
    // because the kernel has some negative values,
    // and we can expect in general to have a Laplacian image with negative values
    // BUT a 8bits unsigned int (the one we are working with) can contain values from 0 to 255
    // so the possible negative number will be truncated
    Mat imgLaplacian;
    Mat sharp = src; // copy source image to another temporary one
    filter2D(sharp, imgLaplacian, CV_32F, kernel);
    src.convertTo(sharp, CV_32F);
    Mat imgResult = sharp - imgLaplacian;
    // convert back to 8bits gray scale
    imgResult.convertTo(imgResult, CV_8UC3);
    imgLaplacian.convertTo(imgLaplacian, CV_8UC3);
    // imshow( "Laplace Filtered Image", imgLaplacian );
    imshow( "New Sharped Image", imgResult );
    src = imgResult; // copy back
    // Create binary image from source image
    Mat bw;
    cvtColor(src, bw, CV_BGR2GRAY);
    threshold(bw, bw, 40, 255, CV_THRESH_BINARY | CV_THRESH_OTSU);
    imshow("Binary Image", bw);
    // Perform the distance transform algorithm
    Mat dist;
    distanceTransform(bw, dist, CV_DIST_L2, 3);
    // Normalize the distance image for range = {0.0, 1.0}
    // so we can visualize and threshold it
    normalize(dist, dist, 0, 1., NORM_MINMAX);
    imshow("Distance Transform Image", dist);
    // Threshold to obtain the peaks
    // This will be the markers for the foreground objects
    threshold(dist, dist, .4, 1., CV_THRESH_BINARY);
    // Dilate a bit the dist image
    Mat kernel1 = Mat::ones(3, 3, CV_8UC1);
    dilate(dist, dist, kernel1);
    imshow("Peaks", dist);
    // Create the CV_8U version of the distance image
    // It is needed for findContours()
    Mat dist_8u;
    dist.convertTo(dist_8u, CV_8U);
    // Find total markers
    vector<vector<Point> > contours;
    findContours(dist_8u, contours, CV_RETR_EXTERNAL, CV_CHAIN_APPROX_SIMPLE);
    // Create the marker image for the watershed algorithm
    Mat markers = Mat::zeros(dist.size(), CV_32SC1);
    // Draw the foreground markers
    for (size_t i = 0; i < contours.size(); i++)
        drawContours(markers, contours, static_cast<int>(i), Scalar::all(static_cast<int>(i)+1), -1);
    // Draw the background marker
    circle(markers, Point(5,5), 3, CV_RGB(255,255,255), -1);
    imshow("Markers", markers*10000);
    // Perform the watershed algorithm
    watershed(src, markers);
    Mat mark = Mat::zeros(markers.size(), CV_8UC1);
    markers.convertTo(mark, CV_8UC1);
    bitwise_not(mark, mark);
//    imshow("Markers_v2", mark); // uncomment this if you want to see how the mark
                                  // image looks like at that point
    // Generate random colors
    vector<Vec3b> colors;
    for (size_t i = 0; i < contours.size(); i++)
    {
        int b = theRNG().uniform(0, 255);
        int g = theRNG().uniform(0, 255);
        int r = theRNG().uniform(0, 255);
        colors.push_back(Vec3b((uchar)b, (uchar)g, (uchar)r));
    }
    // Create the result image
    Mat dst = Mat::zeros(markers.size(), CV_8UC3);
    // Fill labeled objects with random colors
    for (int i = 0; i < markers.rows; i++)
    {
        for (int j = 0; j < markers.cols; j++)
        {
            int index = markers.at<int>(i,j);
            if (index > 0 && index <= static_cast<int>(contours.size()))
                dst.at<Vec3b>(i,j) = colors[index-1];
            else
                dst.at<Vec3b>(i,j) = Vec3b(0,0,0);
        }
    }
    // Visualize the final image
    imshow("Final Result", dst);
    waitKey(0);
    return 0;

程式碼說明

通過檔案載入影象，並檢查顯示。

    // Load the image
    Mat src = imread(argv[1]);
    // Check if everything was fine
    if (!src.data)
        return -1;
    // Show source image
    imshow("Source Image", src);

2.如果影象背景是白色的，最好轉化成黑色的，在距離變換時這將有助於前景區分物件。（這個操作很生硬，因為很多時候影象都不是純色）

    // Change the background from white to black, since that will help later to extract
    // better results during the use of Distance Transform
    for( int x = 0; x < src.rows; x++ ) {
      for( int y = 0; y < src.cols; y++ ) {
          if ( src.at<Vec3b>(x, y) == Vec3b(255,255,255) ) {
            src.at<Vec3b>(x, y)[0] = 0;
            src.at<Vec3b>(x, y)[1] = 0;
            src.at<Vec3b>(x, y)[2] = 0;
          }
        }
    }
    // Show output image
    imshow("Black Background Image", src);

3.接下來銳化影象來強化前景物體的邊緣。通過使用laplacian濾波。

    // Create a kernel that we will use for accuting/sharpening our image
    Mat kernel = (Mat_<float>(3,3) <<
            1,  1, 1,
            1, -8, 1,
            1,  1, 1); // an approximation of second derivative, a quite strong kernel
    // do the laplacian filtering as it is
    // well, we need to convert everything in something more deeper then CV_8U
    // because the kernel has some negative values,
    // and we can expect in general to have a Laplacian image with negative values
    // BUT a 8bits unsigned int (the one we are working with) can contain values from 0 to 255
    // so the possible negative number will be truncated
    Mat imgLaplacian;
    Mat sharp = src; // copy source image to another temporary one
    filter2D(sharp, imgLaplacian, CV_32F, kernel);
    src.convertTo(sharp, CV_32F);
    Mat imgResult = sharp - imgLaplacian;
    // convert back to 8bits gray scale
    imgResult.convertTo(imgResult, CV_8UC3);
    imgLaplacian.convertTo(imgLaplacian, CV_8UC3);
    // imshow( "Laplace Filtered Image", imgLaplacian );
    imshow( "New Sharped Image", imgResult );

4.轉成灰度影象和二值化。

    // Create binary image from source image
    Mat bw;
    cvtColor(src, bw, CV_BGR2GRAY);
    threshold(bw, bw, 40, 255, CV_THRESH_BINARY | CV_THRESH_OTSU);
    imshow("Binary Image", bw);

5.應用Distance Tranform於二值化的影象。另外，我們通過normalize處理影象。

    // Perform the distance transform algorithm
    Mat dist;
    distanceTransform(bw, dist, CV_DIST_L2, 3);
    // Normalize the distance image for range = {0.0, 1.0}
    // so we can visualize and threshold it
    normalize(dist, dist, 0, 1., NORM_MINMAX);
    imshow("Distance Transform Image", dist);

6.二值化影象然後執行腐蝕操作。

    // Threshold to obtain the peaks
    // This will be the markers for the foreground objects
    threshold(dist, dist, .4, 1., CV_THRESH_BINARY);
    // Dilate a bit the dist image
    Mat kernel1 = Mat::ones(3, 3, CV_8UC1);
    dilate(dist, dist, kernel1);
    imshow("Peaks", dist);

7.從每一個小塊上建立標記給watershed 演算法

    // Create the CV_8U version of the distance image
    // It is needed for findContours()
    Mat dist_8u;
    dist.convertTo(dist_8u, CV_8U);
    // Find total markers
    vector<vector<Point> > contours;
    findContours(dist_8u, contours, CV_RETR_EXTERNAL, CV_CHAIN_APPROX_SIMPLE);
    // Create the marker image for the watershed algorithm
    Mat markers = Mat::zeros(dist.size(), CV_32SC1);
    // Draw the foreground markers
    for (size_t i = 0; i < contours.size(); i++)
        drawContours(markers, contours, static_cast<int>(i), Scalar::all(static_cast<int>(i)+1), -1);
    // Draw the background marker
    circle(markers, Point(5,5), 3, CV_RGB(255,255,255), -1);
    imshow("Markers", markers*10000);

8.最後，我們使用watershed演算法，並且視覺化它。

    // Perform the watershed algorithm
    watershed(src, markers);
    Mat mark = Mat::zeros(markers.size(), CV_8UC1);
    markers.convertTo(mark, CV_8UC1);
    bitwise_not(mark, mark);
//    imshow("Markers_v2", mark); // uncomment this if you want to see how the mark
                                  // image looks like at that point
    // Generate random colors
    vector<Vec3b> colors;
    for (size_t i = 0; i < contours.size(); i++)
    {
        int b = theRNG().uniform(0, 255);
        int g = theRNG().uniform(0, 255);
        int r = theRNG().uniform(0, 255);
        colors.push_back(Vec3b((uchar)b, (uchar)g, (uchar)r));
    }
    // Create the result image
    Mat dst = Mat::zeros(markers.size(), CV_8UC3);
    // Fill labeled objects with random colors
    for (int i = 0; i < markers.rows; i++)
    {
        for (int j = 0; j < markers.cols; j++)
        {
            int index = markers.at<int>(i,j);
            if (index > 0 && index <= static_cast<int>(contours.size()))
                dst.at<Vec3b>(i,j) = colors[index-1];
            else
                dst.at<Vec3b>(i,j) = Vec3b(0,0,0);
        }
    }
    // Visualize the final image
    imshow("Final Result", dst);

watershed演算法和影象分割

影象分割學習opencv是為了工程應用，只學習不應用，等於白學習。下面分析一個影象分割的例子，以加強學習。目標學習使用cv::filter2D執行一些laplacian濾波來銳化影象學習使用cv::distanceTransform來獲得二進位制影象的匯出表

Opencv影象處理---基於距離變換和分水嶺演算法的影象分割

程式碼 #include <opencv2/opencv.hpp> #include <iostream> using namespace std; using namespace cv; int main(int, char** argv) {

Opencv 分水嶺演算法用於影象分割

目標 • 使用分水嶺演算法基於掩模的影象分割 • 學習函式： cv2.watershed() 原理任何一幅灰度影象都可以被看成拓撲平面，灰度值高的區域可以被看成是山峰，灰度值低的區域可以被看成是山谷。我們向每一個山谷中灌不同顏色的水，隨著水的位的升

opencv-用分水嶺演算法進行影象分割

參考： 3、https://opencv-python-tutroals.readthedocs.io/en/latest/py_tutorials/py_tutorials.html 4、https://github.com/makelove/OpenCV-Pytho

5.5用分水嶺演算法實現影象分割

<img src="https://img-blog.csdn.net/20160409220852681?watermark/2/text/aHR0cDovL2Jsb2cuY3Nkbi5uZXQv/font/5a6L5L2T/fontsize/400/fill/I0

區域增長演算法實現影象分割（網路）

下面程式碼採用堆疊的方式實現了給定種子點的區域生長，該方法步驟如下：（1）為輸出影象申請緩衝區，並初始化白色；（2）將種子點入棧，並將輸出影象對應位置編輯黑色；（3）從棧中彈出一個畫素點（該畫素點已在輸出緩衝區標記過），考察該畫素點的8鄰域中有畫素與種子點灰度差小於給定的閥值T，且該點在輸出

基於邊緣的影象分割——分水嶺演算法（watershed）演算法分析（附opencv原始碼分析）

最近需要做一個影象分割的程式，查了opencv的原始碼，發現opencv裡實現的影象分割一共有兩個方法，watershed和mean-shift演算法。這兩個演算法的具體實現都在segmentation.cpp檔案內。 watershed（分水嶺演算法）方法是一種基於邊界點

Opencv分水嶺演算法——watershed自動影象分割用法

分水嶺演算法是一種影象區域分割法，在分割的過程中，它會把跟臨近畫素間的相似性作為重要的參考依據，從而將在空間位置上相近並且灰度值相近的畫素點互相連線起來構成一個封閉的輪廓，封閉性是分水嶺演算法的一個重要特徵。其他影象分割方法，如閾值，邊緣檢測等都不會考慮畫素在空間關

【譯】DeepLab V2：基於深度卷積網、孔洞演算法和全連線CRFs的語義影象分割

【譯】DeepLab:基於深度卷積網、孔洞演算法和全連線CRFs的語義影象分割 Author: Liang-Chieh Chen 摘要在這項工作中有三個主要貢獻具有實質的實用價值: 第一，使用上取樣濾波器進行卷積，或者將“多孔 convolut

南開大學提出最新邊緣檢測與影象分割演算法，精度重新整理記錄（附開源地址）

作者 | 劉雲、程明明、胡曉偉、邊佳旺等譯者 | 劉暢整理 | Jane 出品 | AI科技大本營近日，南開大學媒體計算實驗室提出的最新邊緣檢測和影象過分割（可用於生成超畫素）被 IEEE PAMI 錄用。研究的第一作者也發微博稱：“這是第一個

影象分割演算法

影象分割的主要演算法： 1.基於閾值的分割方法 2.基於邊緣的分割方法 3.基於區域的分割方法 4.基於聚類分析的影象分割方法 5.基於小波變換的分割方法 6.基於數學形態學的分割方法 7.基於人工神經網路的分割方法 8.基於遺傳學演算法的分割方法

【計算機視覺必讀乾貨】影象分類、定位、檢測，語義分割和例項分割方法梳理

文章來源：新智元作者：張皓【導讀】本文作者來自南京大學計算機系機器學習與資料探勘所（LAMDA），本文直觀系統地梳理了深度學習在計算機視覺領域四大基本任務中的應用，包括影象分類、定位、檢測、語義分割和例項分割。本文旨在介紹深度學習在計算機視覺領域四大基本任務中的應用，包括分類(圖

利用k-means演算法對灰度影象分割

本文主要利用k-means來對灰度影象進行分割。首先對k-means進行簡單的介紹，然後直接上程式碼。那麼什麼是k-means演算法？K-means演算法是硬聚類演算法，是典型的基於原型的目標函式聚類方法的代表，它是資料點到原型的某種距離作為優化的目標函式，利用函式求極值的方法得到迭代運算的調整規則

深度學習 --- CNN的變體在影象分類、影象檢測、目標跟蹤、語義分割和例項分割的簡介（附論文連結）

以上就是卷積神經網路的最基礎的知識了，下面我們一起來看看CNN都是用在何處並且如何使用，以及使用原理，本人還沒深入研究他們，等把基礎知識總結完以後開始深入研究這幾個方面，然後整理在寫成部落格，最近的安排是後面把自然語言處理總結一下，強化學習的總結就先往後推一下。再往後是系統的學習一下演算法和資料

影象分割經典演算法--《泛洪演算法》（Flood Fill）

1.演算法介紹泛洪演算法——Flood Fill，（也稱為種子填充——Seed Fill）是一種演算法，用於確定連線到多維陣列中給定節點的區域。它被用在油漆程式的“桶”填充工具中，用於填充具有不同顏色的連線的，顏色相似的區域，並且在諸如圍棋（Go）和掃雷（M

影象分割經典演算法--《圖割》（Graph Cut、Grab Cut-----python實現）

1. 演算法介紹 Graph Cut（圖形切割）應用於計算機視覺領域用來有效的解決各種低階計算機視覺問題，例如影象平滑（image smoothing）、立體應對問題（stereo correspondence problem）、影象分割（image segme

OpenCv學習筆記4--影象分割之GrabCut演算法

說明: 本文章是opencv學習筆記系列的第四篇小結,可能前幾篇內容太多,排版也不甚合理,所以為了更好的觀看體驗,這次的內容會稍微少那麼一點點,再次重申歡迎star,不定時更新... 所謂影象分割指的是根據灰度、顏色、紋理和形狀等特徵把影象劃分成若干互不交迭的區域

一文詳解計算機視覺五大技術：影象分類、物件檢測、目標跟蹤、語義分割和例項分割

【導讀】目前，計算機視覺是深度學習領域最熱門的研究領域之一。計算機視覺實際上是一個跨領域的交叉學科，包括電腦科學（圖形、演算法、理論、系統、體系結構），數學（資訊檢索、機器學習），工程學（機器人、語音、自然語言處理、影象處理），物理學（光學），生物學（神經科學）和心理學（認知科學）等等。許

數字影象處理筆記（十二）：影象分割演算法

1 - 引言在影象識別中，如果可以將影象感興趣的物體或區別分割出來，無疑可以增加我們影象識別的準確率，傳統的數字影象處理中的分割方法多數基於灰度值的兩個基本性質不連續性以灰度突變為基礎分割一副影象，比如影象的邊緣相似性根據一組預定義的準則將一副影象分割為相似的

光照不均勻影象分割技巧2——頂帽變換和底帽變換

本文章由wikiwen撰寫，轉載請註明出處。文章連結：http://blog.csdn.net/kk55guang2/article/details/78490069 前言上篇文章介紹了通過分塊閾值的技巧解決光照不均勻影象分割出錯的問題，像大多數問題一樣，解決思路是多種

watershed演算法和影象分割

影象分割

目標

程式碼

程式碼說明

相關推薦