1. 程式人生 > >Object detection with AI vision is getting easier for developers

Object detection with AI vision is getting easier for developers

Computer vision is really coming of age. Apps that use AI vision are becoming more practical and commonplace thanks to a combination of innovations that have really dovetailed to expand the possibilities. And, developing new object detection applications with deep learning is easier than you might think.

GPUs!

Not long ago, deep learning was a very slow and not-so-practical endeavor. To leap from a research field to a developer’s tool, we needed more than just faster CPUs and plenty of hours for training — we needed GPUs. GPUs provided the boost that was needed with parallel computing performance. For better accuracy in less time, make sure your AI platform has access to GPU power.

Deep learning with transfer learning

GPUs provided the performance, but advances in deep learning with deep neural networks also provided a big step toward solving problems such as computer vision. However, the real key for most developers is the ability to leverage the work of others. Writing deep learning and neural networks from scratch would not work for most of us. Thanks to the frameworks that are available and the models that are reusable, deep learning can be easy to leverage. An AI novice can do image classification and object detection. Not long ago, that was not realistic. Today, you can use “transfer learning” — i.e., use an existing image recognition model and retrain it with your own dataset.

Making it accessible

Getting these frameworks, models, and GPUs to work together can be a little tricky. Developers that need to focus on delivering a specific app don’t have time to go off and build AI platforms and learn to be data scientists. This is where simplification is needed, and with a platform like IBM PowerAI Vision, developers can focus on implementing the app instead of implementing the platform. Loading a dataset with images and labeling the objects are tasks that can be done by anyone — no need to write code. Using deep learning with GPU acceleration can also be done with a few clicks of a mouse. One more click, and the model is deployed as a REST endpoint so developers can do their thing and write their app with object detection.

Object detection

When using computer vision in an app, object detection versus image classification is a key decision. For example, if you have a picture of an animal, do you just need to know whether it is a dog or “not a dog”? Or do you want to locate the dogs in the picture and perhaps count them? Looking at the whole picture and choosing a label is image classification. Of course, it can be more complex than dog/not-dog (for example, breed identification). If you really want to locate the dog (or whatever object) in the picture, then you want object detection. Training a model for image classification requires example datasets for each label. Training a model for object detection requires a dataset where each appearance of each target object in each image is identified. For example, in PowerAI Vision you would select a dog label and draw a bounding box around each dog. After your dataset has enough images with enough labeled objects, you can train a model. If your model is not accurate enough, add to your dataset and train some more. When using object detection in an app, the main difference between object detection and image classification is how you use the location and count information. An object detection app is likely to use the location to somehow highlight the object. It could also simply count how many were detected.

The counting Coke bottles code pattern

The code pattern at https://github.com/IBM/powerai-vision-object-detection uses the example of locating and counting Coca-Cola bottles in images. You can walk through the code pattern to create a REST endpoint with Coke bottle detection. PowerAI Vision takes advantage of GPUs to accelerate your deep learning tasks. It has built-in deep learning models so that you can train and deploy an object detection model without experience in deep learning or computer vision. Given an image to analyze, the REST endpoint that you created returns location information for each detected object. You can use functions like this to create a store inventory app, or take the same techniques with your own dataset for a wide variety of object detection use cases that require recognizing, locating, and counting objects in images.



As of December 2017, the above code pattern refers to a Tech Preview of PowerAI Vision, which could be installed on Power Systems or used with a trial account in the cloud. Please give it a try and take what you learn to build great cognitive apps.

相關推薦

Object detection with AI vision is getting easier for developers

Computer vision is really coming of age. Apps that use AI vision are becoming more practical and commonplace thanks to a combination of innovations that

論文閱讀筆記(六)Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

采樣 分享 最終 產生 pre 運算 減少 att 我們 作者:Shaoqing Ren, Kaiming He, Ross Girshick, and Jian SunSPPnet、Fast R-CNN等目標檢測算法已經大幅降低了目標檢測網絡的運行時間。可是盡管如此,仍然

【Faster RCNN】《Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks》

NIPS-2015 NIPS,全稱神經資訊處理系統大會(Conference and Workshop on Neural Information Processing Systems),是一個關於機器學習和計算神經科學的國際會議。該會議固定在每年的12月舉行

Single-Shot Object Detection with Enriched Semantics

整合一下能夠查到的資料,然後結合自己的理解,算是對這篇文章的一個小小的總結吧。這是CVPR2018的一篇關於小目標檢測的文章,出發點是作者認為小目標的檢測資訊隨著層數的增加而不斷地丟失了,所以想利用語義分割強化淺層的特徵資訊(這裡強化可能用得不準確,但是我是這麼理解的)。整體的網路框架分為三個部分Detect

22.Deeply Supervised Salient Object Detection with Short Connections

具有短連線的深度監督的顯著性目標檢測 摘要 顯著目標檢測的最新進展是實質性的,主要受益於卷積神經網路(CNN)的爆炸式發展。最近開發的語義分割和顯著目標檢測演算法主要基於完全卷積神經網路(FCN)。對於沒有明確處理尺度空間問題的通用FCN模型,仍有很大的改進空間。整體巢狀邊緣檢測器(HED)

論文閱讀筆記二十六:Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks(CVPR 2016)

論文源址:https://arxiv.org/abs/1506.01497 tensorflow程式碼:https://github.com/endernewton/tf-faster-rcnn 摘要       目標檢測依賴於區域proposals演算法對目標的位置進

Soft-NMS: Improving object detection with one line of code

Improving object detection with one line of code 是ICCV2017的文章,主要是優化解決目標檢測後處理中非極大值抑制(NMS,Non Maximum Suppression)的問題。 NMS: 在解析本文主治之前,先回顧下

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Abstract SPPnet和Fast R-CNN雖然減少了演算法執行時間,但region proposal仍然是限制演算法速度的瓶頸。而Faster R-CNN提出了Region Proposal Network (RPN),該網路基於卷積特徵預測每個位置是否為物體以及

【論文筆記】Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

寫在前面:      我看的paper大多為Computer Vision、Deep Learning相關的paper,現在基本也處於入門階段,一些理解可能不太正確。說到底,小女子才疏學淺,如果有錯

【筆記】Faster-R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

論文程式碼:重要:訓練檔案.prototxt說明:http://blog.csdn.net/Seven_year_Promise/article/details/60954553從RCNN到fast R

[論文學習]《Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks 》

faster R-CNN的主要貢獻 提出了 region proposal network(RPN),通過該網路我們可以將提取region proposal的過程也納入到深度學習的過程之中。這樣做既增加了Accuracy,由降低了耗時。之所以說增加Accura

【翻譯】Faster-R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

摘要 目前最先進的目標檢測網路需要先用區域建議演算法推測目標位置,像SPPnet[7]和Fast R-CNN[5]這些網路已經減少了檢測網路的執行時間,這時計算區域建議就成了瓶頸問題。本文中,我們介紹一種區域建議網路(Region Proposal Network, R

Faster RCNN: Towards RealTime Object Detection with Region Proposal Networks+Visualizing and Underst

Faster RCNN是對之前的 RCNN、SPPNet、Fast RCNN 等目標檢測框架的進一步優化,將 Region Proposal 過程融合進入 CNN 模型,稱之為 RPN(Region Proposal Network),大幅降低了 test-time 計算量,

深度學習論文翻譯解析(十三):Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

論文標題:Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks   標題翻譯:基於區域提議(Region  Proposal)網路的實時目標檢測 論文作者:Shaoqing Ren, K

DeepLearning.ai作業:(4-3)-- 目標檢測(Object detection

title: ‘DeepLearning.ai作業:(4-3)-- 目標檢測(Object detection)’ id: dl-ai-4-3h tags: dl.ai homework categories: AI Deep Learning date:

DeepLearning.ai筆記:(4-3)-- 目標檢測(Object detection

title: ‘DeepLearning.ai筆記:(4-3)-- 目標檢測(Object detection)’ id: dl-ai-4-3 tags: dl.ai categories: AI Deep Learning date: 2018-10-11 1

「Computer Vision」Notes on Deep Learning for Generic Object Detection

QQ Group: 428014259 Sina Weibo:小鋒子Shawn Tencent E-mail:[email protected] http://blog.csdn.net/dgyuanshaofeng/article/details/83834249 [1]

[AI] 使用object detection訓練自己的模型用於物體識別

軟體環境 tensorflow (1.8.0) libprotoc 3.0.0/3.3.0 tensorflow models models/research/object_detection (2018/dec版本) models git clone https://

Fighting breast cancer with AI early detection Hack and Craft

Breast cancer awareness month is here and, with it, the latest statistics send a stark reminder of just how important early detection is in combating this

Setting Up Object Persistence with 6D.ai

Setting Up Object Persistence with 6D.aiHow To Make a Sample App with the 6D SDKWelcome to the 6D SDK. One of the primary features of the 6D SDK is Persist