Euphrates: Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision

阿新 • • 發佈：2018-12-10

（這篇部落格不設計文中的硬體部分）

這篇文章同樣也講述瞭如何在live video中應用motion estimation的演算法，通過應用上下幀之間的相似資訊來加速detection並且維持較高的accuracy。

文中列了一張不同的object detection演算法的效率和準確率的比較的圖。

在通常的live video的處理當中，大致的一個模組分佈如下圖所示：

其中的imge sensing部件就是用來就收從攝像頭傳來的影象流；IPS模組用來對得到的原始的攝像頭影象做一些處理；後端的部分就需要從加工過的圖片中提取出有用的資訊。

在這篇文章中，ISP處理影象得到的motion的資訊不像其他的一些演算法一樣使用過之後就簡單的丟棄，Eupharates會儲存下來進一步提升整個motion estimation的準確性。

同樣的這篇文章使用的motion estimation的演算法也是block matching。將整張image分成若干個 $L\times{L}$ 個mrcroblocks，然後衡量匹配差距的標準被定義為：Sum of Absolute Differences(SAD)。對於每個MB，搜尋的範圍就是水平和垂直方向上的 $2d+1$ 的範圍，如下圖所示。

這個演算法的時間複雜度很好計算，每個 $L\times{L}$ 的MB需要 $L^2(2d+1)^2$ 次計算；但是文中指出了另外一個更快的近似演算法叫做TSS（Three Step Search），對於每個MB只需要做 $L^2(1+8\log_2(d+1))$ 的複雜度。

最終BM演算法會為每個MB生成一個motion vector，代表MB之間的位移和其與前一幀之間最接近的block。

演算法將video的幀分為兩類：Inference frame和Extracpolation frame。前者經過完整的CNN網路，後者則是通過motion estimation來估計物體的位置。

在一個視野當中所有pixel的唯一的平均值在一定程度上能夠代表這個感受野當中物體的全域性位移，所以演算法的第一步就是對於一個給定的POI，計算畫素層的平均位移（式1），正如上文所提到的，這些位移都是MB-based的位移計算而來的。平均的位移很容易受到物體變形（肢體活動）的影響，所以需要增加除躁的步驟。

對於每個MV，計算其置信值，這個置信值的是高度依賴於SAD的，原因非常直觀啦。式2給出了置信值的計算方式，限制在了[0,1]的範圍內，最後對於每個ROI的置信值，只需要計算其所包含的MV的置信值的均值即可。

最終可以對置信度高的位移施加更大的權重，如式3所示，我們發現是應用到了之前位移的效應。

最後為了更好地解決物體形變（跑步運動員的擺手等）的問題，將每個ROI拆分成若干個sub-ROI，然後每個sub-ROI使用上述相同的方法，最後計算能夠將所有sub-ROI都涵蓋的最小的bounding box就好了。

就如何選擇I-frame提出了兩種方案，一種是簡單的constant方案，每隔一定的幀數選擇一張作為I-frame；還有一種自適應的方案，對哪個兩幀之間的差距超過一定的threshold的時候減小中間的間隔，否則增大間隔。

Euphrates: Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision

（這篇部落格不設計文中的硬體部分）這篇文章同樣也講述瞭如何在live video中應用motion estimation的演算法，通過應用上下幀之間的相似資訊來加速detection並且維持較高的accuracy。文中列了一張不同的object detection演算

論文學習：YodaNN1: An Architecture for Ultra-Low Power Binary-Weight CNN Acceleration

摘要：The computational effort of today’s CNNs requires power-hungry parallel processors（高耗能並行處理器） or GP-GPUs（計算圖形處理器）.Recent develo

bzoj3969 [WF2013]Low Power

lose col for esp bzoj3 alt type color target 傳送門：http://www.lydsy.com/JudgeOnline/problem.php?id=3969 【題解】二分答案x，貪心選取，如果選取了i個，有j對，那麽要滿足i&

bzoj 3969 LOW Power

iostream false -- print led cst memory cnblogs can 3969: [WF2013]Low Power Time Limit: 20 Sec Memory Limit: 256 MB 題目連接 http://www.lydsy

[CortexM0--stm32f0308]Low Power Mode

資料中斷 npe 計時 epo 出場操作一個 parent 問題描寫敘述 stm32f0308正常是運行在Run mode下。這樣的mode是在reset之後的默認模式。Low Power Mode。即低功耗模式。用於在IC空暇時能夠考慮選擇進入

Database Design for Sexbale Forum

color alt spa 分享 blog ans -a con mic Mars March 17, 2015 Database Design for Sexbale Forum

系統設計Design For Failure思想

前端結束領導 rect radi with 企業信息化 lex business 系統設計Design For Failure思想 Complex systems fail in spectacular ways. Failure isn’t a questi

【論文閱讀】Learning Dual Convolutional Neural Networks for Low-Level Vision

論文閱讀（【CVPR2018】Jinshan Pan - Learning Dual Convolutional Neural Networks for Low-Level Vision）本文針對低層視覺問題，提出了一般性的用於解決低層視覺問題的對偶卷積神經網路。作者認為，低層視覺問題，如常見的有

Design for social innovation

There is no doubt that we live in interesting times. Facing unprecedented challenges and previously unimaginable opportunities. The big question is whether

Case Study: How Research Simplified My UX Design for Healthcare App

How Research Simplified UX Design for Healthcare AppRegardless of your vision of the product and your confidence in its greatness, the future of it depends

Android Developers Blog: Introducing Oboe: A C++ library for low latency audio

Posted by Don Turner, Developer Advocate, Android Audio Framework This week we released the first production-ready version of Oboe - a C++ library f

Tier3D phone and watch have radical new design for Artificial Intelligence

Srini Srinivasan a founder of Tier3D (and veteran Silicon Valley executive & investor/advisor with companies like WebEx, Skype Qik, and Tibco), elabora

Useful hints to build a perfect design for iPhone Xs

Apple presents new gadgets every year, and each of this device deserves the attention. But when iPhone X was presented to the public, rules of app designin

Graphic Design For Screen Based Communication

Usability of Web DesignFirst of all, let’s talk about what usability is in terms of web design. Web usability is the ease of use of a website. In order to

Advanced Design for Artificial Intelligence

Jane WangResearch Scientist, Google DeepMindJane Wang started out as an applied physicist modeling the complex network dynamics of memory systems in the br

Ask HN: Free or cheap alternative for low volume log management and searching?

Which open source software or SaaS service in spirit of timber.io, logentries, papertrail etc. would you recommend for searching JSON-based logs generated

Ask HN: Low power programmable device?

I'm looking for a device which can support a REPL for some reasonably pleasant programming language and has a very long battery life to solve project Euler

Ask HN: How to assure a high software quality in low power embedded platforms?

What are the best practices for ensuring a high quality of embedded software in environments like ARM Cortex-M, PIC32 etc.? Manual testing? Unit testing in

Artificial intelligence helps track down mysterious cosmic radio bursts: Machine learning algorithm also helps search for new ki

Researchers at Breakthrough Listen, a SETI project led by the University of California, Berkeley, have now used machine learning to discover 72 new fast r

Using AWS IoT Core in a Low-Power Application

At AWS, we work closely with customers to assist them in building various types of IoT solutions. We often hear from customers about the need to m

Euphrates: Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision

相關推薦