On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention

阿新 • • 發佈：2022-05-06

和用LSTM的方法對比，

和transform相比主要區別在於編碼器上，由3部分構成：

1、Shallow CNN，用於控制計算量

2、Adaptive 2D positional encoding

論文中說Transformer的Position Encoding模組可能在視覺作用中起不了作用，但是位置資訊又很重要，尤其是論文致力於解決任意形狀的文字識別問題，作者對位置編碼進行了可學習的自適應，目的是

E是影象卷積特徵，g是池化操作，然後經過線性層分別得到alpha和beta，再分別針對影象的h,w得到編碼資訊（按照Transformer位置編碼方式）。

識別出的α和β直接影響高度和寬度位置編碼，以控制水平軸和垂直軸之間的相對比率，以表達空間分集。通過學習從輸入推斷出α和β，A2DPE允許模型沿高度和寬度方向調整長度元素。

We visualize random input images from three groups with different predicted aspect ratios, as a by-product of A2DPE. Figure 7 shows the examples according to the ratios α/β. Low aspect ratio group, as expected, contains mostly horizontal samples, and high aspect ratio group contains mostly vertical samples. By dynamically adjusting the grid spacing, A2DPE reduces the representation burden for the other modules, leading to performance boost.

3、Locality-aware feedforward layer

For good STR performance, a model should not only utilize long-range dependencies but also local vicinity around single characters.

作者認為transformer的自監督長在長距離的關係處理，local關係處理的並不夠好，所以在feedforward位置作者做了從a到c的替換，提升相近特徵間的互動。

512-d的不同step的特徵利用卷積進行特徵互動，屬於transformer對cv區域性特徵的一種融合，感覺應該有一定作用。

On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention

和用LSTM的方法對比，和transform相比主要區別在於編碼器上，由3部分構成： 1、Shallow CNN，用於控制計算量

【論文閱讀】Effects of Emotional Music on Facial Emotion Recognition in Children with Autism Spectrum Disorder (ASD)

1.這篇文章究竟講了什麼問題？研究情感一致(congrunent)音樂對患有自閉症兒童的面部情感識別能力的影響

Jmeter報錯“Failed to write core dump. Minidumps are not enabled by default on client versions of Windows”

最近在新電腦上安裝jmeter，開啟無報錯，但一執行測試用例就閃退，報錯“Failed to write core dump. Minidumps are not enabled by default on client versions of Windows”

[CSS] Create Complex Shapes with CSS Clip Path and Border Radiusc (border-radius & clip-path)

In this lesson, we explore creating the Egghead Shell with CSS. We explore how different properties allow us to create different shapes and how we can use our developer tools to adjust and tweak style

go:index out of range [0] with length 0與non-constant array bound length

有一段程式碼，涉及陣列和指標： 1 //通過整形指標陣列獲取陣列中的元素 2 func test(){

【leetcode】1524. Number of Sub-arrays With Odd Sum

題目如下： Given an array of integersarr. Returnthe number of sub-arrayswithoddsum. As the answer may grow large, the answermust becomputed modulo10^9 + 7.

ORA-12012: error on auto execute of job 25；ORA-12005: may not schedule automatic refresh for times in the past

　　使用BethuneX做巡檢，連續報如下錯誤： --錯誤 Thu Oct 29 14:36:04 2020 Errors in file /u01/app/oracle/diag/rdbms/mtws/mtws/trace/mtws_j000_33913.trc:

python pandas Dataframe增加一列遇到A value is trying to be set on a copy of a slice from a DataFrame.

技術標籤：pythonpython大資料pandasDataframe df2是Dataframe資料，直接在其上面增加一列，使用如下程式碼：

【悟空雲課堂】第十九期：用不安全的授權建立臨時檔案漏洞（CWE-378: Creation of Temporary File With Insecure Permissions）

技術標籤：悟空雲課堂程式碼規範安全安全漏洞資訊保安java 關注公眾號“中科天齊軟體安全中心”（id：woocoom），一起漲知識！

org.dom4j.DocumentException: Error on line 41 of document : 元素型別 “SPBMJC“ 必須由匹配的結束標記 “＜/SPBMJC＞“ 終止

技術標籤：報錯 xml檔案解析的時候報錯 org.dom4j.DocumentException: Error on line 41 of document: 元素型別 "SPBMJC" 必須由匹配的結束標記 "</SPBMJC>" 終止。 Nested exception

[LeetCode] 1155. Number of Dice Rolls With Target Sum 擲骰子的N種方法

You haveddice and each die hasffaces numbered1, 2, ..., f. Return the number of possible ways (out offdtotal ways)modulo109+ 7 to roll the dice so the sum of the face-up numbers equalstarget.

ORA-12012: error on auto execute of job "SYS"."ORA$AT_OS_OPT_SY_363"

環境: OS:Centos 7 DB:18.3.0.0 問題:Errors in file /u01/oracle/app/diag/rdbms/slnngk/slnngk1/trace/slnngk1_j000_949.trc:ORA-12012: error on auto execute of job \"SYS\".\"ORA$AT_OS_OPT_SY_363\"ORA-200

ON THE ROLE OF PLANNING IN MODEL-BASED DEEP REINFORCEMENT LEARNING

發表時間：2021（ICLR 2021）文章要點：這篇文章想要分析model-based reinforcement learning (MBRL)裡面各個部分的作用。文章以muzero為基礎，回答了三個問題

The balance sheet of KriBank starts with an allowance for loan losses of $2.66 million. During the year, KriBank writes-off worthless loans amounting to $1.68 million, reco

The balance sheet of KriBank starts with an allowance for loan losses of $2.66 million. During the year, KriBank writes-off worthless loans amounting to $1.68 million, recovers $0.44 million on loans

Identify three possible adverse effects on an entity’s financial statements arising from recognition of a lease arrangement on the statement of financial position.

On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention

1、Shallow CNN，用於控制計算量

2、Adaptive 2D positional encoding

3、Locality-aware feedforward layer

On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention

【論文閱讀】Effects of Emotional Music on Facial Emotion Recognition in Children with Autism Spectrum Disorder (ASD)

Jmeter報錯“Failed to write core dump. Minidumps are not enabled by default on client versions of Windows”

[CSS] Create Complex Shapes with CSS Clip Path and Border Radiusc (border-radius & clip-path)

go:index out of range [0] with length 0與non-constant array bound length

【leetcode】1524. Number of Sub-arrays With Odd Sum

ORA-12012: error on auto execute of job 25；ORA-12005: may not schedule automatic refresh for times in the past

python pandas Dataframe增加一列遇到A value is trying to be set on a copy of a slice from a DataFrame.

【悟空雲課堂】第十九期：用不安全的授權建立臨時檔案漏洞（CWE-378: Creation of Temporary File With Insecure Permissions）

org.dom4j.DocumentException: Error on line 41 of document : 元素型別 “SPBMJC“ 必須由匹配的結束標記 “＜/SPBMJC＞“ 終止

[LeetCode] 1155. Number of Dice Rolls With Target Sum 擲骰子的N種方法

ORA-12012: error on auto execute of job "SYS"."ORA$AT_OS_OPT_SY_363"

ON THE ROLE OF PLANNING IN MODEL-BASED DEEP REINFORCEMENT LEARNING

The balance sheet of KriBank starts with an allowance for loan losses of $2.66 million. During the year, KriBank writes-off worthless loans amounting to $1.68 million, reco

Identify three possible adverse effects on an entity’s financial statements arising from recognition of a lease arrangement on the statement of financial position.

[LeetCode] 1292. Maximum Side Length of a Square with Sum Less than or Equal to Threshold 元素和小於等於閾值的正方形的最大邊長

org.dom4j.DocumentException: Error on line 1 of document : 前言中不允許有內容。

A note on the calculation of some functions in finite fields: Tricks of the Trade解讀

LeetCode 1524 Number of Sub-arrays With Odd Sum 思維

Configuration of your jobs with .gitlab-ci.yml

On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention

1、Shallow CNN，用於控制計算量

2、Adaptive 2D positional encoding

3、Locality-aware feedforward layer

相關推薦