【NeurIPS2022】Cross Aggregation Transformer for Image Restoration

阿新 • • 發佈：2022-12-08

研究動機：當前方法 Transformer 方法把影象分成8x8的小塊處理，the square window lacks inter-window interaction, leading to the slow increase of the receptive field。同時，the channel-wise attention mechanism may lose some spatial information。影響了 Transformer 方法在影象修復裡的應用。

為此，作者提出了 Cross Aggregation Transformer，架構如下圖所示，主幹網路為RCAN（超解析度中用的非常多的網路），中間是多個 CAT block 的堆疊。CAT block 的核心是作者提出的注意力機制：Rectangle-Window Self-Attention（Rwin-SA）。

1、 Rectangle-Window Self-Attention

Rwin-SA如下圖所示，使用的是矩形的視窗，而不是正方形的視窗。視窗的寬度和高度分別為 sw 和 sh。此外，還使用 axis-shift 實現視窗間資訊的互動。

2、Locality Complementary Module

作者在計算注意力時，添加了一個獨立的卷積運算，稱為 Locality complementary module，如下圖所示，其實就是在V上加了一個卷積，attention 的結果和卷積融合。

【NeurIPS2022】Cross Aggregation Transformer for Image Restoration

1、 Rectangle-Window Self-Attention

2、Locality Complementary Module

【NeurIPS2022】Cross Aggregation Transformer for Image Restoration

【CVPR2022】Restormer: Efficient Transformer for High-Resolution Image Restoration

【Leetcode】 two sum #1 for rust solution

【NeurIPS】ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias

論文解讀（CDTrans）《CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation》

ECCV2020論文-稀疏性表示-Neural Sparse Representation for Image Restoration翻譯

【ARXIV2104】Attention in Attention Network for Image Super-Resolution

【CVPR2021】Contrastive Learning for Compact Single Image Dehazing

【TIP2021】A Progressive Coupled Network for Real-Time Image Deraining

【CVPR 2022】論文閱讀：MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

【DMCP】2020-CVPR-DMCP Differentiable Markov Channel Pruning for Neural Networks-論文閱讀

【經驗】GaussDB(for MySQL)效能優化 —— 日誌的“快遞驛站”

【CodeForces576D】Flights for Regular Customers

【CodeForces219D】Choosing Capital for Treeland

Battle for Wosneth2【概率】-2020百度之星複賽

【Azure Redis 快取 Azure Cache For Redis】在建立高階層Redis(P1)整合虛擬網路(VNET)後，如何測試VNET中資源如何成功訪問及配置白名單的效果

【論文筆記（5）ECCV2020】Graph convolutional networks for learning with few clean and many noisy labels

【nlp論文筆記】 Glyce: Glyph-vectors for Chinese Character Representations

Educational Codeforces Round 97 (Rated for Div. 2)【ABCD】

【Golang】for case 迴圈使用者選擇

【NeurIPS2022】Cross Aggregation Transformer for Image Restoration

1、 Rectangle-Window Self-Attention

2、Locality Complementary Module

相關推薦