The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization (DeepAugment)

Axin • Published: 2021-12-11


Hendrycks D., Basart S., Mu N., Kadavath S., Wang F., Dorundo E., Desai R., Zhu T., Parajuli S., Guo M., Song D., Steinhardt J., Gilmer J. The many faces of robustness: a critical analysis of out-of-distribution generalization. arXiv preprint arXiv:2006.16241, 2020.

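Overview

Using datasets that are either sampled or constructed (ImageNet Renditions, DeepFashion Remixed, StreetView StoreFronts), the authors test seven hypotheses:

1. Larger models improve robustness;
2. Self-attention improves robustness;
3. Diverse data augmentation improves robustness;
4. Pretraining on larger and more complex datasets improves robustness;
5. CNNs are biased toward texture, which hurts robustness;
6. Robustness is mainly reflected by accuracy on IID test data (that is, the most effective way to improve generalization is to improve IID test accuracy);
7. Robustness gained from synthetic data does not help with real-world distribution shifts.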

Main content

ImageNet-R

ImageNet-R contains artistic renditions of 200 ImageNet classes.

Note: the original ImageNet does not contain such artistic renditions.

StreetView StoreFronts (SVSF)

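SVSF is a dataset sampled from Google StreetView imagery. It covers three kinds of distribution shift: country, year, and capture hardware (camera).

Training set: photos taken in 2019 in the US/Mexico/Canada with the new camera system.

Test sets: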

Test set	Year	Country	Camera
1	2017	US/Mexico/Canada	new
2	2018	US/Mexico/Canada	new
3	2019	France	new
4	2019	US/Mexico/Canada	old
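
The split definitions above can be sketched as a small lookup. This is only an illustration of the table: the `Meta` record, its field names, and `svsf_split` are hypothetical and not part of any released SVSF tooling, and the sketch ignores any held-out IID test images drawn from the training distribution.

```python
from typing import NamedTuple, Optional

class Meta(NamedTuple):
    """Hypothetical per-image metadata record."""
    year: int
    country: str
    camera: str  # "new" or "old"

NORTH_AMERICA = frozenset({"US", "Mexico", "Canada"})
FRANCE = frozenset({"France"})

# (year, allowed countries, camera) for each split, copied from the table.
SPLITS = {
    "train":  (2019, NORTH_AMERICA, "new"),
    "test 1": (2017, NORTH_AMERICA, "new"),   # year shift
    "test 2": (2018, NORTH_AMERICA, "new"),   # year shift
    "test 3": (2019, FRANCE, "new"),          # country shift
    "test 4": (2019, NORTH_AMERICA, "old"),   # camera shift
}

def svsf_split(m: Meta) -> Optional[str]:
    """Return the name of the split whose definition matches m, or None."""
    for name, (year, countries, camera) in SPLITS.items():
        if m.year == year and m.country in countries and m.camera == camera:
            return name
    return None

print(svsf_split(Meta(2019, "France", "new")))
```

Each test set changes exactly one of the three axes relative to the training set, which is what lets the effect of year, country, and camera be measured separately.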

DeepFashion Remixed

DFR consists of one training set and 8 test sets. Each test set differs from the training set in exactly one attribute.

Set	object size	object occlusion	camera viewpoint	camera zoom
Training	medium	medium	side/back	no zoom-in
1	small	medium	side/back	no zoom-in
2	large	medium	side/back	no zoom-in
3	medium	minimal	side/back	no zoom-in
4	medium	heavy	side/back	no zoom-in
5	medium	medium	frontal	no zoom-in
6	medium	medium	not-worn	no zoom-in
7	medium	medium	side/back	medium zoom-in
8	medium	medium	side/back	large zoom-in
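
As a quick sanity check on the table, the following sketch encodes the rows and reports which attribute each test set shifts. The names `ATTRS`, `TRAIN`, `TESTS`, and `shifted_attributes` are illustrative only; the values are copied from the table.

```python
ATTRS = ("object size", "object occlusion", "camera viewpoint", "camera zoom")

TRAIN = ("medium", "medium", "side/back", "no zoom-in")

TESTS = {
    1: ("small", "medium", "side/back", "no zoom-in"),
    2: ("large", "medium", "side/back", "no zoom-in"),
    3: ("medium", "minimal", "side/back", "no zoom-in"),
    4: ("medium", "heavy", "side/back", "no zoom-in"),
    5: ("medium", "medium", "frontal", "no zoom-in"),
    6: ("medium", "medium", "not-worn", "no zoom-in"),
    7: ("medium", "medium", "side/back", "medium zoom-in"),
    8: ("medium", "medium", "side/back", "large zoom-in"),
}

def shifted_attributes(test_row):
    """List the attributes whose value differs from the training set."""
    return [a for a, tr, te in zip(ATTRS, TRAIN, test_row) if tr != te]

for idx, row in TESTS.items():
    print(idx, shifted_attributes(row))
```

Every test set yields a single-element list, so a drop in accuracy on a given test set can be attributed to that one attribute.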

DeepAugment

DeepAugment is a special kind of augmentation. It takes an image-to-image network \(h(\cdot; \theta)\) and computes \(h(x; \theta + \delta)\): perturbing the network's parameters produces diverse images. The perturbations include zeroing, negating, convolving, transposing, applying activation functions, ...
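
The idea can be illustrated with a toy example. The two-layer convolutional network below is a hypothetical stand-in for a real image-to-image model, not the architecture used in the paper, and `conv2d_same`, `perturb`, `toy_net`, and `deep_augment` are illustrative names. Only three of the listed perturbations (zeroing, negating, transposing) are shown, and they are applied to the weights only.

```python
import numpy as np

def conv2d_same(x, w):
    """Zero-padded 'same' cross-correlation.
    x: (C_in, H, W); w: (C_out, C_in, k, k) with odd k."""
    c_out, _, k, _ = w.shape
    pad = k // 2
    xp = np.pad(x, ((0, 0), (pad, pad), (pad, pad)))
    h, wd = x.shape[1:]
    out = np.zeros((c_out, h, wd))
    for i in range(k):
        for j in range(k):
            # Mix input channels for one kernel offset at a time.
            out += np.einsum("oc,chw->ohw", w[:, :, i, j],
                             xp[:, i:i + h, j:j + wd])
    return out

def perturb(w, rng, p=0.1):
    """Apply one randomly chosen perturbation to a copy of w."""
    w = w.copy()
    op = rng.choice(["zero", "negate", "transpose"])
    if op == "zero":
        w[rng.random(w.shape) < p] = 0.0       # zero a random subset
    elif op == "negate":
        mask = rng.random(w.shape) < p
        w[mask] = -w[mask]                     # flip the sign of a subset
    else:
        w = w.transpose(0, 1, 3, 2)            # transpose each k x k kernel
    return w

def toy_net(x, weights):
    """h(x; theta): conv -> ReLU -> conv -> sigmoid, output in [0, 1]."""
    hidden = np.maximum(conv2d_same(x, weights[0]), 0.0)
    return 1.0 / (1.0 + np.exp(-conv2d_same(hidden, weights[1])))

def deep_augment(x, weights, rng):
    """h(x; perturbed theta): run the network with perturbed weights."""
    return toy_net(x, [perturb(w, rng) for w in weights])

rng = np.random.default_rng(0)
weights = [0.3 * rng.normal(size=(8, 3, 3, 3)),
           0.3 * rng.normal(size=(3, 8, 3, 3))]
image = rng.random((3, 16, 16))
augmented = deep_augment(image, weights, rng)
print(augmented.shape)
```

Because the perturbation is re-sampled on every call, repeated calls on the same image give different augmented outputs, while the stored weights are left untouched.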

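Experimental conclusions

Hypotheses 1, 2, 3, and 4 hold for ImageNet-C and for real blurry images, but none of them helps with the distribution shifts in DFR and SVSF. Larger models and diverse data augmentation are effective on ImageNet-R (the latter, i.e. DeepAugment + AugMix, gives very good results).

As for CNNs being biased toward texture, ImageNet-R offers a glimpse of this: a standard CNN generalizes poorly on ImageNet-R, but diverse data augmentation alleviates the problem, since it scrambles texture information to some extent. This kind of hypothesis does not hold on DFR and SVSF, however, which suggests that texture bias is not the only factor affecting robustness.

As for the sixth point, IID accuracy is indeed important, but as the paper's results tables show, larger models and diverse data augmentation help generalization considerably while barely changing IID accuracy.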
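
One way to make the comparison in the sixth point concrete is to report IID accuracy next to OOD accuracy together with their gap. The sketch below is a minimal illustration; `accuracy` and `iid_ood_report` are hypothetical helper names, and the labels are toy values, not results from the paper.

```python
from typing import Dict, Sequence

def accuracy(preds: Sequence[int], labels: Sequence[int]) -> float:
    """Fraction of predictions that match the labels."""
    if len(preds) != len(labels) or len(labels) == 0:
        raise ValueError("preds and labels must be non-empty and equal length")
    return sum(p == t for p, t in zip(preds, labels)) / len(labels)

def iid_ood_report(iid_preds: Sequence[int], iid_labels: Sequence[int],
                   ood_preds: Sequence[int], ood_labels: Sequence[int]
                   ) -> Dict[str, float]:
    """IID accuracy, OOD accuracy, and the gap between them."""
    iid = accuracy(iid_preds, iid_labels)
    ood = accuracy(ood_preds, ood_labels)
    return {"iid_acc": iid, "ood_acc": ood, "gap": iid - ood}

# Toy labels only, to show the call pattern.
report = iid_ood_report([0, 1, 1, 0], [0, 1, 1, 0],
                        [0, 1, 0, 0], [0, 1, 1, 1])
print(report)
```

If IID accuracy alone determined robustness, two models with the same `iid_acc` would show the same `gap`. The conclusion above is that larger models and diverse augmentation can shrink the gap without moving IID accuracy much.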

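As for the last point, the role of synthetic data: synthetic data clearly can improve generalization, although such methods show little effect under shifts such as geographic ones.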

Code

Code from the original paper
