<cleverhans 對抗樣本防護編譯與測試（含 FGSM 攻擊與 ADV 防護）>

阿新 • • 發佈：2020-08-12

(4 條訊息)Python3 環境下 cleverhans 對抗樣本防護編譯與測試（含 FGSM 攻擊與 ADV 防護）_大資料探勘 SparkExpert 的部落格 - CSDN 部落格_cleverhans

在看人工智慧安全方面的資料，順手看到 cleverhans 的資料，就將它在 python 3.6 的環境下進行編譯和測試。

在Ian Goodfellow的《Machine learning privacy and security》報告中才瞭解到cleverhans專案名字的由來：“一匹叫做 Clever Hans 的馬。剛出現的時候人們認為這匹馬會做算術，但實際上它只是會閱讀人的表情，當它點馬蹄的次數接近正確答案時，人們的表情會更興奮，它就知道該這個時候停止了。”

這個專案是 tensorflow 的子專案（https://github.com/tensorflow/cleverhans），原始的程式碼版本是 PYTHON 2.7 環境，於程式碼下載後進行了重構和 3.6 版本的編譯。發現這個程式碼的工作量挺多的。下面就重點關注的幾塊進行測試。

（1）FGSM 的影象擾動攻擊

FGSM，是 Goodfellow 等人提出的比較典型的對抗樣本生成演算法。

它的資料生成方式如下（由於

很小，因此 x'和 x 的數值相差不大，因此人眼一般不會感知到明顯區別, 但是對於 CNN 模型來說，識別的錯誤還是發生了。）：

, 具體的程式碼可見 https://github.com/tensorflow/cleverhans/blob/master/cleverhans/attacks_tf.py 相關的函式。

前後

此外，這個網址上提供了許多 FGSM 的例子。（見 https://www.kaggle.com/benhamner/fgsm-attack-example/code）

將生成後的 FGSM 擾動資料送到影象識別模型中如程式碼中給出的 inceptionv3 中，可以看到影象的識別結果全部變亂了。

A：下圖為原始的圖片識別結果

B：下圖為 FGSM 擾動後的的圖片識別結果，可以看出識別分類結果相差特別的大。

（2）FGSM 攻擊的防護（NIPS2017 論文相關程式碼）

在找防護的過程中，才發現 cleverhans 整合的程式碼居然也是 tensorflow models 中的相關程式碼，見 https://github.com/tensorflow/models/tree/master/research/adv_imagenet_models。本質上而言，它需要在擾動的圖片上進行訓練，從而才能實現對擾動的程式碼進行準確識別。如論文原文中指出的貢獻如下：

實際程式碼中，cleverhans 提供了兩種對抗訓練，一種是基於 inceptionv3 的，一種是 inception-resnet-v2 的增強版。測試結果如下，則擾動後的圖片，也能被正確識別。

（3）一些其他的例子，cleverhans 程式碼庫提供了多樣性的對抗樣本生成方法，具體如下：

sample_attacks/- directory with examples of attacks:
- sample_attacks/fgsm/- Fast gradient sign attack.
- sample_attacks/noop/- No-op attack, which just copied images unchanged.
- sample_attacks/random_noise/- Attack which adds random noise to images.
sample_targeted_attacks/- directory with examples of targeted attacks:
- sample_targeted_attacks/step_target_class/- one step towards target class attack. This is not particularly good targeted attack, but it demonstrates how targeted attack could be written.
- sample_targeted_attacks/iter_target_class/- iterative target class attack. This is a pretty good white-box attack, but it does not do well in black box setting.
sample_defenses/- directory with examples of defenses:
- sample_defenses/base_inception_model/- baseline inception classifier, which actually does not provide any defense against adversarial examples.
- sample_defenses/adv_inception_v3/- adversarially trained Inception v3 model fromAdversarial Machine Learning at Scalepaper.
- sample_defenses/ens_adv_inception_resnet_v2/- Inception ResNet v2 model which is adversarially trained against an ensemble of different kind of adversarial examples. Model is described inEnsemble Adversarial Training: Attacks and Defensespaper.

同時也提供了好幾個 example。還是對抗樣本生成與對抗訓練非常好的一個庫。

附圖為其中第一個 example。

全文完

本文由簡悅 SimpRead優化，用以提升閱讀體驗使用了全新的簡悅詞法分析引擎^beta，點選檢視詳細說明

<cleverhans 對抗樣本防護編譯與測試（含 FGSM 攻擊與 ADV 防護）>

<cleverhans 對抗樣本防護編譯與測試（含 FGSM 攻擊與 ADV 防護）>

軟體質量保證與測試（秦航第二版）筆記第三章

軟體質量保證與測試---軟體質量控制問題與質量控制技術

RabbitMQ簡單模式開發與測試（一）

測試左移與測試右移的定義與理解

【BZOJ5003】與鏈（多重揹包計數轉完全揹包）

寶塔Linux面板 - 新增站點建站時沒有域名實現 IP 地址訪問測試（寶塔面板建站 IP 訪問）

TensorFlow環境安裝配置1.2.1（含anaconda下載與安裝）

步進電機基礎（2.2）- 轉子的分類與結構（1.PM型步進電機）

Matrix Profile 與 Stumpy （時間序列挖掘，矩陣畫像）

Python實現excel的查詢與替換（轉EXE後可直接執行）

NO.A.0002——FreeNAS安裝與配置（版本9.3與11.04）/linux客戶端iscsi共享儲存/FreeNASA配置iscsi/Linux下Targets連結/iscsi自動掛載

注意是否為多例項測試（即是否為多組輸入）

作業系統第二章程序的描述與控制（下半部分：演算法理論篇））

寒假每日一題 AcWing 1113. 紅與黑（連通塊內點的個數）

PYQT5學習筆記（ N+ 1) 訊號與槽（內建訊號與槽的使用）

ALINK(二十七)：特徵工程（六）特徵組合與交叉（特徵組合也叫特徵交叉）

@dynamicCallable與callAsFunction（將型別例項作為函式呼叫）

Jmeter資料庫壓力測試（含階梯式增壓）

第二節組合邏輯程式碼設計與模擬（多路選擇器邏輯設計）

<cleverhans 對抗樣本防護編譯與測試（含 FGSM 攻擊與 ADV 防護）>

相關推薦