Reading Notes: Modality-specific and shared generative adversarial network for cross-modal retrieval

阿新 • Published: 2020-10-14

This paper is about cross-modal retrieval: given a text query, retrieve the best-matching images. The model architecture is as follows:
[Figure: model architecture]

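The paper introduces two kinds of features:

modality-specific: features that are unique to each modality
modality-shared: features that are shared across modalities, which can also be understood as common features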

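The paper adopts an adversarial training framework. On the generative model side:

Three losses are used for training:

semantic discrimination loss: ensures semantic discriminability, i.e. the features extracted by the model must separate the classes well. Both the specific and the shared features must be effective at identifying a sample's class (in the paper the two extracted features are concatenated into a single feature vector that is used for prediction).
contrastive loss: for two different samples of the same class, the specific features extracted from the two samples are required to be close (across both modalities), and the shared features extracted from the two samples …
large margin loss: ensures that the modality-specific features and the modality-shared features remain sufficiently different from each other.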
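The three generator-side losses can be sketched in plain Python to make their roles concrete. This is a minimal illustration under stated assumptions, not the paper's implementation: the linear classifier, the Euclidean distance, the hinge forms, the `margin` values, and every function name below are hypothetical choices made for this example. The notes only state what each loss should encourage, and the description of the contrastive loss is incomplete, so the sketch applies it to the specific features as the note describes.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def semantic_discrimination_loss(specific, shared, class_weights, label):
    """Softmax cross-entropy of a linear classifier (an assumption) applied
    to the concatenation of the specific and shared features."""
    joint = specific + shared  # list concatenation, as described in the notes
    logits = [sum(w * v for w, v in zip(row, joint)) for row in class_weights]
    peak = max(logits)  # subtract the max for numerical stability
    log_norm = peak + math.log(sum(math.exp(z - peak) for z in logits))
    return log_norm - logits[label]


def contrastive_loss(feat_a, feat_b, same_class, margin=1.0):
    """Pull same-class pairs together; push other pairs at least `margin`
    apart. The different-class branch is an assumption of this sketch."""
    d = euclidean(feat_a, feat_b)
    if same_class:
        return d ** 2
    return max(0.0, margin - d) ** 2


def large_margin_loss(specific, shared, margin=1.0):
    """Hinge penalty when the specific and shared features of one sample
    are closer than `margin` (hypothetical form)."""
    return max(0.0, margin - euclidean(specific, shared))


# Toy features for an image and a text sample of the same class.
img_specific, img_shared = [1.0, 0.0], [0.0, 1.0]
txt_specific, txt_shared = [0.9, 0.1], [0.1, 0.9]
# Two classes, four-dimensional joint feature (made-up weights).
class_weights = [[1.0, 0.0, 0.0, 1.0], [0.0, 1.0, 1.0, 0.0]]

total = (
    semantic_discrimination_loss(img_specific, img_shared, class_weights, 0)
    + contrastive_loss(img_specific, txt_specific, same_class=True)
    + large_margin_loss(img_specific, img_shared)
)
print(total)
```

Summing the three terms with equal weight is also an assumption; in practice each term would likely carry its own weighting coefficient.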

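On the discriminative model side:

Given a sample's modality-shared feature, the discriminator predicts which modality the information came from.

This reduces the modality gap in the extracted modality-shared features. In other words, each extracted shared feature varies little across modalities: whether it was extracted from an image or from text, the shared features are similar, so the result is the same.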
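The adversarial game on the shared features can be illustrated with a tiny modality classifier. The sketch below assumes a logistic-regression discriminator and a binary cross-entropy objective; both are choices made for this example, and the names `modality_probability`, `discriminator_loss`, and `generator_adversarial_loss` are hypothetical. Label 1 stands for "image" and label 0 for "text".

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def modality_probability(shared, weights, bias):
    """Discriminator's estimated probability that a shared feature
    was extracted from an image rather than from text."""
    return sigmoid(sum(w * v for w, v in zip(weights, shared)) + bias)


def discriminator_loss(shared_feats, modality_labels, weights, bias):
    """Mean binary cross-entropy of the modality prediction."""
    eps = 1e-12  # avoids log(0)
    total = 0.0
    for feat, label in zip(shared_feats, modality_labels):
        p = modality_probability(feat, weights, bias)
        total -= label * math.log(p + eps) + (1 - label) * math.log(1.0 - p + eps)
    return total / len(shared_feats)


def generator_adversarial_loss(shared_feats, modality_labels, weights, bias):
    """The generator benefits when the discriminator is wrong, so this
    sketch simply negates the discriminator's loss (an assumption)."""
    return -discriminator_loss(shared_feats, modality_labels, weights, bias)


labels = [1, 0]  # first feature from an image, second from text

# Shared features that still reveal their modality: easy to tell apart.
separable = [[1.0, 0.0], [-1.0, 0.0]]
loss_separable = discriminator_loss(separable, labels, [5.0, 0.0], 0.0)

# Identical shared features: the discriminator cannot beat chance.
aligned = [[0.5, 0.5], [0.5, 0.5]]
loss_aligned = discriminator_loss(aligned, labels, [3.0, -1.0], 0.2)

print(loss_separable, loss_aligned, math.log(2))
```

When the image and text shared features coincide, the discriminator outputs the same probability for both, so its loss cannot fall below log 2, the value for a coin flip. That is the state the generator is pushed toward.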
