Multi-View Gait Recognition Based on A Spatial-Temporal Deep Neural Network: Paper Translation and Notes
The paper is walked through section by section, with notes keyed to its figures.
ABSTRACT
This paper proposes a novel spatial-temporal deep neural network (STDNN) that is applied to multi-view gait recognition. STDNN comprises a temporal feature network (TFN) and a spatial feature network (SFN). In TFN, a feature sub-network is adopted to extract the low-level edge features of gait silhouettes. These features are input to the spatial-temporal gradient network (STGN), which adopts a spatial-temporal gradient (STG) unit and a long short-term memory (LSTM) unit to extract spatial-temporal gradient features. In SFN, the spatial features of gait sequences are extracted by multilayer convolutional neural networks from a gait energy image (GEI). SFN is optimized jointly by classification loss and verification loss, which makes inter-class variations larger than intra-class variations. After training, TFN and SFN are employed to extract temporal and spatial features, respectively, which are applied to multi-view gait recognition. Finally, the combined predicted probability is adopted to identify individuals by the differences in their gaits. To evaluate the performance of STDNN, extensive evaluations are carried out on the CASIA-B, OU-ISIR and CMU MoBo datasets. The best recognition scores achieved by STDNN are 95.67% under an identical view, 93.64% under a cross-view and 92.54% under a multi-view setting. State-of-the-art approaches are compared with STDNN in various situations. The results show that STDNN outperforms the other methods and demonstrate its great potential for practical applications.
First, look at the overall architecture diagram, then take a closer look at the details (see the paper's figures). With that in mind, let's examine SFN and TFN separately.
A. SPATIAL FEATURE NETWORK (SFN)
In this section, the proposed spatial feature network (SFN) is adopted to extract the spatial features of a gait sequence. Its working mechanism is as follows.
1) Spatial Feature Extraction
A spatial feature network comprises three parts: the input layer, the feature extraction network and the loss function layer. During the training phase, a pair of GEIs is input in turn to the multi-layer convolutional neural networks, which are adopted to extract the spatial features of a gait sequence. The last convolutional layer is connected to a fully-connected layer, by which high-dimensional feature vectors are generated. SFN is optimized based on two supervised signals, classification loss and verification loss.
During the testing phase, the network optimized by the two loss functions is adopted to extract the spatial features, based on which the input GEI pair is judged to belong to the same subject or not. Many methods based on convolutional neural networks have recently been proposed to extract effective spatial features [24], [39].
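At test time, the decision reduces to thresholding the distance between the two extracted feature vectors. A minimal PyTorch sketch, where the threshold value is an illustrative assumption rather than a value from the paper:

```python
import torch.nn.functional as F

def same_subject(f1, f2, threshold=0.5):
    """Judge whether two GEI feature vectors (shape [1, D]) belong to
    the same subject by thresholding their L2 distance.
    `threshold` is an illustrative value, not taken from the paper."""
    return F.pairwise_distance(f1, f2).item() < threshold
```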
To extract the effective spatial features of a gait sequence, a ConvNet with four convolutional layers is designed as the base network model of the spatial feature network. Furthermore, two other kinds of CNN-based networks are selected to compare with SFN on gait recognition accuracy, which is discussed in detail in section V.C.
(Note: this comparison only concerns which network extracts the more effective spatial features.)
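As a concrete illustration, here is a minimal PyTorch sketch of a four-convolutional-layer ConvNet of the kind described above. The channel counts, kernel sizes, input resolution and the 128-dimensional feature size are assumptions for illustration; the paper's exact hyperparameters are not given in this section.

```python
import torch
import torch.nn as nn

class SFNConvNet(nn.Module):
    """Four-conv-layer base network for the spatial feature network (SFN).
    Layer widths, kernel sizes and the feature dimension are illustrative
    assumptions, not the paper's exact configuration."""

    def __init__(self, num_classes: int, feat_dim: int = 128):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(64, 128, kernel_size=3, padding=1), nn.ReLU(),
        )
        # Fully-connected layer that turns the last conv output into a
        # high-dimensional spatial feature vector (assuming 64x64 GEI input).
        self.fc = nn.Linear(128 * 8 * 8, feat_dim)
        self.classifier = nn.Linear(feat_dim, num_classes)

    def forward(self, gei: torch.Tensor):
        x = self.features(gei)
        f = self.fc(x.flatten(1))    # spatial feature vector
        logits = self.classifier(f)  # fed to the softmax / classification loss
        return f, logits
```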
The process of gait spatial feature extraction comprises three parts. In the data preparation phase, GEIs are generated using the method described in Section III.C. Then, the sample pair is sent in turn to ConvNet, which is adopted to extract high-dimensional feature vectors. The vectors output by ConvNet are capable of representing the spatial features of the input samples, based on which a softmax layer is used to predict the categories of the input samples.
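A usage sketch of the network above; the GEI resolution and the subject count (124 subjects, as in CASIA-B) are assumptions for illustration:

```python
import torch

model = SFNConvNet(num_classes=124)   # e.g. 124 subjects in CASIA-B
gei_pair = torch.rand(2, 1, 64, 64)   # a pair of 64x64 GEIs
features, logits = model(gei_pair)
probs = logits.softmax(dim=1)         # softmax predicts the categories
print(features.shape, probs.shape)    # [2, 128] and [2, 124]
```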
A GEI is a kind of small sample carrying little information, which is a barrier to identifying different subjects by the aforementioned method alone. Inspired by the promising performance of the method proposed in [40], an additional verification signal is adopted, which not only enlarges inter-class variations but also reduces intra-class variations. The two supervisory signals are implemented using two kinds of loss functions, which work together to compel SFN to focus on the identity-related features themselves rather than other influential factors, such as worn items, noise, illumination and so on. The working mechanism of SFN is shown in Figure 5.
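To make the joint supervision concrete, the following sketch combines the two losses on one GEI pair, reusing the SFNConvNet sketch above. The balance weight `lam` and the margin are assumed hyperparameters, not values from the paper:

```python
import torch
import torch.nn.functional as F

def joint_loss(model, gei_a, gei_b, label_a, label_b, margin=1.0, lam=0.5):
    """Joint classification + verification loss on one GEI pair."""
    f_a, logits_a = model(gei_a)
    f_b, logits_b = model(gei_b)

    # Classification loss (softmax cross-entropy) on both samples.
    cls = F.cross_entropy(logits_a, label_a) + F.cross_entropy(logits_b, label_b)

    # Verification loss (contrastive): pull same-subject features together,
    # push different-subject features at least `margin` apart.
    same = (label_a == label_b).float()
    dist = F.pairwise_distance(f_a, f_b)
    verif = same * 0.5 * dist.pow(2) + (1 - same) * 0.5 * F.relu(margin - dist).pow(2)

    return cls + lam * verif.mean()
```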
The first supervisory signal is verification loss, which is adopted to reduce the intra-class variation of gait samples.
(A remarkably strong supervisory signal!)
Based on the L2 norm [41], verification loss is defined in Equation (13), where $x_1$ and $x_2$ denote two input GEIs, and $f_1$ and $f_2$ denote the feature vectors output by the fully-connected layer Fc6. When $y = 1$, this means that $x_1$ and $x_2$ are from the same individual, and the features $f_1$ and $f_2$ are enforced to be close. On the contrary, when $y = 0$, this means that $x_1$ and $x_2$ are from different persons; in this case, the features $f_1$ and $f_2$ are pushed apart. The size of the margin is denoted as $m$, which is smaller than the distance between the features carried by different subjects.
(Note: this is simply the definition of the contrastive loss.)
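For reference, a reconstruction of the standard contrastive loss described above, using the symbols from the preceding paragraph; the paper's exact notation for Equation (13) may differ:

$$
\mathrm{Verif}(f_1, f_2, y) =
\begin{cases}
\dfrac{1}{2}\lVert f_1 - f_2 \rVert_2^2, & y = 1 \\
\dfrac{1}{2}\max\!\bigl(0,\, m - \lVert f_1 - f_2 \rVert_2\bigr)^2, & y = 0
\end{cases}
\tag{13}
$$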
(Erratum: the subscript in the denominator of Equation (15) and the starting index of the summation after the equals sign appear to be misprinted in the paper.)
The second supervisory signal is classification loss. After being optimized by this supervisory signal, SFN is adopted to identify different subjects. This loss function is defined in Equations (14)-(15), where $f$ denotes the spatial feature vector, $p_i$ denotes the target probability distribution, with $p_i = 0$ for all classes except $p_t = 1$ for the target class $t$, and $\hat{p}_i$ denotes the predicted probability that the input sample belongs to a specific class. The output of the softmax layer is the probability distribution of the input samples. $n$ denotes the dimension of the feature output by the fully-connected layer, and $x$ denotes the input feature image.
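For reference, a reconstruction of the standard softmax cross-entropy that Equations (14)-(15) describe, using the symbols above; the paper's exact indexing may differ (see the erratum note above):

$$
\mathrm{Ident}(f, t) = -\sum_{i=1}^{n} p_i \log \hat{p}_i = -\log \hat{p}_t
\tag{14}
$$

$$
\hat{p}_i = \frac{e^{f_i}}{\sum_{j=1}^{n} e^{f_j}}
\tag{15}
$$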