【CVPR2018】論文整理(收藏這一篇就夠了)
CVPR 2018
CVPR作為CV界最受關注的三大頂會之一,每一個CVer都應該好好關注CVPR的論文。CVPR2018在今年6月18日-22日在美國鹽湖城舉行。
如果你想要CVPR2018所有論文合集,可以訪問這個連結:http://openaccess.thecvf.com/CVPR2018.py
如果你想看CVPR2018論文資料的詳細統計,可以往下看。
先介紹一下CVPR2018的一些資料:
- 今年一共收到3309篇文章,其中979篇被錄用。投錄比約為29.5%。
- 收錄論文按專家評分,分為三個層次:Poster, Spotlight, Oral。
- Spotlight(亮點論文)一共有224篇,佔收錄論文(224/979)的22.88%
- Oral(演示論文)一共有70篇,佔收錄論文(70/979)的7.1%。
用一張韋恩圖表示收錄文章佔比:
所以說,不光中篇CVPR難,中篇spotlight更難,中篇oral基本可以說是灰常難了。就這麼說吧,今年國內所有高校加起來中的CVPR oral是個位數
。
當然,最牛的還是Best paper
和best student paper
,只會分別選出1篇。
今年的best paper給了來自Stanford和Berkeley的合作論文,論文標題為:
|Taskonomy: Disentangling Task Transfer Learning|
|:|
|下載地址為:https://arxiv.org/abs/1804.08328|
最佳學生論文來自CMU,標題為:
|Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies|
|:|
|下載地址為:https://arxiv.org/abs/1801.01615v1|
當然,就像奧斯卡頒獎一樣,最佳論文獎提名也可以突出文章質量很高。今年四篇最佳論文提名獎如下:
標題 | 第一單位 | 下載地址 |
---|---|---|
Deep_Learning_of_Graph_Matching | Lund University | http://openaccess.thecvf.com /content_cvpr_2018 /CameraReady/1830.pdf |
SPLATNet: Sparse Lattice Networks for Point Cloud Processing | UMass Amherst | https://arxiv.org/pdf/1802.08275.pdf |
CodeSLAM-learning a Compact, Optimisable Representation for Dense Visual SLAM | 帝國理工 | https://arxiv.org/pdf/1804.00874.pdf |
Efficient Optimization for Rank-based Loss Functions | IIIT Hyderabad | https://arxiv.org/pdf/1604.08269.pdf |
所以,客觀認為的論文含金量是:
best paper (2篇) > honorable mention(提名獎 4篇) > Oral (70篇) > Spotlight(224篇) > poster(其他)
CVPR2018雖好,可不要貪杯,一共有979篇,每天看1篇也得看3年,待你看完之日也是演算法過時之時。所以,給各位CVer(包括自己)一些建議:
- 從高質量論文開始看,至少優先看spotlight或者oral論文。
- 在自己的領域找論文看,別想做什麼CVPR的集大成者,如果你是CVPR oral大神,那麼當我這條沒說過。
- 哪裡有CVPR論文分享會就去聽,聽原作者自己講一個小時,比自己看一禮拜更管用。如果沒有現場版,看看視訊也是好的。
最後
附上68篇oral論文標題:(文末有下載連結)
1 | DensePose: Multi-Person Dense Human Pose Estimation In The Wild |
---|---|
2 | Context Encoding for Semantic Segmentation |
3 | Augmented Skeleton Space Transfer for Depth-based Hand Pose Estimation |
4 | Semi-parametric Image Synthesis |
5 | Practical Block-wise Neural Network Architecture Generation |
6 | Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning |
7 | PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume |
8 | Illuminant Spectra-based Source Separation Using Flash Photography |
9 | SPLATNet: Sparse Lattice Networks for Point Cloud Processing |
10 | Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies |
11 | Deep Layer Aggregation |
12 | Left-Right Comparative Recurrent Model for Stereo Matching |
13 | Analytic Expressions for Probabilistic Moments of PL-DNN with Gaussian Input |
14 | An Analysis of Scale Invariance in Object Detection - SNIP |
15 | Finding Tiny Faces in the Wild with Generative Adversarial Network |
16 | Taskonomy: Disentangling Task Transfer Learning |
17 | High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs |
18 | Finding “It”: Weakly-Supervised Reference-Aware Visual Grounding in Instructional Video |
19 | Unsupervised Discovery of Object Landmarks as Structural Representations |
20 | Rotation Averaging and Strong Duality |
21 | Im2Flow: Motion Hallucination from Static Images for Action Recognition |
22 | Group Consistent Similarity Learning via Deep CRFs for Person Re-Identification |
23 | 3D-RCNN: Instance-level 3D Scene Understanding via Render-and-Compare |
24 | Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering |
25 | Context Contrasted Feature and Gated Multi-scale Aggregation for Scene Segmentation |
26 | Squeeze-and-Excitation Networks |
27 | DoubleFusion: Real-time Capture of Human Performance with Inner Body Shape from a Single Depth Sensor |
28 | Learning to Find Good Correspondences |
29 | Actor and Action Video Segmentation from a Sentence |
30 | Maximum Classifier Discrepancy for Unsupervised Domain Adaptation |
31 | Detail-Preserving Pooling in Deep Networks |
32 | Convolutional Neural Networks with Alternately Updated Clique |
33 | Deep Learning of Graph Matching |
34 | Synthesizing Images of Humans in Unseen Poses |
35 | Neural Inverse Kinematics for Unsupervised Motion Retargetting |
36 | Direction-aware Spatial Context Features for Shadow Detection |
37 | Density Adaptive Point Set Registration |
38 | Hybrid Camera Pose Estimation |
39 | Relation Networks for Object Detection |
40 | Revisiting Salient Object Detection: Simultaneous Detection, Ranking, and Subitizing of Multiple Salient Objects |
41 | Im2Pano3D: Extrapolating 360 Structure and Semantics Beyond the Field of View |
42 | Polarimetric Dense Monocular SLAM |
43 | Wasserstein Introspective Neural Networks |
44 | The Perception-Distortion Tradeoff |
45 | Discriminative Learning of Latent Features for Zero-Shot Recognition |
46 | Photometric Stereo in Participating Media Considering Shape-Dependent Forward Scatter |
47 | Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net |
48 | Trapping Light for Time of Flight |
49 | Feature Space Transfer for Data Augmentation |
50 | Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250Hz |
51 | CodeSLAM --- Learning a Compact, Optimisable Representation for Dense Visual SLAM |
52 | FlipDial: A Generative Model for Two-Way Visual Dialogue |
53 | OATM: Occlusion Aware Template Matching by Consensus Set Maximization |
54 | Surface Networks |
55 | VirtualHome: Simulating Household Activities via Programs |
56 | Egocentric Activity Recognition on a Budget |
57 | Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering |
58 | Efficient Optimization for Rank-based Loss Functions |
59 | MakeupGAN: Makeup Transfer via Cycle-Consistent Adversarial Networks |
60 | Revisiting Deep Intrinsic Image Decompositions |
61 | StarGAN: Unified Generative Adversarial Networks for Controllable Multi-Domain Image-to-Image Translation |
62 | Ordinal Depth Supervision for 3D Human Pose Estimation |
63 | Multi-Cell Classification by Convolutional Dictionary Learning with Class Proportion Priors |
64 | Accurate and Diverse Sampling of Sequences based on a ``Best of Many'' Sample Objective |
65 | MapNet: An Allocentric Spatial Memory for Mapping Environments |
66 | A Globally Optimal Solution to the Non-Minimal Relative Pose Problem |
67 | A Volumetric Descriptive Network for 3D Object Synthesis |
68 | Learning Face Age Progression: A Pyramid Architecture of GANs |