CVPR 2017論文集錦(論文分類)—— 附錄部分翻譯
阿新 • • 發佈:2019-01-06
作為計算機視覺領域的三大頂級會議之一,CVPR 2017 又收錄了很多優秀的文章。具體可參見 CVPR 的論文官網:http://www.cvpapers.com/cvpr2017.html
Machine Learning 1 (機器學習)
Spotlight 1-1A (關注的焦點 1-1 A)
- Exclusivity-Consistency Regularized Multi-View Subspace Clustering
- Xiaojie Guo, Xiaobo Wang, Zhen Lei, Changqing Zhang, Stan Z. Li
- Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning
- Weifeng Ge, Yizhou Yu
- The More You Know: Using Knowledge Graphs for Image Classification
- Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta
- Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs
- Martin Simonovsky, Nikos Komodakis
- Convolutional Neural Network Architecture for Geometric Matching
- Ignacio Rocco, Relja Arandjelović, Josef Sivic
- Deep Affordance-Grounded Sensorimotor Object Recognition
- Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos
- Discovering Causal Signals in Images
- David Lopez-Paz, Robert Nishihara, Soumith Chintala, Bernhard Schölkopf, Léon Bottou
- On Compressing Deep Models by Low Rank and Sparse Decomposition
- Xiyu Yu, Tongliang Liu, Xinchao Wang, Dacheng Tao
Oral 1-1A (口頭彙報 1-1A)
- PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
- Charles R. Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas
- Universal Adversarial Perturbations
- Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal Frossard
- Unsupervised Pixel-Level Domain Adaptation With Generative Adversarial Networks
- Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, Dilip Krishnan
- Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network (PDF, code)
- Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi
3D Vision 1 (三維視覺)
Spotlight 1-1B (關注的焦點 1-1 B)
- Context-Aware Captions From Context-Agnostic Supervision
- Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, Gal Chechik
- Global Hypothesis Generation for 6D Object Pose Estimation (PDF)
- Frank Michel, Alexander Kirillov, Eric Brachmann, Alexander Krull, Stefan Gumhold, Bogdan Savchynskyy, Carsten Rother
- A Practical Method for Fully Automatic Intrinsic Camera Calibration Using Directionally Encoded Light
- Mahdi Abbaspour Tehrani, Thabo Beeler, Anselm Grundhöfer
- CATS: A Color and Thermal Stereo Benchmark
- Wayne Treible, Philip Saponaro, Scott Sorensen, Abhishek Kolagunda, Michael O'Neal, Brian Phelan, Kelly Sherbondy, Chandra Kambhamettu
- Elastic Shape-From-Template With Spatially Sparse Deforming Forces
- Abed Malti, Cédric Herzet
- Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context
- Qingan Yan, Long Yang, Ling Zhang, Chunxia Xiao
- Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation
- Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, Nicu Sebe
- Dynamic Time-Of-Flight
- Michael Schober, Amit Adam, Omer Yair, Shai Mazor, Sebastian Nowozin
Oral 1-1B (口頭彙報 1-1 B)
- Semantic Scene Completion From a Single Depth Image
- Shuran Song, Fisher Yu, Andy Zeng, Angel X. Chang, Manolis Savva, Thomas Funkhouser
- 3DMatch: Learning Local Geometric Descriptors From RGB-D Reconstructions
- Andy Zeng, Shuran Song, Matthias Nießner, Matthew Fisher, Jianxiong Xiao, Thomas Funkhouser
- Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency (PDF, project, code)
- On-The-Fly Adaptation of Regression Forests for Online Camera Relocalisation (PDF)
- Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Luigi Di Stefano, Philip H. S. Torr
Low- & Mid-Level Vision
Spotlight 1-1C (關注的焦點 1-1 C)
- Designing Effective Inter-Pixel Information Flow for Natural Image Matting
- Yağiz Aksoy, Tunç Ozan Aydin, Marc Pollefeys
- Deep Video Deblurring for Hand-Held Cameras
- Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, Wolfgang Heidrich, Oliver Wang
- Instance-Level Salient Object Segmentation
- Guanbin Li, Yuan Xie, Liang Lin, Yizhou Yu
- Deep Multi-Scale Convolutional Neural Network for Dynamic Scene Deblurring
- Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee
- Diversified Texture Synthesis With Feed-Forward Networks
- Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang
- Radiometric Calibration for Internet Photo Collections (PDF)
- Zhipeng Mo, Boxin Shi, Sai-Kit Yeung, Yasuyuki Matsushita
- Deeply Aggregated Alternating Minimization for Image Restoration
- Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn
- End-To-End Instance Segmentation With Recurrent Attention
- Mengye Ren, Richard S. Zemel
Oral 1-1C
- SRN: Side-output Residual Network for Object Symmetry Detection in the Wild
- Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, Qixiang Ye
- Deep Image Matting (PDF, abstract)
- Ning Xu, Brian Price, Scott Cohen, Thomas Huang
- Wetness and Color From a Single Multispectral Image
- Mihoko Shimano, Hiroki Okawa, Yuta Asano, Ryoma Bise, Ko Nishino, Imari Sato
- FC4: Fully Convolutional Color Constancy With Confidence-Weighted Pooling
- Yuanming Hu, Baoyuan Wang, Stephen Lin
Poster 1-1
3D Computer Vision
- Face Normals “In-The-Wild†Using Fully Convolutional Networks
- George Trigeorgis, Patrick Snape, Iasonas Kokkinos, Stefanos Zafeiriou
- A Non-Convex Variational Approach to Photometric Stereo Under Inaccurate Lighting
- Yvain Quéau, Tao Wu, François Lauze, Jean-Denis Durou, Daniel Cremers
- A Linear Extrinsic Calibration of Kaleidoscopic Imaging System From Single 3D Point
- Kosuke Takahashi, Akihiro Miyata, Shohei Nobuhara, Takashi Matsuyama
- Polarimetric Multi-View Stereo
- Zhaopeng Cui, Jinwei Gu, Boxin Shi, Ping Tan, Jan Kautz
- An Exact Penalty Method for Locally Convergent Maximum Consensus (PDF, code)
- Huu Le, Tat-Jun Chin, David Suter
- Deep Supervision With Shape Concepts for Occlusion-Aware 3D Object Parsing
- Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker
- Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes From 2D Ones in RGB-Depth Images
- Zhuo Deng, Longin Jan Latecki
Analyzing Humans in Images
- Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection
- Guillermo Garcia-Hernando, Tae-Kyun Kim
- Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition With Convolutional Neural Networks
- Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, Chang Tang, Philip Ogunbona
- Detecting Masked Faces in the Wild With LLE-CNNs
- Shiming Ge, Jia Li, Qiting Ye, Zhao Luo
- A Domain Based Approach to Social Relation Recognition
- Qianru Sun, Bernt Schiele, Mario Fritz
- Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition
- Junwu Weng, Chaoqun Weng, Junsong Yuan
- Personalizing Gesture Recognition Using Hierarchical Bayesian Neural Networks
- Ajjen Joshi, Soumya Ghosh, Margrit Betke, Stan Sclaroff, Hanspeter Pfister
Applications
- Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core
- Wadim Kehl, Federico Tombari, Slobodan Ilic, Nassir Navab
- Multi-Scale FCN With Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild
- Dafang He, Xiao Yang, Chen Liang, Zihan Zhou, Alexander G. Ororbi II, Daniel Kifer, C. Lee Giles
- Viraliency: Pooling Local Virality
- Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, Elisa Ricci
Biomedical Image/Video Analysis
- A Non-Local Low-Rank Framework for Ultrasound Speckle Reduction
- Lei Zhu, Chi-Wing Fu, Michael S. Brown, Pheng-Ann Heng
Image Motion & Tracking
- Video Acceleration Magnification
- Silvia L. Pintea, Yichao Zhang, Jan C. van Gemert
- Superpixel-Based Tracking-By-Segmentation Using Markov Chains
- Donghun Yeo, Jeany Son, Bohyung Han, Joon Hee Han
- BranchOut: Regularization for Online Ensemble Tracking With Convolutional Neural Networks
- Bohyung Han, Jack Sim, Hartwig Adam
- Learning Motion Patterns in Videos
- Pavel Tokmakov, Karteek Alahari, Cordelia Schmid
Low- & Mid-Level Vision
- Deep Level Sets for Salient Object Detection
- Ping Hu, Bing Shuai, Jun Liu, Gang Wang
- Binary Constraint Preserving Graph Matching
- Bo Jiang, Jin Tang, Chris Ding, Bin Luo
- From Local to Global: Edge Profiles to Camera Motion in Blurred Images
- Subeesh Vasu, A. N. Rajagopalan
- What Is the Space of Attenuation Coefficients in Underwater Computer Vision?
- Derya Akkaynak, Tali Treibitz, Tom Shlesinger, Yossi Loya, Raz Tamir, David Iluz
- Robust Energy Minimization for BRDF-Invariant Shape From Light Fields
- Zhengqin Li, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker
- Boundary-Aware Instance Segmentation
- Zeeshan Hayder, Xuming He, Mathieu Salzmann
- Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes
- S. Alireza Golestaneh, Lina J. Karam
- Model-Based Iterative Restoration for Binary Document Image Compression With Dictionary Learning
- Yandong Guo, Cheng Lu, Jan P. Allebach, Charles A. Bouman
- FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence
- Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, Stephen Lin, Kwanghoon Sohn
Machine Learning
- Learning by Association — A Versatile Semi-Supervised Training Method for Neural Networks
- Philip Haeusser, Alexander Mordvintsev, Daniel Cremers
- Dilated Residual Networks
- Fisher Yu, Vladlen Koltun, Thomas Funkhouser
- Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction
- Richard Zhang, Phillip Isola, Alexei A. Efros
- Nonnegative Matrix Underapproximation for Robust Multiple Model Fitting
- Mariano Tepper, Guillermo Sapiro
- Truncated Max-Of-Convex Models
- Pankaj Pansari, M. Pawan Kumar
- Additive Component Analysis
- Calvin Murdock, Fernando De la Torre
- Subspace Clustering via Variance Regularized Ridge Regression
- Zhao Kang, Chong Peng, Qiang Cheng
- The Incremental Multiresolution Matrix Factorization Algorithm
- Vamsi K. Ithapu, Risi Kondor, Sterling C. Johnson, Vikas Singh
- Transformation-Grounded Image Generation Network for Novel 3D View Synthesis
- Eunbyung Park, Jimei Yang, Ersin Yumer, Duygu Ceylan, Alexander C. Berg
- Learning Dynamic Guidance for Depth Image Enhancement (PDF)
- Shuhang Gu, Wangmeng Zuo, Shi Guo, Yunjin Chen, Chongyu Chen, Lei Zhang
- A-Lamp: Adaptive Layout-Aware Multi-Patch Deep Convolutional Neural Network for Photo Aesthetic Assessment (PDF)
- Shuang Ma, Jing Liu, Chang Wen Chen
- Teaching Compositionality to CNNs
- Austin Stone, Huayan Wang, Michael Stark, Yi Liu, D. Scott Phoenix, Dileep George
- Using Ranking-CNN for Age Estimation
- Shixing Chen, Caojin Zhang, Ming Dong, Jialiang Le, Mike Rao
- Accurate Single Stage Detector Using Recurrent Rolling Convolution
- Jimmy Ren, Xiaohao Chen, Jianbo Liu, Wenxiu Sun, Jiahao Pang, Qiong Yan, Yu-Wing Tai, Li Xu
- A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation
- Chunpeng Wu, Wei Wen, Tariq Afzal, Yongmei Zhang, Yiran Chen, Hai (Helen) Li
- The Impact of Typicality for Informative Representative Selection
- Jawadul H. Bappy, Sujoy Paul, Ertem Tuncel, Amit K. Roy-Chowdhury
- Infinite Variational Autoencoder for Semi-Supervised Learning
- M. Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel
- SurfNet: Generating 3D Shape Surfaces Using Deep Residual Networks
- Ayan Sinha, Asim Unmesh, Qixing Huang, Karthik Ramani
- Intrinsic Grassmann Averages for Online Linear and Robust Subspace Learning
- Rudrasis Chakraborty, Søren Hauberg, Baba C. Vemuri
- Variational Bayesian Multiple Instance Learning With Gaussian Processes
- Manuel Haußmann, Fred A. Hamprecht, Melih Kandemir
- Temporal Attention-Gated Model for Robust Sequence Classification
- Wenjie Pei, Tadas Baltrušaitis, David M.J. Tax, Louis-Philippe Morency
- Non-Uniform Subset Selection for Active Learning in Structured Data
- Sujoy Paul, Jawadul H. Bappy, Amit K. Roy-Chowdhury
- Colorization as a Proxy Task for Visual Understanding
- Gustav Larsson, Michael Maire, Gregory Shakhnarovich
- Shading Annotations in the Wild
- Balazs Kovacs, Sean Bell, Noah Snavely, Kavita Bala
- LCNN: Lookup-Based Convolutional Neural Network
- Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi
Object Recognition & Scene Understanding ( 目標檢測、場景理解)
- Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation
- Hao Zhao, Ming Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang
- Pixelwise Instance Segmentation With a Dynamically Instantiated Network
- Anurag Arnab, Philip H. S. Torr
- Object Detection in Videos With Tubelet Proposal Networks
- Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, Xiaogang Wang
- AMVH: Asymmetric Multi-Valued Hashing
- Cheng Da, Shibiao Xu, Kun Ding, Gaofeng Meng, Shiming Xiang, Chunhong Pan
- Spindle Net: Person Re-Identification With Human Body Region Guided Feature Decomposition and Fusion
- Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, Xiaoou Tang
- Deep Visual-Semantic Quantization for Efficient Image Retrieval
- Yue Cao, Mingsheng Long, Jianmin Wang, Shichen Liu
- Efficient Diffusion on Region Manifolds: Recovering Small Objects With Compact CNN Representations
- Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Teddy Furon, Ondřej Chum
- Feature Pyramid Networks for Object Detection
- Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie
- Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation
- Hongliang Yan, Yukang Ding, Peihua Li, Qilong Wang, Yong Xu, Wangmeng Zuo
- StyleNet: Generating Attractive Visual Captions With Styles
- Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, Li Deng
- Fine-Grained Recognition of Thousands of Object Categories With Single-Example Training
- Leonid Karlinsky, Joseph Shtok, Yochay Tzur, Asaf Tzadok
- Improving Interpretability of Deep Neural Networks With Semantic Information
- Yinpeng Dong, Hang Su, Jun Zhu, Bo Zhang
- Video Captioning With Transferred Semantic Attributes
- Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei
- Fast Boosting Based Detection Using Scale Invariant Multimodal Multiresolution Filtered Features
- Arthur Daniel Costea, Robert Varga, Sergiu Nedevschi
Video Analytics (視訊分析)
- Temporal Convolutional Networks for Action Segmentation and Detection
- Colin Lea, Michael D. Flynn, René Vidal, Austin Reiter, Gregory D. Hager
- Surveillance Video Parsing With Single Frame Supervision
- Si Liu, Changhu Wang, Ruihe Qian, Han Yu, Renda Bao, Yao Sun
- Weakly Supervised Actor-Action Segmentation via Robust Multi-Task Ranking
- Yan Yan, Chenliang Xu, Dawen Cai, Jason J. Corso
- Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos
- De-An Huang, Joseph J. Lim, Li Fei-Fei, Juan Carlos Niebles
- Zero-Shot Action Recognition With Error-Correcting Output Codes
- Jie Qin, Li Liu, Ling Shao, Fumin Shen, Bingbing Ni, Jiaxin Chen, Yunhong Wang
- Enhancing Video Summarization via Vision-Language Embedding
- Bryan A. Plummer, Matthew Brown, Svetlana Lazebnik
- Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet
- Jianwen Xie, Song-Chun Zhu, Ying Nian Wu
Object Recognition & Scene Understanding - Computer Vision & Language
-
( 目標檢測、場景理解—— 計算機視覺 & 語言)
- Discriminative Bimodal Networks for Visual Localization and Detection With Natural Language Queries
- Yuting Zhang, Luyao Yuan, Yijie Guo, Zhiyuan He, I-An Huang, Honglak Lee
- Automatic Understanding of Image and Video Advertisements
- Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka
- Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval
- Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, Ling Shao
- Discover and Learn New Objects From Documentaries
- Kai Chen, Hang Song, Chen Change Loy, Dahua Lin
- Spatial-Semantic Image Search by Visual Feature Synthesis
- Long Mai, Hailin Jin, Zhe Lin, Chen Fang, Jonathan Brandt, Feng Liu
- Fully-Adaptive Feature Sharing in Multi-Task Networks With Applications in Person Attribute Classification
- Yongxi Lu, Abhishek Kumar, Shuangfei Zhai, Yu Cheng, Tara Javidi, Rogerio Feris
- Semantic Compositional Networks for Visual Captioning
- Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, Kenneth Tran, Jianfeng Gao, Lawrence Carin, Li Deng
- Training Object Class Detectors With Click Supervision
- Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari
Oral 1-2A
- Deep Reinforcement Learning-Based Image Captioning With Embedding Reward
- Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li
- From Red Wine to Red Tomato: Composition With Context
- Ishan Misra, Abhinav Gupta, Martial Hebert
- Captioning Images With Diverse Objects
- Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, Trevor Darrell, Kate Saenko
- Self-Critical Sequence Training for Image Captioning
- Steven J. Rennie, Etienne Marcheret, Youssef Mroueh, Jerret Ross, Vaibhava Goel
Analyzing Humans 1
Spotlight 1-2B
- Crossing Nets: Combining GANs and VAEs With a Shared Latent Space for Hand Pose Estimation
- Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao
- Predicting Behaviors of Basketball Players From First Person Videos
- Shan Su, Jung Pyo Hong, Jianbo Shi, Hyun Soo Park
- LCR-Net: Localization-Classification-Regression for Human Pose
- Grégory Rogez, Philippe Weinzaepfel, Cordelia Schmid
- Learning Residual Images for Face Attribute Manipulation
- Wei Shen, Rujie Liu
- Seeing What Is Not There: Learning Context to Determine Where Objects Are Missing
- Jin Sun, David W. Jacobs
- Deep Learning on Lie Groups for Skeleton-Based Action Recognition
- Zhiwu Huang, Chengde Wan, Thomas Probst, Luc Van Gool
- Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations
- Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
- Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose
- Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
Oral 1-2B
- Weakly Supervised Action Learning With RNN Based Fine-To-Coarse Modeling
- Alexander Richard, Hilde Kuehne, Juergen Gall
- Disentangled Representation Learning GAN for Pose-Invariant Face Recognition
- Luan Tran, Xi Yin, Xiaoming Liu
- ArtTrack: Articulated Multi-Person Tracking in the Wild
- Eldar Insafutdinov, Mykhaylo Andriluka, Leonid Pishchulin, Siyu Tang, Evgeny Levinkov, Bjoern Andres, Bernt Schiele
- Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields (PDF, code)
- Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh
Image Motion & Tracking; Video Analysis (影象運動與追蹤;視訊分析)
Spotlight 1-2C
- Template Matching With Deformable Diversity Similarity
- Itamar Talmi, Roey Mechrez, Lihi Zelnik-Manor
- Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-Identification
- Weihua Chen, Xiaotang Chen, Jianguo Zhang, Kaiqi Huang
- Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization
- Kuo-Hao Zeng, Shih-Han Chou, Fu-Hsiang Chan, Juan Carlos Niebles, Min Sun
- Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
- Linchao Zhu, Zhongwen Xu, Yi Yang
- Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning
- Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, Jin Young Choi
- TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
- Yunseok Jang, Yale Song, Youngjae Yu, Youngjin Kim, Gunhee Kim
- Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing
- Yu-Chuan Su, Kristen Grauman
- Unsupervised Adaptive Re-Identification in Open World Dynamic Camera Networks
- Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury
Oral 1-2C
- Context-Aware Correlation Filter Tracking
- Matthias Mueller, Neil Smith, Bernard Ghanem
- Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360° Sports Videos
- Hou-Ning Hu, Yen-Chen Lin, Ming-Yu Liu, Hsien-Tzu Cheng, Yung-Ju Chang, Min Sun
- Slow Flow: Exploiting High-Speed Cameras for Accurate and Diverse Optical Flow Reference Data
- Joel Janai, Fatma Güney, Jonas Wulff, Michael J. Black, Andreas Geiger
- CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos
- Zheng Shou, Jonathan Chan, Alireza Zareian, Kazuyuki Miyazawa, Shih-Fu Chang
Poster 1-2
3D Computer Vision
- Exploiting 2D Floorplan for Building-Scale Panorama RGBD Alignment
- Erik Wijmans, Yasutaka Furukawa
- A Combinatorial Solution to Non-Rigid 3D Shape-To-Image Matching
- Florian Bernard, Frank R. Schmidt, Johan Thunberg, Daniel Cremers
- NID-SLAM: Robust Monocular SLAM Using Normalised Information Distance
- Geoffrey Pascoe, Will Maddern, Michael Tanner, Pedro Piniés, Paul Newman
- End-To-End Training of Hybrid CNN-CRF Models for Stereo
- Patrick Knöbelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock
- Learning Shape Abstractions by Assembling Volumetric Primitives (PDF, project, code)
- Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, Jitendra Malik
- Locality-Sensitive Deconvolution Networks With Gated Fusion for RGB-D Indoor Semantic Segmentation
- Yanhua Cheng, Rui Cai, Zhiwei Li, Xin Zhao, Kaiqi Huang
- Acquiring Axially-Symmetric Transparent Objects Using Single-View Transmission Imaging (PDF)
- Jaewon Kim, Ilya Reshetouski, Abhijeet Ghosh
- Regressing Robust and Discriminative 3D Morphable Models With a Very Deep Neural Network
- Anh Tuấn Trần, Tal Hassner, Iacopo Masi, Gérard Medioni
- End-To-End 3D Face Reconstruction With Deep Neural Networks
- Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris
- DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction
- Antonio Agudo, Francesc Moreno-Noguer
Analyzing Humans in Images
- Finding Tiny Faces
- Peiyun Hu, Deva Ramanan
- Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network
- Jinwei Gu, Xiaodong Yang, Shalini De Mello, Jan Kautz
- Deep Temporal Linear Encoding Networks
- Ali Diba, Vivek Sharma, Luc Van Gool
- Joint Registration and Representation Learning for Unconstrained Face Identification (PDF)
- 3D Human Pose Estimation From a Single Image via Distance Matrix Regression
- Francesc Moreno-Noguer
- One-Shot Metric Learning for Person Re-Identification
- Slawomir BÄ…k, Peter Carr
- Generalized Rank Pooling for Activity Recognition
- Anoop Cherian, Basura Fernando, Mehrtash Harandi, Stephen Gould
- Deep Representation Learning for Human Motion Prediction and Classification
- Judith Bütepage, Michael J. Black, Danica Kragic, Hedvig Kjellström
- Interspecies Knowledge Transfer for Facial Keypoint Detection
- Maheen Rashid, Xiuye Gu, Yong Jae Lee
- Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization
- Runpeng Cui, Hu Liu, Changshui Zhang
Applications
- Modeling Sub-Event Dynamics in First-Person Action Recognition
- Hasan F. M. Zaki, Faisal Shafait, Ajmal Mian
Computational Photography
- Turning an Urban Scene Video Into a Cinemagraph
- Hang Yan, Yebin Liu, Yasutaka Furukawa
- Light Field Reconstruction Using Deep Convolutional Network on EPI
- Gaochang Wu, Mandan Zhao, Liangyong Wang, Qionghai Dai, Tianyou Chai, Yebin Liu
Image Motion & Tracking (目標追蹤)
- FlowNet 2.0: Evolution of Optical Flow Estimation With Deep Networks
- Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, Thomas Brox
Low- & Mid-Level Vision
- Attention-Aware Face Hallucination via Deep Reinforcement Learning
- Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang, Guanbin Li
- Simple Does It: Weakly Supervised Instance and Semantic Segmentation
- Anna Khoreva, Rodrigo Benenson, Jan Hosang, Matthias Hein, Bernt Schiele
- Anti-Glare: Tightly Constrained Optimization for Eyeglass Reflection Removal
- Tushar Sandhan, Jin Young Choi
- Deep Joint Rain Detection and Removal From a Single Image
- Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan
- Radiometric Calibration From Faces in Images
- Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi
- Webly Supervised Semantic Segmentation
- Bin Jin, Maria V. Ortiz Segovia, Sabine Süsstrunk
- Removing Rain From Single Images via a Deep Detail Network
- Xueyang Fu, Jiabin Huang, Delu Zeng, Yue Huang, Xinghao Ding, John Paisley
- Deep Crisp Boundaries
- Yupei Wang, Xin Zhao, Kaiqi Huang
- Coarse-To-Fine Segmentation With Shape-Tailored Continuum Scale Spaces
- Naeemullah Khan, Byung-Woo Hong, Anthony Yezzi, Ganesh Sundaramoorthi
- Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network
- Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, Jian Sun
- Single Image Reflection Suppression
- Nikolaos Arvanitopoulos, Radhakrishna Achanta, Sabine Süsstrunk
- CASENet: Deep Category-Aware Semantic Edge Detection
- Zhiding Yu, Chen Feng, Ming-Yu Liu, Srikumar Ramalingam
- Reflectance Adaptive Filtering Improves Intrinsic Image Estimation
- Thomas Nestmeyer, Peter V. Gehler
Machine Learning
- Conditional Similarity Networks
- Andreas Veit, Serge Belongie, Theofanis Karaletsos
- Spatially Adaptive Computation Time for Residual Networks
- Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry Vetrov, Ruslan Salakhutdinov
- Xception: Deep Learning With Depthwise Separable Convolutions
- François Chollet
- Feedback Networks
- Amir R. Zamir, Te-Lin Wu, Lin Sun, William B. Shen, Bertram E. Shi, Jitendra Malik, Silvio Savarese
- Online Summarization via Submodular and Convex Optimization
- Ehsan Elhamifar, M. Clara De Paolis Kaluza
- Deep MANTA: A Coarse-To-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis From Monocular Image
- Florian Chabot, Mohamed Chaouch, Jaonary Rabarisoa, Céline Teulière, Thierry Chateau
- Improving Pairwise Ranking for Multi-Label Image Classification
- Yuncheng Li, Yale Song, Jiebo Luo
- Active Convolution: Learning the Shape of Convolution for Image Classification
- Yunho Jeon, Junmo Kim
- Linking Image and Text With 2-Way Nets
- Aviv Eisenschtat, Lior Wolf
- Stacked Generative Adversarial Networks
- Xun Huang, Yixuan Li, Omid Poursaeed, John Hopcroft, Serge Belongie
- Image Splicing Detection via Camera Response Function Analysis
- Can Chen, Scott McCloskey, Jingyi Yu
- Building a Regular Decision Boundary With Deep Networks
- Edouard Oyallon
- More Is Less: A More Complicated Network With Less Inference Complexity
- Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan
- Joint Graph Decomposition and Node Labeling: Problem, Algorithms, Applications
- Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres
- Scale-Aware Face Detection
- Zekun Hao, Yu Liu, Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu
- Deep Unsupervised Similarity Learning Using Partially Ordered Sets
- Miguel A. Bautista, Artsiom Sanakoyeu, Björn Ommer
- Generative Hierarchical Learning of Sparse FRAME Models
- Jianwen Xie, Yifei Xu, Erik Nijkamp, Ying Nian Wu, Song-Chun Zhu
Object Recognition & Scene Understanding
- Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval
- Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, Vlad I. Morariu, Larry S. Davis
- Perceptual Generative Adversarial Networks for Small Object Detection
- Emotion Recognition in Context (PDF, supplementary material)
- Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework
- Jongyoo Kim, Sanghoon Lee
- Dense Captioning With Joint Inference and Visual Context
- Linjie Yang, Kevin Tang, Jianchao Yang, Li-Jia Li
- CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
- Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick
- Cross-View Image Matching for Geo-Localization in Urban Environments
- Yicong Tian, Chen Chen, Mubarak Shah
- Matrix Tri-Factorization With Manifold Regularizations for Zero-Shot Learning
- Xing Xu, Fumin Shen, Yang Yang, Dongxiang Zhang, Heng Tao Shen, Jingkuan Song
- Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces
- Lluis Gomez, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar
- Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification
- Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, Xiaogang Wang
- Semantically Consistent Regularization for Zero-Shot Recognition
- Pedro Morgado, Nuno Vasconcelos
- Can Walking and Measuring Along Chord Bunches Better Describe Leaf Shapes?
- Bin Wang, Yongsheng Gao, Changming Sun, Michael Blumenstein, John La Salle
Video Analytics
- Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model
- Qixiang Ye, Tianliang Zhang, Wei Ke, Qiang Qiu, Jie Chen, Guillermo Sapiro, Baochang Zhang
- Predictive-Corrective Networks for Action Detection (project, abstract, PDF)
- Budget-Aware Deep Semantic Video Segmentation
- Behrooz Mahasseni, Sinisa Todorovic, Alan Fern
- Unified Embedding and Metric Learning for Zero-Exemplar Event Detection
- Noureldien Hussein, Efstratios Gavves, Arnold W.M. Smeulders
- Spatiotemporal Pyramid Network for Video Action Recognition
- Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu
- ER3: A Unified Framework for Event Retrieval, Recognition and Recounting
- Zhanning Gao, Gang Hua, Dongqing Zhang, Nebojsa Jojic, Le Wang, Jianru Xue, Nanning Zheng
- FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos
- Suyog Dutt Jain, Bo Xiong, Kristen Grauman
- Query-Focused Video Summarization: Dataset, Evaluation, and a Memory Network Based Approach
- Aidean Sharghi, Jacob S. Laurel, Boqing Gong
- Flexible Spatio-Temporal Networks for Video Prediction
- Chaochao Lu, Michael Hirsch, Bernhard Schölkopf
- Temporal Action Co-Segmentation in 3D Motion Capture Data and Videos
- Konstantinos Papoutsakis, Costas Panagiotakis, Antonis A. Argyros
Machine Learning 2
Spotlight 2-1A
- Dual Attention Networks for Multimodal Reasoning and Matching
- Hyeonseob Nam, Jung-Woo Ha, Jeonghee Kim
- DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents
- Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, Philip H. S. Torr, Manmohan Chandraker
- Interpretable Structure-Evolving LSTM
- Xiaodan Liang, Liang Lin, Xiaohui Shen, Jiashi Feng, Shuicheng Yan, Eric P. Xing
- ShapeOdds: Variational Bayesian Learning of Generative Shape Models
- Shireen Elhabian, Ross Whitaker
- Fast Video Classification via Adaptive Cascading of Deep Models
- Haichen Shen, Seungyeop Han, Matthai Philipose, Arvind Krishnamurthy
- Deep Metric Learning via Facility Location
- Hyun Oh Song, Stefanie Jegelka, Vivek Rathod, Kevin Murphy
- Semi-Supervised Deep Learning for Monocular Depth Map Prediction
- Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe
- Weakly Supervised Semantic Segmentation Using Web-Crawled Videos
- Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, Bohyung Han
Oral 2-1A
- Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach
- Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, Lizhen Qu
- Learning From Simulated and Unsupervised Images Through Adversarial Training
- Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Joshua Susskind, Wenda Wang, Russell Webb
- Inverse Compositional Spatial Transformer Networks
- Chen-Hsuan Lin, Simon Lucey
- Densely Connected Convolutional Networks
- Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger
Computational Photography
Spotlight 2-1B
- Visual Dialog
- Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra
- Video Frame Interpolation via Adaptive Convolution
- Simon Niklaus, Long Mai, Feng Liu
- FastMask: Segment Multi-Scale Object Candidates in One Shot
- Hexiang Hu, Shiyi Lan, Yuning Jiang, Zhimin Cao, Fei Sha
- Reconstructing Transient Images From Single-Photon Sensors
- Matthew O'Toole, Felix Heide, David B. Lindell, Kai Zang, Steven Diamond, Gordon Wetzstein
- DeshadowNet: A Multi-Context Embedding Deep Network for Shadow Removal
- Liangqiong Qu, Jiandong Tian, Shengfeng He, Yandong Tang, Rynson W. H. Lau
- Illuminant-Camera Communication to Observe Moving Objects Under Strong External Light by Spread Spectrum Modulation
- Ryusuke Sagawa, Yutaka Satoh
- Photorealistic Facial Texture Inference Using Deep Neural Networks
- Shunsuke Saito, Lingyu Wei, Liwen Hu, Koki Nagano, Hao Li
- The Geometry of First-Returning Photons for Non-Line-Of-Sight Imaging
- Chia-Yin Tsai, Kiriakos N. Kutulakos, Srinivasa G. Narasimhan, Aswin C. Sankaranarayanan
Oral 2-1B
- Unrolling the Shutter: CNN to Correct Motion Distortions
- Vijay Rengarajan, Yogesh Balaji, A. N. Rajagopalan
- Light Field Blind Motion Deblurring
- Pratul P. Srinivasan, Ren Ng, Ravi Ramamoorthi
- Computational Imaging on the Electric Grid
- Mark Sheinin, Yoav Y. Schechner, Kiriakos N. Kutulakos
- Deep Outdoor Illumination Estimation
- Yannick Hold-Geoffroy, Kalyan Sunkavalli, Sunil Hadap, Emiliano Gambaretto, Jean-François Lalonde
3D Vision 2
Spotlight 2-1C
- Efficient Solvers for Minimal Problems by Syzygy-Based Reduction
- Viktor Larsson, Kalle Åström, Magnus Oskarsson
- HSfM: Hybrid Structure-from-Motion
- Hainan Cui, Xiang Gao, Shuhan Shen, Zhanyi Hu
- Efficient Global Point Cloud Alignment Using Bayesian Nonparametric Mixtures
- Julian Straub, Trevor Campbell, Jonathan P. How, John W. Fisher III
- A New Rank Constraint on Multi-View Fundamental Matrices, and Its Application to Camera Location Recovery
- Soumyadip Sengupta, Tal Amir, Meirav Galun, Tom Goldstein, David W. Jacobs, Amit Singer, Ronen Basri
- IM2CAD
- Hamid Izadinia, Qi Shan, Steven M. Seitz
- ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes
- Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner
- Noise Robust Depth From Focus Using a Ring Difference Filter
- Jaeheung Surh, Hae-Gon Jeon, Yunwon Park, Sunghoon Im, Hyowon Ha, In So Kweon
- Group-Wise Point-Set Registration Based on Rényi's Second Order Entropy
- Luis G. Sanchez Giraldo, Erion Hasanbelliu, Murali Rao, Jose C. Principe
Oral 2-1C
- A Point Set Generation Network for 3D Object Reconstruction From a Single Image
- Haoqiang Fan, Hao Su, Leonidas J. Guibas
- 3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder
- Gil Elbaz, Tamar Avraham, Anath Fischer
- Flight Dynamics-Based Recovery of a UAV Trajectory Using Ground Cameras
- Artem Rozantsev, Sudipta N. Sinha, Debadeepta Dey, Pascal Fua
- DSAC - Differentiable RANSAC for Camera Localization (PDF, code, project)
- Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, Carsten Rother
Poster 2-1
3D Computer Vision
- Scalable Surface Reconstruction From Point Clouds With Extreme Scale and Density Diversity
- Christian Mostegel, Rudolf Prettenthaler, Friedrich Fraundorfer, Horst Bischof
- Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes Wi