## Vision & Language

- Ask Your Neurons: A Neural-Based Approach to Answering Questions About Images
    - Mateusz Malinowski, Marcus Rohrbach, Mario Fritz
- Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books
    - Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler
- Learning Query and Image Similarities With Ranking Canonical Correlation Analysis
    - Wah Ngo

## Recognition, Low-Level Vision, and Biomedical Image Analysis

1. Learning to See by Moving
    - Pulkit Agrawal, Joao Carreira, Jitendra Malik
    - scene recognition, object recognition, visual odometry, keypoint matching -- representation (feature) learning
2. Convolutional Channel Features
    - Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li
    - pedestrian detection, face detection, edge detection, object proposal generation -- representation learning
3. Local Convolutional Features With Unsupervised Training for Image Retrieval
    - Mattis Paulin, Matthijs Douze, Zaid Harchaoui, Julien Mairal, Florent Perronnin, Cordelia Schmid
    - patch descriptor learning, image retrieval
4. Discriminative Learning of Deep Convolutional Feature Point Descriptors
    - Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Pascal Fua, Francesc Moreno-Noguer
    - patch-level feature learning
5. SALICON: Reducing the Semantic Gap in Saliency Prediction by Adapting Deep Neural Networks [[Paper]](http://www.xunhuang.org/research)
    - Xun Huang, Chengyao Shen, Xavier Boix, Qi Zhao
    - saliency detection
6. Deep Networks for Image Super-Resolution With Sparse Prior
    - Zhaowen Wang, Ding Liu, Jianchao Yang, Wei Han, Thomas Huang
    - SR
7. Learning Ordinal Relationships for Mid-Level Vision
    - Daniel Zoran, Phillip Isola, Dilip Krishnan, William T. Freeman
    - intrinsic image decomposition, depth from single image
8. Deep Colorization
    - Zezhou Cheng, Qingxiong Yang, Bin Sheng
    - image colorization
9. High-for-Low and Low-for-High: Efficient Boundary Detection From Deep Object Features and its Applications to High-Level Vision
    - Gedas Bertasius, Jianbo Shi, Lorenzo Torresani
    - **boundary detection**, semantic boundary labeling, semantic segmentation
10. Video Super-Resolution via Deep Draft-Ensemble Learning
    - Renjie Liao, Xin Tao, Ruiyu Li, Ziyang Ma, Jiaya Jia
    - SR
11. Compression Artifacts Reduction by a Deep Convolutional Network
    - Chao Dong, Yubin Deng, Chen Change Loy, Xiaoou Tang
    - JPEG artifact reduction

## Recognition and 3D Computer Vision

1. Semantic Pose Using Deep Networks Trained on Synthetic RGB-D
    - Jeremie Papon, Markus Schoeler
    - indoor scene understanding from RGB-D
2. Learning Informative Edge Maps for Indoor Scene Layout Prediction
    - Arun Mallya, Svetlana Lazebnik
    - edge map prediction, indoor scene layout prediction
3. Multi-View Convolutional Neural Networks for 3D Shape Recognition
    - Hang Su, Subhransu Maji, Evangelos Kalogerakis, Erik Learned-Miller
    - 3D shape classification and retrieval, 3D shape descriptor
4. Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images
    - Alexander Krull, Eric Brachmann, Frank Michel, Michael Ying Yang, Stefan Gumhold, Carsten Rother
    - 6D pose estimation
5. A Deep Visual Correspondence Embedding Model for Stereo Matching Costs [[KITTI-submission](http://www.cvlibs.net/datasets/kitti/eval_stereo_flow_detail.php?benchmark=stereo&error=2&eval=all&result=810169a667c1d8f712ce4c82969a5e9b8b4956c8)]
    - Zhuoyuan Chen, Xun Sun, Liang Wang, Yinan Yu, Chang Huang
6. Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation
    - Xin Lu, Zhe Lin, Xiaohui Shen, Radomír Měch, James Z. Wang
    - image style recognition, aesthetic quality categorization, image quality estimation
7. Improving Image Classification With Location Context
    - Kevin Tang, Manohar Paluri, Li Fei-Fei, Rob Fergus, Lubomir Bourdev
    - image (scene) classification
8. HICO: A Benchmark for Recognizing Human-Object Interactions in Images
    - Yu-Wei Chao, Zhan Wang, Yugeng He, Jiaxuan Wang, Jia Deng
    - benchmark paper
9. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
    - Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
    - **ImageNet Classification**
10. Cross-Domain Image Retrieval With a Dual Attribute-Aware Ranking Network
    - Junshi Huang, Rogerio S. Feris, Qiang Chen, Shuicheng Yan
    - clothing detection/retrieval
11. Contextual Action Recognition With R*CNN
    - Georgia Gkioxari, Ross Girshick, Jitendra Malik
    - action recognition
12. What Makes an Object Memorable?
    - Rachit Dubey, Joshua Peterson, Aditya Khosla, Ming-Hsuan Yang, Bernard Ghanem
    - understanding the memorability of objects in images
13. MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition
    - Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham
14. Object Detection via a Multi-Region and Semantic Segmentation-Aware CNN Model
    - Spyros Gidaris, Nikos Komodakis
15. Neural Activation Constellations: Unsupervised Part Model Discovery With Convolutional Networks
    - Marcel Simon, Erik Rodner
16. Multi-Scale Recognition With DAG-CNNs
    - Songfan Yang, Deva Ramanan
17. Im2Calories: Towards an Automated Mobile Vision Food Diary
    - Austin Meyers, Nick Johnston, Vivek Rathod, Anoop Korattikara, Alex Gorban, Nathan Silberman, Sergio Guadarrama, George Papandreou, Jonathan Huang, Kevin P. Murphy
18. Aggregating Local Deep Features for Image Retrieval
    - Artem Babenko, Victor Lempitsky
19. Learning Deep Object Detectors From 3D Models
    - Xingchao Peng, Baochen Sun, Karim Ali, Kate Saenko
20. Harvesting Discriminative Meta Objects With Deep CNN Features for Scene Classification
    - Ruobing Wu, Baoyuan Wang, Wenping Wang, Yizhou Yu
21. Scalable Nonlinear Embeddings for Semantic Category-Based Image Retrieval
    - Gaurav Sharma, Bernt Schiele
    - image retrieval

## Segmentation, Edges and Saliency

1. Semantic Image Segmentation via Deep Parsing Network
    - Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen-Change Loy, Xiaoou Tang
2. Human Parsing With Contextualized Convolutional Neural Network
    - Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan
3. Holistically-Nested Edge Detection
    - Saining Xie, Zhuowen Tu

## Learning Representations & Attributes

1. Learning Image Representations Tied to Ego-Motion
    - Dinesh Jayaraman, Kristen Grauman
    - representation learning
2. Unsupervised Visual Representation Learning by Context Prediction
    - Carl Doersch, Abhinav Gupta, Alexei A. Efros
3. Webly Supervised Learning of Convolutional Networks
    - Xinlei Chen, Abhinav Gupta
4. Fast R-CNN
    - Ross Girshick
5. Bilinear CNN Models for Fine-Grained Visual Recognition
    - Tsung-Yu Lin, Aruni RoyChowdhury, Subhransu Maji

## Statistical Methods & Learning

1. Deep Neural Decision Forests
    - Peter Kontschieder, Madalina Fiterau, Antonio Criminisi, Samuel Rota Bulò
2. Deep Fried Convnets
    - Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alex Smola, Le Song, Ziyu Wang
3. Semantic Component Analysis
    - Calvin Murdock, Fernando De la Torre
4. Learning Discriminative Reconstructions for Unsupervised Outlier Removal
    - Yan Xia, Xudong Cao, Fang Wen, Gang Hua, Jian Sun

## Optimization, Segmentation, and Recognition

1. Learning Deconvolution Network for Semantic Segmentation
    - Hyeonwoo Noh, Seunghoon Hong, Bohyung Han
2. Conditional Random Fields as Recurrent Neural Networks
    - Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr
3. Boosting Object Proposals: From Pascal to COCO
    - Jordi Pont-Tuset, Luc Van Gool
4. Joint Object and Part Segmentation Using Deep Learned Potentials
    - Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan L. Yuille
5. BodyPrint: Pose Invariant 3D Shape Matching of Human Bodies
    - Jiangping Wang, Kai Ma, Vivek Kumar Singh, Thomas Huang, Terrence Chen
11. Contour Guided Hierarchical Model for Shape Matching
    - Yuanqi Su, Yuehu Liu, Bonan Cuan, Nanning Zheng
12. Robust Image Segmentation Using Contour-Guided Color Palettes
    - Xiang Fu, Chien-Yi Wang, Chen Chen, Changhu Wang, C.-C. Jay Kuo
14. BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation
    - Jifeng Dai, Kaiming He, Jian Sun
15. Detection and Segmentation of 2D Curved Reflection Symmetric Structures
    - Ching L. Teo, Cornelia Fermüller, Yiannis Aloimonos
16. Unsupervised Tube Extraction Using Transductive Learning and Dense Trajectories
    - Mihai Marian Puscas, Enver Sangineto, Dubravko Culibrk, Nicu Sebe
17. Compositional Hierarchical Representation of Shape Manifolds for Classification of Non-Manifold Shapes
    - Mete Ozay, Umit Rusen Aktas, Jeremy L. Wyatt, Aleš Leonardis
19. Learning to Combine Mid-Level Cues for Object Proposal Generation
    - Tom Lee, Sanja Fidler, Sven Dickinson
20. Enhancing Road Maps by Parsing Aerial Images Around the World
    - Gellért Máttyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun
24. StereoSnakes: Contour Based Consistent Object Extraction For Stereo Images
    - Ran Ju, Tongwei Ren, Gangshan Wu
25. Semantic Segmentation of RGBD Images With Mutex Constraints
    - Zhuo Deng, Sinisa Todorovic, Longin Jan Latecki
26. Weakly- and Semi-Supervised Learning of a Deep Convolutional Network for Semantic Image Segmentation
    - George Papandreou, Liang-Chieh Chen, Kevin P. Murphy, Alan L. Yuille
28. Parsimonious Labeling
    - Puneet K. Dokania, M. Pawan Kumar
32. Constrained Convolutional Neural Networks for Weakly Supervised Segmentation
    - Deepak Pathak, Philipp Krähenbühl, Trevor Darrell
35. Convolutional Sparse Coding for Image Super-Resolution
    - Shuhang Gu, Wangmeng Zuo, Qi Xie, Deyu Meng, Xiangchu Feng, Lei Zhang
40. Depth-Based Hand Pose Estimation: Data, Methods, and Challenges
    - James S. Supančič III, Grégory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan
43. Learning Deep Representation With Large-Scale Attributes
    - Wanli Ouyang, Hongyang Li, Xingyu Zeng, Xiaogang Wang
44. Deep Learning Strong Parts for Pedestrian Detection
    - Yonglong Tian, Ping Luo, Xiaogang Wang, Xiaoou Tang
45. Flowing ConvNets for Human Pose Estimation in Videos
    - Tomas Pfister, James Charles, Andrew Zisserman
47. BubbLeNet: Foveated Imaging for Visual Discovery
    - Kevin Matzen, Noah Snavely
49. Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions
    - Sven Bambach, Stefan Lee, David J. Crandall, Chen Yu
53. Relaxing From Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging
    - Jianlong Fu, Yue Wu, Tao Mei, Jinqiao Wang, Hanqing Lu, Yong Rui
54. Visual Phrases for Exemplar Face Detection
    - Vijay Kumar, Anoop Namboodiri, C. V. Jawahar
55. Spatial Semantic Regularisation for Large Scale Object Detection
    - Damian Mrowca, Marcus Rohrbach, Judy Hoffman, Ronghang Hu, Kate Saenko, Trevor Darrell
56. Human Pose Estimation in Videos
    - Dong Zhang, Mubarak Shah
57. Contour Box: Rejecting Object Proposals Without Explicit Closed Contours
    - Cewu Lu, Shu Liu, Jiaya Jia, Chi-Keung Tang

## Recognition and 3D CV

2. Joint Camera Clustering and Surface Segmentation for Large-Scale Multi-View Stereo
    - Runze Zhang, Shiwei Li, Tian Fang, Siyu Zhu, Long Quan
4. Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition
    - Torsten Sattler, Michal Havlena, Filip Radenović, Konrad Schindler, Marc Pollefeys
5. Globally Optimal 2D-3D Registration From Points or Lines Without Correspondences
    - Mark Brown, David Windridge, Jean-Yves Guillemaut
10. Semantically-Aware Aerial Reconstruction From Multi-Modal Data
    - Randi Cabezas, Julian Straub, John W. Fisher III
15. Exploiting Object Similarity in 3D Reconstruction
    - Chen Zhou, Fatma Güney, Yizhou Wang, Andreas Geiger
16. You Are Here: Mimicking the Human Thinking Process in Reading Floor-Plans
    - Hang Chu, Dong Ki Kim, Tsuhan Chen
24. The Likelihood-Ratio Test and Efficient Robust Estimation
    - Andrea Cohen, Christopher Zach
35. Real-Time Pose Estimation Piggybacked on Object Detection
    - Roman Juránek, Adam Herout, Markéta Dubská, Pavel Zemčík
36. Understanding and Predicting Image Memorability at a Large Scale
    - Aditya Khosla, Akhil S. Raju, Antonio Torralba, Aude Oliva
37. Multiple Granularity Descriptors for Fine-Grained Categorization
    - Dequan Wang, Zhiqiang Shen, Jie Shao, Wei Zhang, Xiangyang Xue, Zheng Zhang
38. Guiding the Long-Short Term Memory Model for Image Caption Generation
    - Xu Jia, Efstratios Gavves, Basura Fernando, Tinne Tuytelaars
39. Just Noticeable Differences in Visual Attributes
    - Aron Yu, Kristen Grauman
40. VQA: Visual Question Answering
    - Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh
41. Localize Me Anywhere, Anytime: A Multi-Task Point-Retrieval Approach
    - Guoyu Lu, Yan Yan, Li Ren, Jingkuan Song, Nicu Sebe, Chandra Kambhamettu
42. Dense Optical Flow Prediction From a Static Image
    - Jacob Walker, Abhinav Gupta, Martial Hebert
44. Visual Madlibs: Fill in the Blank Description Generation and Question Answering
    - Licheng Yu, Eunbyung Park, Alexander C. Berg, Tamara L. Berg
45. Actions and Attributes From Wholes and Parts
    - Georgia Gkioxari, Ross Girshick, Jitendra Malik
46. DeepBox: Learning Objectness With Convolutional Networks
    - Weicheng Kuo, Bharath Hariharan, Jitendra Malik
47. Active Object Localization With Deep Reinforcement Learning
    - Juan C. Caicedo, Svetlana Lazebnik
48. Scene-Domain Active Part Models for Object Representation
    - Zhou Ren, Chaohui Wang, Alan L. Yuille
49. A Unified Multiplicative Framework for Attribute Learning
    - Kongming Liang, Hong Chang, Shiguang Shan, Xilin Chen
50. Contractive Rectifier Networks for Nonlinear Maximum Margin Classification
    - Senjian An, Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel
51. Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization
    - Zhe Xu, Shaoli Huang, Ya Zhang, Dacheng Tao
52. Learning Like a Child: Fast Novel Visual Concept Learning From Sentence Descriptions of Images
    - Junhua Mao, Xu Wei, Yi Yang, Jiang Wang, Zhiheng Huang, Alan L. Yuille
53. Learning Common Sense Through Visual Abstraction
    - Ramakrishna Vedantam, Xiao Lin, Tanmay Batra, C. Lawrence Zitnick, Devi Parikh
54. Domain Generalization for Object Recognition With Multi-Task Autoencoders
    - Muhammad Ghifary, W. Bastiaan Kleijn, Mengjie Zhang, David Balduzzi
55. Square Localization for Efficient and Accurate Object Detection
    - Cewu Lu, Yongyi Lu, Hao Chen, Chi-Keung Tang
56. Box Aggregation for Proposal Decimation: Last Mile of Object Detection
    - Shu Liu, Cewu Lu, Jiaya Jia
57. DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers
    - Amir Ghodrati, Ali Diba, Marco Pedersoli, Tinne Tuytelaars, Luc Van Gool
58. Semantic Segmentation With Object Clique Potential
    - Xiaojuan Qi, Jianping Shi, Shu Liu, Renjie Liao, Jiaya Jia
59. Automatic Concept Discovery From Parallel Text and Visual Corpora
    - Chen Sun, Chuang Gan, Ram Nevatia
61. Monocular Object Instance Segmentation and Depth Ordering With CNNs
    - Ziyu Zhang, Alexander G. Schwing, Sanja Fidler, Raquel Urtasun
62. Multimodal Convolutional Neural Networks for Matching Image and Sentence
    - Lin Ma, Zhengdong Lu, Lifeng Shang, Hang Li
64. Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
    - Bryan A. Plummer, Liwei Wang, Chris M. Cervantes, Juan C. Caicedo, Julia Hockenmaier, Svetlana Lazebnik
65. Predicting Depth, Surface Normals and Semantic Labels With a Common Multi-Scale Convolutional Architecture
    - David Eigen, Rob Fergus
66. AttentionNet: Aggregating Weak Directions for Accurate Object Detection
    - Donggeun Yoo, Sunggyun Park, Joon-Young Lee, Anthony S. Paek, In So Kweon
67. Common Subspace for Model and Similarity: Phrase Learning for Caption Generation From Images
    - Yoshitaka Ushiku, Masataka Yamaguchi, Yusuke Mukuta, Tatsuya Harada

## Representations for Recognition & Localization

1. 3D-Assisted Feature Synthesis for Novel Views of an Object
    - Hao Su, Fan Wang, Eric Yi, Leonidas J. Guibas
2. Render for CNN: Viewpoint Estimation in Images Using CNNs Trained With Rendered 3D Model Views
    - Hao Su, Charles R. Qi, Yangyan Li, Leonidas J. Guibas

## Statistical Methods & Learning, Motion & Tracking, and Video Analysis

2. DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving
    - Chenyi Chen, Ari Seff, Alain Kornhauser, Jianxiong Xiao
3. Active Transfer Learning With Zero-Shot Priors: Reusing Past Datasets for Future Tasks
    - Efstratios Gavves, Thomas Mensink, Tatiana Tommasi, Cees G. M. Snoek, Tinne Tuytelaars
4. HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition
    - Zhicheng Yan, Hao Zhang, Robinson Piramuthu, Vignesh Jagadeesh, Dennis DeCoste, Wei Di, Yizhou Yu
5. Learning The Structure of Deep Convolutional Networks
    - Jiashi Feng, Trevor Darrell
6. FlowNet: Learning Optical Flow With Convolutional Networks
    - Alexey Dosovitskiy, Philipp Fischer, Eddy Ilg, Philip Häusser, Caner Hazırbaş, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, Thomas Brox
10. Unsupervised Learning of Visual Representations Using Videos
    - Xiaolong Wang, Abhinav Gupta
11. A Nonparametric Bayesian Approach Toward Stacked Convolutional Independent Component Analysis
    - Sotirios P. Chatzis, Dimitrios Kosmopoulos
14. Robust Optimization for Deep Regression
    - Vasileios Belagiannis, Christian Rupprecht, Gustavo Carneiro, Nassir Navab
16. Maximum-Margin Structured Learning With Deep Networks for 3D Human Pose Estimation
    - Sijin Li, Weichen Zhang, Antoni B. Chan
17. An Exploration of Parameter Redundancy in Deep Networks With Circulant Projections
    - Yu Cheng, Felix X. Yu, Rogerio S. Feris, Sanjiv Kumar, Alok Choudhary, Shi-Fu Chang
19. Understanding Deep Features With Computer-Generated Imagery
    - Mathieu Aubry, Bryan C. Russell
21. Context-Aware CNNs for Person Head Detection
    - Tuan-Hung Vu, Anton Osokin, Ivan Laptev
23. Highly-Expressive Spaces of Well-Behaved Transformations: Keeping It Simple
    - Oren Freifeld, Søren Hauberg, Kayhan Batmanghelich, John W. Fisher III
26. PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization
    - Alex Kendall, Matthew Grimes, Roberto Cipolla
27. Predicting Multiple Structured Visual Interpretations
    - Debadeepta Dey, Varun Ramakrishna, Martial Hebert, J. Andrew Bagnell
28. Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks
    - Chunshui Cao, Xianming Liu, Yi Yang, Yinan Yu, Jiang Wang, Zilei Wang, Yongzhen Huang, Liang Wang, Chang Huang, Wei Xu, Deva Ramanan, Thomas S. Huang
29. Matrix Backpropagation for Deep Networks With Structured Layers
    - Catalin Ionescu, Orestis Vantzos, Cristian Sminchisescu
31. Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition
    - Heechul Jung, Sihaeng Lee, Junho Yim, Sunjeong Park, Junmo Kim
32. Direct Intrinsics: Learning Albedo-Shading Decomposition by Convolutional Regression
    - Takuya Narihira, Michael Maire, Stella X. Yu
33. Face Flow
    - Patrick Snape, Anastasios Roussos, Yannis Panagakis, Stefanos Zafeiriou
42. Hierarchical Convolutional Features for Visual Tracking
    - Chao Ma, Jia-Bin Huang, Xiaokang Yang, Ming-Hsuan Yang
44. Online Object Tracking With Proposal Selection
    - Yang Hua, Karteek Alahari, Cordelia Schmid
45. Understanding and Diagnosing Visual Tracking Systems
    - Naiyan Wang, Jianping Shi, Dit-Yan Yeung, Jiaya Jia
47. Visual Tracking With Fully Convolutional Networks
    - Lijun Wang, Wanli Ouyang, Xiaogang Wang, Huchuan Lu
48. Multiple Feature Fusion via Weighted Entropy for Visual Tracking
    - Lin Ma, Jiwen Lu, Jianjiang Feng, Jie Zhou
49. Pedestrian Travel Time Estimation in Crowded Scenes
    - Shuai Yi, Hongsheng Li, Xiaogang Wang
52. Learning to Track for Spatio-Temporal Action Localization
    - Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid
53. Unsupervised Object Discovery and Tracking in Video Collections
    - Suha Kwak, Minsu Cho, Ivan Laptev, Jean Ponce, Cordelia Schmid
54. Car That Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models
    - Ashesh Jain, Hema S. Koppula, Bharad Raghavan, Shane Soh, Ashutosh Saxena
58. P-CNN: Pose-Based CNN Features for Action Recognition
    - Guilhem Chéron, Ivan Laptev, Cordelia Schmid
59. Fully Connected Object Proposals for Video Segmentation
    - Federico Perazzi, Oliver Wang, Markus Gross, Alexander Sorkine-Hornung
60. Video Segmentation With Just a Few Strokes
    - Naveen Shankar Nagaraja, Frank R. Schmidt, Thomas Brox
61. Actionness-Assisted Recognition of Actions
    - Ye Luo, Loong-Fah Cheong, An Tran
66. RGB-W: When Vision Meets Wireless
    - Alexandre Alahi, Albert Haque, Li Fei-Fei
68. Simultaneous Foreground Detection and Classification With Hybrid Features
    - Jaemyun Kim, Adín Ramírez Rivera, Byungyong Ryu, Oksam Chae

## Vision & People

1. Training a Feedback Loop for Hand Pose Estimation
    - Markus Oberweger, Paul Wohlhart, Vincent Lepetit
2. Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose
    - Danhang Tang, Jonathan Taylor, Pushmeet Kohli, Cem Keskin, Tae-Kyun Kim, Jamie Shotton
4. Where to Buy It: Matching Street Clothing Photos in Online Shops
    - M. Hadi Kiapour, Xufeng Han, Svetlana Lazebnik, Alexander C. Berg, Tamara L. Berg
5. Multi-Task Recurrent Neural Network for Immediacy Prediction
    - Xiao Chu, Wanli Ouyang, Wei Yang, Xiaogang Wang
6. Learning Complexity-Aware Cascades for Deep Pedestrian Detection
    - Zhaowei Cai, Mohammad Saberian, Nuno Vasconcelos

## Computational Photography, Face & Gesture, and Vision for X

4. TransCut: Transparent Object Segmentation From a Light-Field Image
    - Yichao Xu, Hajime Nagahara, Atsushi Shimada, Rin-ichiro Taniguchi
7. Learning Data-Driven Reflectance Priors for Intrinsic Image Decomposition
    - Tinghui Zhou, Philipp Krähenbühl, Alexei A. Efros
12. Intrinsic Depth: Improving Depth Transfer With Intrinsic Images
    - Naejin Kong, Michael J. Black
23. Selective Encoding for Recognizing Unreliably Localized Faces
    - Ang Li, Vlad Morariu, Larry S. Davis
24. Confidence Preserving Machine for Facial Action Unit Detection
    - Jiabei Zeng, Wen-Sheng Chu, Fernando De la Torre, Jeffrey F. Cohn, Zhang Xiong
25. Learning Social Relation Traits From Face Images
    - Zhanpeng Zhang, Ping Luo, Chen-Change Loy, Xiaoou Tang
26. Robust Heart Rate Measurement From Video Using Select Random Patches
    - Antony Lam, Yoshinori Kuno
28. Robust Facial Landmark Detection Under Significant Head Poses and Occlusion
    - Yue Wu, Qiang Ji
29. Conditional Convolutional Neural Network for Modality-Aware Face Recognition
    - Chao Xiong, Xiaowei Zhao, Danhang Tang, Karlekar Jayashree, Shuicheng Yan, Tae-Kyun Kim
30. From Facial Parts Responses to Face Detection: A Deep Learning Approach
    - Shuo Yang, Ping Luo, Chen-Change Loy, Xiaoou Tang
32. Pose-Invariant 3D Face Alignment
    - Amin Jourabloo, Xiaoming Liu
33. From Emotions to Action Units With Hidden and Semi-Hidden-Task Learning
    - Adrià Ruiz, Joost Van de Weijer, Xavier Binefa
36. Deep Learning Face Attributes in the Wild
    - Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang
37. Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification
    - Chi Su, Fan Yang, Shiliang Zhang, Qi Tian, Larry S. Davis, Wen Gao
38. Regressing a 3D Face Shape From a Single Image
    - Sergey Tulyakov, Nicu Sebe
45. A Spatio-Temporal Appearance Representation for Video-Based Pedestrian Re-Identification
    - Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang
48. Discriminative Pose-Free Descriptors for Face and Object Matching
    - Soubhik Sanyal, Sivaram Prasad Mudunuri, Soma Biswas
49. Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation
    - Meina Kan, Shiguang Shan, Xilin Chen
51. Person Recognition in Personal Photo Collections
    - Seong Joon Oh, Rodrigo Benenson, Mario Fritz, Bernt Schiele
56. Learning to Predict Saliency on Face Images
    - Mai Xu, Yun Ren, Zulin Wang
57. Group Membership Prediction
    - Ziming Zhang, Yuting Chen, Venkatesh Saligrama
59. Robust RGB-D Odometry Using Point and Line Features
    - Yan Lu, Dezhen Song
60. Learning a Discriminative Model for the Perception of Realism in Composite Images
    - Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, Alexei A. Efros
61. What Makes Tom Hanks Look Like Tom Hanks
    - Supasorn Suwajanakorn, Steven M. Seitz, Ira Kemelmacher-Shlizerman
63. Personalized Age Progression With Aging Dictionary
    - Xiangbo Shu, Jinhui Tang, Hanjiang Lai, Luoqi Liu, Shuicheng Yan
64. FaceDirector: Continuous Control of Facial Performance in Video
    - Charles Malleson, Jean-Charles Bazin, Oliver Wang, Derek Bradley, Thabo Beeler, Adrian Hilton, Alexander Sorkine-Hornung
65. Synthesizing Illumination Mosaics From Internet Photo-Collections
    - Dinghuang Ji, Enrique Dunn, Jan-Michael Frahm
66. Hot or Not: Exploring Correlations Between Appearance and Temperature
    - Daniel Glasner, Pascal Fua, Todd Zickler, Lihi Zelnik-Manor

## Motion & Correspondence

3. Dense Semantic Correspondence Where Every Pixel is a Classifier
    - Hilton Bristow, Jack Valmadre, Simon Lucey

## Statistical Methods & Learning, Motion & Tracking, and Video Analysis II

1. Differential Recurrent Neural Networks for Action Recognition
    - Vivek Veeriah, Naifan Zhuang, Guo-Jun Qi
4. Simultaneous Deep Transfer Across Domains and Tasks
    - Eric Tzeng, Judy Hoffman, Trevor Darrell, Kate Saenko
5. Low Dimensional Explicit Feature Maps
    - Ondřej Chum
6. Unsupervised Learning of Spatiotemporally Coherent Metrics
    - Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann LeCun
7. Multi-Label Cross-Modal Retrieval
    - Viresh Ranjan, Nikhil Rasiwasia, C. V. Jawahar
10. Unsupervised Domain Adaptation With Imbalanced Cross-Domain Data
    - Tzu Ming Harry Hsu, Wei Yu Chen, Cheng-An Hou, Yao-Hung Hubert Tsai, Yi-Ren Yeh, Yu-Chiang Frank Wang
12. Geometry-Aware Deep Transform
    - Jiaji Huang, Qiang Qiu, Robert Calderbank, Guillermo Sapiro
15. Zero-Shot Learning via Semantic Similarity Embedding
    - Ziming Zhang, Venkatesh Saligrama
18. Multi-View Domain Generalization for Visual Recognition
    - Li Niu, Wen Li, Dong Xu
19. Infinite Feature Selection
    - Giorgio Roffo, Simone Melzi, Marco Cristani
20. Semi-Supervised Zero-Shot Classification With Label Representation Learning
    - Xin Li, Yuhong Guo, Dale Schuurmans
24. Predicting Deep Zero-Shot Convolutional Neural Networks Using Textual Descriptions
    - Jimmy Lei Ba, Kevin Swersky, Sanja Fidler, Ruslan Salakhutdinov
25. Structured Feature Selection
    - Tian Gao, Ziheng Wang, Qiang Ji
26. Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning
    - Yan Huang, Wei Wang, Liang Wang
27. Learning Image and User Features for Recommendation in Social Networks
    - Xue Geng, Hanwang Zhang, Jingwen Bian, Tat-Seng Chua
28. Dual-Feature Warping-Based Motion Model Estimation
    - Shiwei Li, Lu Yuan, Jian Sun, Long Quan
29. An Adaptive Data Representation for Robust Point-Set Registration and Merging
    - Dylan Campbell, Lars Petersson
31. Learning Spatially Regularized Correlation Filters for Visual Tracking
    - Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg
32. SpeDo: 6 DOF Ego-Motion Sensor Using Speckle Defocus Imaging
    - Kensei Jo, Mohit Gupta, Shree K. Nayar
35. Recurrent Network Models for Human Dynamics
    - Katerina Fragkiadaki, Sergey Levine, Panna Felsen, Jitendra Malik
36. Contour Flow: Middle-Level Motion Estimation by Combining Motion Segmentation and Contour Alignment
    - Huijun Di, Qingxuan Shi, Feng Lv, Ming Qin, Yao Lu
39. Minimizing Human Effort in Interactive Tracking by Incremental Learning of Model Parameters
    - Arridhana Ciptadi, James M. Rehg
40. A Novel Representation of Parts for Accurate 3D Object Detection and Tracking in Monocular Images
    - Alberto Crivellaro, Mahdi Rad, Yannick Verdie, Kwang Moo Yi, Pascal Fua, Vincent Lepetit
41. Linearization to Nonlinear Learning for Visual Tracking
    - Bo Ma, Hongwei Hu, Jianbing Shen, Yuping Zhang, Fatih Porikli
42. Self-Occlusions and Disocclusions in Causal Video Object Segmentation
    - Yanchao Yang, Ganesh Sundaramoorthi, Stefano Soatto
43. Large Displacement 3D Scene Flow With Occlusion Reasoning
    - Andrei Zanfir, Cristian Sminchisescu
46. Category-Blind Human Action Recognition: A Practical Recognition System
    - Wenbo Li, Longyin Wen, Mooi Choo Chuah, Siwei Lyu
48. Weakly-Supervised Alignment of Video With Text
    - Piotr Bojanowski, Rémi Lajugie, Edouard Grave, Francis Bach, Ivan Laptev, Jean Ponce, Cordelia Schmid
49. Learning Temporal Embeddings for Complex Video Analysis
    - Vignesh Ramanathan, Kevin Tang, Greg Mori, Li Fei-Fei
50. Unsupervised Semantic Parsing of Video Collections
    - Ozan Sener, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena
51. Learning Spatiotemporal Features With 3D Convolutional Networks
    - Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri
52. Temporal Perception and Prediction in Ego-Centric Video
    - Yipin Zhou, Tamara L. Berg
53. Describing Videos by Exploiting Temporal Structure
    - Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville
55. Storyline Representation of Egocentric Videos With an Applications to Story-Based Search
    - Bo Xiong, Gunhee Kim, Leonid Sigal
56. Sequence to Sequence – Video to Text
    - Subhashini Venugopalan, Marcus Rohrbach, Jeffrey Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko
58. Action Recognition by Hierarchical Mid-Level Action Elements
    - Tian Lan, Yuke Zhu, Amir Roshan Zamir, Silvio Savarese
63. Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks
    - Lin Sun, Kui Jia, Dit-Yan Yeung, Bertram E. Shi
66. Love Thy Neighbors: Image Annotation by Exploiting Image Metadata
    - Justin Johnson, Lamberto Ballan, Li Fei-Fei
67. Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-Encoders
    - Huan Yang, Baoyuan Wang, Stephen Lin, David Wipf, Minyi Guo, Baining Guo

## Video -- Actions, Surveillance & Tracking

1. Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations From Surveillance Videos
    - Elisa Ricci, Jagannadan Varadarajan, Ramanathan Subramanian, Samuel Rota Bulò, Narendra Ahuja, Oswald Lanz
2. Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off!
    - Bilge Soran, Ali Farhadi, Linda Shapiro
3. Partial Person Re-Identification
    - Wei-Shi Zheng, Xiang Li, Tao Xiang, Shengcai Liao, Jianhuang Lai, Shaogang Gong
5. Multiple Hypothesis Tracking Revisited
    - Chanho Kim, Fuxin Li, Arridhana Ciptadi, James M. Rehg
6. Learning to Track: Online Multi-Object Tracking by Decision Making
    - Yu Xiang, Alexandre Alahi, Silvio Savarese