|  |  | @@ -0,0 +1,781 @@ | 
    
    |  |  | ## Vision & Language | 
    
    |  |  | 
 | 
    
    |  |  | - Ask Your Neurons: A Neural-Based Approach to Answering Questions About Images | 
    
    |  |  | - Mateusz Malinowski, Marcus Rohrbach, Mario Fritz | 
    
    |  |  | 
 | 
    
    |  |  | - Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books | 
    
    |  |  | - Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler | 
    
    |  |  | 
 | 
    
    |  |  | - Learning Query and Image Similarities With Ranking Canonical Correlation Analysis | 
    
    |  |  | - Ting Yao, Tao Mei, Chong-Wah Ngo | 
    
    |  |  | 
 | 
    
    |  |  | ## Recognition, Low-Level Vision, and Biomedical Image Analysis | 
    
    |  |  | 
 | 
    
    |  |  | 1. Learning to See by Moving | 
    
    |  |  | - Pulkit Agrawal, Joao Carreira, Jitendra Malik | 
    
    |  |  | - scene recognition, object recognition, visual odometry, keypoint matching -- representation (feature) learning | 
    
    |  |  | 
 | 
    
    |  |  | 2. Convolutional Channel Features | 
    
    |  |  | - Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li | 
    
    |  |  | - pedestrian detection, face detection, edge detection, object proposal generation -- representation learning | 
    
    |  |  | 
 | 
    
    |  |  | 3. Local Convolutional Features With Unsupervised Training for Image Retrieval | 
    
    |  |  | - Mattis Paulin, Matthijs Douze, Zaid Harchaoui, Julien Mairal, Florent Perronnin, Cordelia Schmid | 
    
    |  |  | - patch descriptor learning, image retrieval | 
    
    |  |  | 
 | 
    
    |  |  | 4. Discriminative Learning of Deep Convolutional Feature Point Descriptors | 
    
    |  |  | - Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Pascal Fua, Francesc Moreno-Noguer | 
    
    |  |  | - patch-level feature learning | 
    
    |  |  | 
 | 
    
    |  |  | 5. SALICON: Reducing the Semantic Gap in Saliency Prediction by Adapting Deep Neural Networks [[Paper]](http://www.xunhuang.org/research) | 
    
    |  |  | - Xun Huang, Chengyao Shen, Xavier Boix, Qi Zhao | 
    
    |  |  | - saliency detection | 
    
    |  |  | 
 | 
    
    |  |  | 6. Deep Networks for Image Super-Resolution With Sparse Prior | 
    
    |  |  | - Zhaowen Wang, Ding Liu, Jianchao Yang, Wei Han, Thomas Huang | 
    
    |  |  | - image super-resolution | 
    
    |  |  | 
 | 
    
    |  |  | 7. Learning Ordinal Relationships for Mid-Level Vision | 
    
    |  |  | - Daniel Zoran, Phillip Isola, Dilip Krishnan, William T. Freeman | 
    
    |  |  | - intrinsic image decomposition, depth from single image | 
    
    |  |  | 
 | 
    
    |  |  | 8. Deep Colorization | 
    
    |  |  | - Zezhou Cheng, Qingxiong Yang, Bin Sheng | 
    
    |  |  | - image colorization | 
    
    |  |  | 
 | 
    
    |  |  | 9. High-for-Low and Low-for-High: Efficient Boundary Detection From Deep Object Features and its Applications to High-Level Vision | 
    
    |  |  | - Gedas Bertasius, Jianbo Shi, Lorenzo Torresani | 
    
    |  |  | - **boundary detection**, semantic boundary labeling, semantic segmentation | 
    
    |  |  | 
 | 
    
    |  |  | 10. Video Super-Resolution via Deep Draft-Ensemble Learning | 
    
    |  |  | - Renjie Liao, Xin Tao, Ruiyu Li, Ziyang Ma, Jiaya Jia | 
    
    |  |  | - video super-resolution | 
    
    |  |  | 
 | 
    
    |  |  | 11. Compression Artifacts Reduction by a Deep Convolutional Network | 
    
    |  |  | - Chao Dong, Yubin Deng, Chen Change Loy, Xiaoou Tang | 
    
    |  |  | - JPEG artifact reduction | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Recognition and 3D Computer Vision | 
    
    |  |  | 
 | 
    
    |  |  | 1. Semantic Pose Using Deep Networks Trained on Synthetic RGB-D | 
    
    |  |  | - Jeremie Papon, Markus Schoeler | 
    
    |  |  | - indoor scene understanding from RGB-D | 
    
    |  |  | 
 | 
    
    |  |  | 2. Learning Informative Edge Maps for Indoor Scene Layout Prediction | 
    
    |  |  | - Arun Mallya, Svetlana Lazebnik | 
    
    |  |  | - edge map prediction, indoor scene layout prediction | 
    
    |  |  | 
 | 
    
    |  |  | 3. Multi-View Convolutional Neural Networks for 3D Shape Recognition | 
    
    |  |  | - Hang Su, Subhransu Maji, Evangelos Kalogerakis, Erik Learned-Miller | 
    
    |  |  | - 3D shape classification and retrieval, 3D shape descriptor | 
    
    |  |  | 
 | 
    
    |  |  | 4. Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images | 
    
    |  |  | - Alexander Krull, Eric Brachmann, Frank Michel, Michael Ying Yang, Stefan Gumhold, Carsten Rother | 
    
    |  |  | - 6D pose estimation | 
    
    |  |  | 
 | 
    
    |  |  | 5. A Deep Visual Correspondence Embedding Model for Stereo Matching Costs [[KITTI-submission](http://www.cvlibs.net/datasets/kitti/eval_stereo_flow_detail.php?benchmark=stereo&error=2&eval=all&result=810169a667c1d8f712ce4c82969a5e9b8b4956c8)] | 
    
    |  |  | - Zhuoyuan Chen, Xun Sun, Liang Wang, Yinan Yu, Chang Huang | 
    
    |  |  | 
 | 
    
    |  |  | 6. Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation | 
    
    |  |  | - Xin Lu, Zhe Lin, Xiaohui Shen, Radomír Měch, James Z. Wang | 
    
    |  |  | - image style recognition, aesthetic quality categorization, image quality estimation | 
    
    |  |  | 
 | 
    
    |  |  | 7. Improving Image Classification With Location Context | 
    
    |  |  | - Kevin Tang, Manohar Paluri, Li Fei-Fei, Rob Fergus, Lubomir Bourdev | 
    
    |  |  | - image (scene) classification | 
    
    |  |  | 
 | 
    
    |  |  | 8. HICO: A Benchmark for Recognizing Human-Object Interactions in Images | 
    
    |  |  | - Yu-Wei Chao, Zhan Wang, Yugeng He, Jiaxuan Wang, Jia Deng | 
    
    |  |  | - benchmark paper | 
    
    |  |  | 
 | 
    
    |  |  | 9. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification | 
    
    |  |  | - Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun | 
    
    |  |  | - **ImageNet Classification** | 
    
    |  |  | 
 | 
    
    |  |  | 10. Cross-Domain Image Retrieval With a Dual Attribute-Aware Ranking Network | 
    
    |  |  | - Junshi Huang, Rogerio S. Feris, Qiang Chen, Shuicheng Yan | 
    
    |  |  | - clothing detection/retrieval | 
    
    |  |  | 
 | 
    
    |  |  | 11. Contextual Action Recognition With R*CNN | 
    
    |  |  | - Georgia Gkioxari, Ross Girshick, Jitendra Malik | 
    
    |  |  | - action recognition | 
    
    |  |  | 
 | 
    
    |  |  | 46. What Makes an Object Memorable? | 
    
    |  |  | - Rachit Dubey, Joshua Peterson, Aditya Khosla, Ming-Hsuan Yang, Bernard Ghanem | 
    
    |  |  | 
 | 
    
    |  |  | 49. Scalable Person Re-Identification: A Benchmark | 
    
    |  |  | - Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, Qi Tian | 
    
    |  |  | 
 | 
    
    |  |  | 50. MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition | 
    
    |  |  | - Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham | 
    
    |  |  | 
 | 
    
    |  |  | 51. Object Detection via a Multi-Region and Semantic Segmentation-Aware CNN Model | 
    
    |  |  | - Spyros Gidaris, Nikos Komodakis | 
    
    |  |  | 
 | 
    
    |  |  | 52. Neural Activation Constellations: Unsupervised Part Model Discovery With Convolutional Networks | 
    
    |  |  | - Marcel Simon, Erik Rodner | 
    
    |  |  | 
 | 
    
    |  |  | 53. Cascaded Sparse Spatial Bins for Efficient and Effective Generic Object Detection | 
    
    |  |  | - David Novotny, Jiří Matas | 
    
    |  |  | 
 | 
    
    |  |  | 56. Task-Driven Feature Pooling for Image Classification | 
    
    |  |  | - Guo-Sen Xie, Xu-Yao Zhang, Xiangbo Shu, Shuicheng Yan, Cheng-Lin Liu | 
    
    |  |  | 
 | 
    
    |  |  | 57. Cutting Edge: Soft Correspondences in Multimodal Scene Parsing | 
    
    |  |  | - Sarah Taghavi Namin, Mohammad Najafi, Mathieu Salzmann, Lars Petersson | 
    
    |  |  | 
 | 
    
    |  |  | 58. One Shot Learning via Compositions of Meaningful Patches | 
    
    |  |  | - Alex Wong, Alan L. Yuille | 
    
    |  |  | 
 | 
    
    |  |  | 59. FASText: Efficient Unconstrained Scene Text Detector | 
    
    |  |  | - Michal Bušta, Lukáš Neumann, Jiří Matas | 
    
    |  |  | 
 | 
    
    |  |  | 60. Multi-Scale Recognition With DAG-CNNs | 
    
    |  |  | - Songfan Yang, Deva Ramanan | 
    
    |  |  | 
 | 
    
    |  |  | 62. Im2Calories: Towards an Automated Mobile Vision Food Diary | 
    
    |  |  | - Austin Meyers, Nick Johnston, Vivek Rathod, Anoop Korattikara, Alex Gorban, Nathan Silberman, Sergio Guadarrama, George Papandreou, Jonathan Huang, Kevin P. Murphy | 
    
    |  |  | 
 | 
    
    |  |  | 66. Aggregating Local Deep Features for Image Retrieval | 
    
    |  |  | - Artem Babenko, Victor Lempitsky | 
    
    |  |  | 
 | 
    
    |  |  | 67. Learning Deep Object Detectors From 3D Models | 
    
    |  |  | - Xingchao Peng, Baochen Sun, Karim Ali, Kate Saenko | 
    
    |  |  | 
 | 
    
    |  |  | 68. Harvesting Discriminative Meta Objects With Deep CNN Features for Scene Classification | 
    
    |  |  | - Ruobing Wu, Baoyuan Wang, Wenping Wang, Yizhou Yu | 
    
    |  |  | 
 | 
    
    |  |  | 69. Scalable Nonlinear Embeddings for Semantic Category-Based Image Retrieval | 
    
    |  |  | - Gaurav Sharma, Bernt Schiele | 
    
    |  |  | 
 | 
    
    |  |  | 71. Unsupervised Generation of a Viewpoint Annotated Car Dataset From Videos | 
    
    |  |  | - Nima Sedaghat, Thomas Brox | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## 3D Vision (Oral) | 
    
    |  |  | 
 | 
    
    |  |  | 1. Structured Indoor Modeling | 
    
    |  |  | - Satoshi Ikehata, Hang Yang, Yasutaka Furukawa | 
    
    |  |  | 
 | 
    
    |  |  | 2. 3D Time-Lapse Reconstruction From Internet Photos | 
    
    |  |  | - Ricardo Martin-Brualla, David Gallup, Steven M. Seitz | 
    
    |  |  | 
 | 
    
    |  |  | 3. Global, Dense Multiscale Reconstruction for a Billion Points | 
    
    |  |  | - Benjamin Ummenhofer, Thomas Brox | 
    
    |  |  | 
 | 
    
    |  |  | 4. On the Visibility of Point Clouds | 
    
    |  |  | - Sagi Katz, Ayellet Tal | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Segmentation, Edges and Saliency | 
    
    |  |  | 
 | 
    
    |  |  | 2. Piecewise Flat Embedding for Image Segmentation | 
    
    |  |  | - Yizhou Yu, Chaowei Fang, Zicheng Liao | 
    
    |  |  | 
 | 
    
    |  |  | 3. Semantic Image Segmentation via Deep Parsing Network | 
    
    |  |  | - Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen-Change Loy, Xiaoou Tang | 
    
    |  |  | 
 | 
    
    |  |  | 4. Human Parsing With Contextualized Convolutional Neural Network | 
    
    |  |  | - Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan | 
    
    |  |  | 
 | 
    
    |  |  | 5. Holistically-Nested Edge Detection | 
    
    |  |  | - Saining Xie, Zhuowen Tu | 
    
    |  |  | 
 | 
    
    |  |  | 6. Minimum Barrier Salient Object Detection at 80 FPS | 
    
    |  |  | - Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian Price, Radomír Měch | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Learning Representations & Attributes | 
    
    |  |  | 
 | 
    
    |  |  | 1. Learning Image Representations Tied to Ego-Motion | 
    
    |  |  | - Dinesh Jayaraman, Kristen Grauman | 
    
    |  |  | 
 | 
    
    |  |  | 2. Unsupervised Visual Representation Learning by Context Prediction | 
    
    |  |  | - Carl Doersch, Abhinav Gupta, Alexei A. Efros | 
    
    |  |  | 
 | 
    
    |  |  | 3. Webly Supervised Learning of Convolutional Networks | 
    
    |  |  | - Xinlei Chen, Abhinav Gupta | 
    
    |  |  | 
 | 
    
    |  |  | 4. Fast R-CNN | 
    
    |  |  | - Ross Girshick | 
    
    |  |  | 
 | 
    
    |  |  | 5. Bilinear CNN Models for Fine-Grained Visual Recognition | 
    
    |  |  | - Tsung-Yu Lin, Aruni RoyChowdhury, Subhransu Maji | 
    
    |  |  | 
 | 
    
    |  |  | 6. Discovering the Spatial Extent of Relative Attributes | 
    
    |  |  | - Fanyi Xiao, Yong Jae Lee | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Statistical Methods & Learning | 
    
    |  |  | 
 | 
    
    |  |  | 1. Deep Neural Decision Forests | 
    
    |  |  | - Peter Kontschieder, Madalina Fiterau, Antonio Criminisi, Samuel Rota Bulò | 
    
    |  |  | 
 | 
    
    |  |  | 2. Deep Fried Convnets | 
    
    |  |  | - Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alex Smola, Le Song, Ziyu Wang | 
    
    |  |  | 
 | 
    
    |  |  | 3. Semantic Component Analysis | 
    
    |  |  | - Calvin Murdock, Fernando De la Torre | 
    
    |  |  | 
 | 
    
    |  |  | 6. Learning Discriminative Reconstructions for Unsupervised Outlier Removal | 
    
    |  |  | - Yan Xia, Xudong Cao, Fang Wen, Gang Hua, Jian Sun | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Optimization, Segmentation, and Recognition | 
    
    |  |  | 
 | 
    
    |  |  | 1. Learning Deconvolution Network for Semantic Segmentation | 
    
    |  |  | - Hyeonwoo Noh, Seunghoon Hong, Bohyung Han | 
    
    |  |  | 
 | 
    
    |  |  | 2. Conditional Random Fields as Recurrent Neural Networks | 
    
    |  |  | - Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr | 
    
    |  |  | 
 | 
    
    |  |  | 4. Boosting Object Proposals: From Pascal to COCO | 
    
    |  |  | - Jordi Pont-Tuset, Luc Van Gool | 
    
    |  |  | 
 | 
    
    |  |  | 7. Joint Object and Part Segmentation Using Deep Learned Potentials | 
    
    |  |  | - Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan L. Yuille | 
    
    |  |  | 
 | 
    
    |  |  | 9. BodyPrint: Pose Invariant 3D Shape Matching of Human Bodies | 
    
    |  |  | - Jiangping Wang, Kai Ma, Vivek Kumar Singh, Thomas Huang, Terrence Chen | 
    
    |  |  | 
 | 
    
    |  |  | 11. Contour Guided Hierarchical Model for Shape Matching | 
    
    |  |  | - Yuanqi Su, Yuehu Liu, Bonan Cuan, Nanning Zheng | 
    
    |  |  | 
 | 
    
    |  |  | 12. Robust Image Segmentation Using Contour-Guided Color Palettes | 
    
    |  |  | - Xiang Fu, Chien-Yi Wang, Chen Chen, Changhu Wang, C.-C. Jay Kuo | 
    
    |  |  | 
 | 
    
    |  |  | 14. BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation | 
    
    |  |  | - Jifeng Dai, Kaiming He, Jian Sun | 
    
    |  |  | 
 | 
    
    |  |  | 15. Detection and Segmentation of 2D Curved Reflection Symmetric Structures | 
    
    |  |  | - Ching L. Teo, Cornelia Fermüller, Yiannis Aloimonos | 
    
    |  |  | 
 | 
    
    |  |  | 16. Unsupervised Tube Extraction Using Transductive Learning and Dense Trajectories | 
    
    |  |  | - Mihai Marian Puscas, Enver Sangineto, Dubravko Culibrk, Nicu Sebe | 
    
    |  |  | 
 | 
    
    |  |  | 17. Compositional Hierarchical Representation of Shape Manifolds for Classification of Non-Manifold Shapes | 
    
    |  |  | - Mete Ozay, Umit Rusen Aktas, Jeremy L. Wyatt, Aleš Leonardis | 
    
    |  |  | 
 | 
    
    |  |  | 19. Learning to Combine Mid-Level Cues for Object Proposal Generation | 
    
    |  |  | - Tom Lee, Sanja Fidler, Sven Dickinson | 
    
    |  |  | 
 | 
    
    |  |  | 20. Enhancing Road Maps by Parsing Aerial Images Around the World | 
    
    |  |  | - Gellért Máttyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun | 
    
    |  |  | 
 | 
    
    |  |  | 24. StereoSnakes: Contour Based Consistent Object Extraction For Stereo Images | 
    
    |  |  | - Ran Ju, Tongwei Ren, Gangshan Wu | 
    
    |  |  | 
 | 
    
    |  |  | 25. Semantic Segmentation of RGBD Images With Mutex Constraints | 
    
    |  |  | - Zhuo Deng, Sinisa Todorovic, Longin Jan Latecki | 
    
    |  |  | 
 | 
    
    |  |  | 26. Weakly- and Semi-Supervised Learning of a Deep Convolutional Network for Semantic Image Segmentation | 
    
    |  |  | - George Papandreou, Liang-Chieh Chen, Kevin P. Murphy, Alan L. Yuille | 
    
    |  |  | 
 | 
    
    |  |  | 28. Parsimonious Labeling | 
    
    |  |  | - Puneet K. Dokania, M. Pawan Kumar | 
    
    |  |  | 
 | 
    
    |  |  | 32. Constrained Convolutional Neural Networks for Weakly Supervised Segmentation | 
    
    |  |  | - Deepak Pathak, Philipp Krähenbühl, Trevor Darrell | 
    
    |  |  | 
 | 
    
    |  |  | 35. Convolutional Sparse Coding for Image Super-Resolution | 
    
    |  |  | - Shuhang Gu, Wangmeng Zuo, Qi Xie, Deyu Meng, Xiangchu Feng, Lei Zhang | 
    
    |  |  | 
 | 
    
    |  |  | 40. Depth-Based Hand Pose Estimation: Data, Methods, and Challenges | 
    
    |  |  | - James S. Supančič III, Grégory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan | 
    
    |  |  | 
 | 
    
    |  |  | 43. Learning Deep Representation With Large-Scale Attributes | 
    
    |  |  | - Wanli Ouyang, Hongyang Li, Xingyu Zeng, Xiaogang Wang | 
    
    |  |  | 
 | 
    
    |  |  | 44. Deep Learning Strong Parts for Pedestrian Detection | 
    
    |  |  | - Yonglong Tian, Ping Luo, Xiaogang Wang, Xiaoou Tang | 
    
    |  |  | 
 | 
    
    |  |  | 45. Flowing ConvNets for Human Pose Estimation in Videos | 
    
    |  |  | - Tomas Pfister, James Charles, Andrew Zisserman | 
    
    |  |  | 
 | 
    
    |  |  | 47. BubbLeNet: Foveated Imaging for Visual Discovery | 
    
    |  |  | - Kevin Matzen, Noah Snavely | 
    
    |  |  | 
 | 
    
    |  |  | 49. Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions | 
    
    |  |  | - Sven Bambach, Stefan Lee, David J. Crandall, Chen Yu | 
    
    |  |  | 
 | 
    
    |  |  | 53. Relaxing From Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging | 
    
    |  |  | - Jianlong Fu, Yue Wu, Tao Mei, Jinqiao Wang, Hanqing Lu, Yong Rui | 
    
    |  |  | 
 | 
    
    |  |  | 54. Visual Phrases for Exemplar Face Detection | 
    
    |  |  | - Vijay Kumar, Anoop Namboodiri, C. V. Jawahar | 
    
    |  |  | 
 | 
    
    |  |  | 55. Spatial Semantic Regularisation for Large Scale Object Detection | 
    
    |  |  | - Damian Mrowca, Marcus Rohrbach, Judy Hoffman, Ronghang Hu, Kate Saenko, Trevor Darrell | 
    
    |  |  | 
 | 
    
    |  |  | 56. Human Pose Estimation in Videos | 
    
    |  |  | - Dong Zhang, Mubarak Shah | 
    
    |  |  | 
 | 
    
    |  |  | 57. Contour Box: Rejecting Object Proposals Without Explicit Closed Contours | 
    
    |  |  | - Cewu Lu, Shu Liu, Jiaya Jia, Chi-Keung Tang | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Recognition and 3D CV | 
    
    |  |  | 
 | 
    
    |  |  | 2. Joint Camera Clustering and Surface Segmentation for Large-Scale Multi-View Stereo | 
    
    |  |  | - Runze Zhang, Shiwei Li, Tian Fang, Siyu Zhu, Long Quan | 
    
    |  |  | 
 | 
    
    |  |  | 4. Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition | 
    
    |  |  | - Torsten Sattler, Michal Havlena, Filip Radenović, Konrad Schindler, Marc Pollefeys | 
    
    |  |  | 
 | 
    
    |  |  | 5. Globally Optimal 2D-3D Registration From Points or Lines Without Correspondences | 
    
    |  |  | - Mark Brown, David Windridge, Jean-Yves Guillemaut | 
    
    |  |  | 
 | 
    
    |  |  | 10. Semantically-Aware Aerial Reconstruction From Multi-Modal Data | 
    
    |  |  | - Randi Cabezas, Julian Straub, John W. Fisher III | 
    
    |  |  | 
 | 
    
    |  |  | 15. Exploiting Object Similarity in 3D Reconstruction | 
    
    |  |  | - Chen Zhou, Fatma Güney, Yizhou Wang, Andreas Geiger | 
    
    |  |  | 
 | 
    
    |  |  | 16. You Are Here: Mimicking the Human Thinking Process in Reading Floor-Plans | 
    
    |  |  | - Hang Chu, Dong Ki Kim, Tsuhan Chen | 
    
    |  |  | 
 | 
    
    |  |  | 24. The Likelihood-Ratio Test and Efficient Robust Estimation | 
    
    |  |  | - Andrea Cohen, Christopher Zach | 
    
    |  |  | 
 | 
    
    |  |  | 35. Real-Time Pose Estimation Piggybacked on Object Detection | 
    
    |  |  | - Roman Juránek, Adam Herout, Markéta Dubská, Pavel Zemčík | 
    
    |  |  | 
 | 
    
    |  |  | 36. Understanding and Predicting Image Memorability at a Large Scale | 
    
    |  |  | - Aditya Khosla, Akhil S. Raju, Antonio Torralba, Aude Oliva | 
    
    |  |  | 
 | 
    
    |  |  | 37. Multiple Granularity Descriptors for Fine-Grained Categorization | 
    
    |  |  | - Dequan Wang, Zhiqiang Shen, Jie Shao, Wei Zhang, Xiangyang Xue, Zheng Zhang | 
    
    |  |  | 
 | 
    
    |  |  | 38. Guiding the Long-Short Term Memory Model for Image Caption Generation | 
    
    |  |  | - Xu Jia, Efstratios Gavves, Basura Fernando, Tinne Tuytelaars | 
    
    |  |  | 
 | 
    
    |  |  | 39. Just Noticeable Differences in Visual Attributes | 
    
    |  |  | - Aron Yu, Kristen Grauman | 
    
    |  |  | 
 | 
    
    |  |  | 40. VQA: Visual Question Answering | 
    
    |  |  | - Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh | 
    
    |  |  | 
 | 
    
    |  |  | 41. Localize Me Anywhere, Anytime: A Multi-Task Point-Retrieval Approach | 
    
    |  |  | - Guoyu Lu, Yan Yan, Li Ren, Jingkuan Song, Nicu Sebe, Chandra Kambhamettu | 
    
    |  |  | 
 | 
    
    |  |  | 42. Dense Optical Flow Prediction From a Static Image | 
    
    |  |  | - Jacob Walker, Abhinav Gupta, Martial Hebert | 
    
    |  |  | 
 | 
    
    |  |  | 44. Visual Madlibs: Fill in the Blank Description Generation and Question Answering | 
    
    |  |  | - Licheng Yu, Eunbyung Park, Alexander C. Berg, Tamara L. Berg | 
    
    |  |  | 
 | 
    
    |  |  | 45. Actions and Attributes From Wholes and Parts | 
    
    |  |  | - Georgia Gkioxari, Ross Girshick, Jitendra Malik | 
    
    |  |  | 
 | 
    
    |  |  | 46. DeepBox: Learning Objectness With Convolutional Networks | 
    
    |  |  | - Weicheng Kuo, Bharath Hariharan, Jitendra Malik | 
    
    |  |  | 
 | 
    
    |  |  | 47. Active Object Localization With Deep Reinforcement Learning | 
    
    |  |  | - Juan C. Caicedo, Svetlana Lazebnik | 
    
    |  |  | 
 | 
    
    |  |  | 48. Scene-Domain Active Part Models for Object Representation | 
    
    |  |  | - Zhou Ren, Chaohui Wang, Alan L. Yuille | 
    
    |  |  | 
 | 
    
    |  |  | 49. A Unified Multiplicative Framework for Attribute Learning | 
    
    |  |  | - Kongming Liang, Hong Chang, Shiguang Shan, Xilin Chen | 
    
    |  |  | 
 | 
    
    |  |  | 50. Contractive Rectifier Networks for Nonlinear Maximum Margin Classification | 
    
    |  |  | - Senjian An, Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel | 
    
    |  |  | 
 | 
    
    |  |  | 51. Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization | 
    
    |  |  | - Zhe Xu, Shaoli Huang, Ya Zhang, Dacheng Tao | 
    
    |  |  | 
 | 
    
    |  |  | 52. Learning Like a Child: Fast Novel Visual Concept Learning From Sentence Descriptions of Images | 
    
    |  |  | - Junhua Mao, Xu Wei, Yi Yang, Jiang Wang, Zhiheng Huang, Alan L. Yuille | 
    
    |  |  | 
 | 
    
    |  |  | 53. Learning Common Sense Through Visual Abstraction | 
    
    |  |  | - Ramakrishna Vedantam, Xiao Lin, Tanmay Batra, C. Lawrence Zitnick, Devi Parikh | 
    
    |  |  | 
 | 
    
    |  |  | 54. Domain Generalization for Object Recognition With Multi-Task Autoencoders | 
    
    |  |  | - Muhammad Ghifary, W. Bastiaan Kleijn, Mengjie Zhang, David Balduzzi | 
    
    |  |  | 
 | 
    
    |  |  | 55. Square Localization for Efficient and Accurate Object Detection | 
    
    |  |  | - Cewu Lu, Yongyi Lu, Hao Chen, Chi-Keung Tang | 
    
    |  |  | 
 | 
    
    |  |  | 56. Box Aggregation for Proposal Decimation: Last Mile of Object Detection | 
    
    |  |  | - Shu Liu, Cewu Lu, Jiaya Jia | 
    
    |  |  | 
 | 
    
    |  |  | 57. DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers | 
    
    |  |  | - Amir Ghodrati, Ali Diba, Marco Pedersoli, Tinne Tuytelaars, Luc Van Gool | 
    
    |  |  | 
 | 
    
    |  |  | 58. Semantic Segmentation With Object Clique Potential | 
    
    |  |  | - Xiaojuan Qi, Jianping Shi, Shu Liu, Renjie Liao, Jiaya Jia | 
    
    |  |  | 
 | 
    
    |  |  | 59. Automatic Concept Discovery From Parallel Text and Visual Corpora | 
    
    |  |  | - Chen Sun, Chuang Gan, Ram Nevatia | 
    
    |  |  | 
 | 
    
    |  |  | 61. Monocular Object Instance Segmentation and Depth Ordering With CNNs | 
    
    |  |  | - Ziyu Zhang, Alexander G. Schwing, Sanja Fidler, Raquel Urtasun | 
    
    |  |  | 
 | 
    
    |  |  | 62. Multimodal Convolutional Neural Networks for Matching Image and Sentence | 
    
    |  |  | - Lin Ma, Zhengdong Lu, Lifeng Shang, Hang Li | 
    
    |  |  | 
 | 
    
    |  |  | 64. Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models | 
    
    |  |  | - Bryan A. Plummer, Liwei Wang, Chris M. Cervantes, Juan C. Caicedo, Julia Hockenmaier, Svetlana Lazebnik | 
    
    |  |  | 
 | 
    
    |  |  | 65. Predicting Depth, Surface Normals and Semantic Labels With a Common Multi-Scale Convolutional Architecture | 
    
    |  |  | - David Eigen, Rob Fergus | 
    
    |  |  | 
 | 
    
    |  |  | 66. AttentionNet: Aggregating Weak Directions for Accurate Object Detection | 
    
    |  |  | - Donggeun Yoo, Sunggyun Park, Joon-Young Lee, Anthony S. Paek, In So Kweon | 
    
    |  |  | 
 | 
    
    |  |  | 67. Common Subspace for Model and Similarity: Phrase Learning for Caption Generation From Images | 
    
    |  |  | - Yoshitaka Ushiku, Masataka Yamaguchi, Yusuke Mukuta, Tatsuya Harada | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Representations for Recognition & Localization | 
    
    |  |  | 
 | 
    
    |  |  | 1. 3D-Assisted Feature Synthesis for Novel Views of an Object | 
    
    |  |  | - Hao Su, Fan Wang, Eric Yi, Leonidas J. Guibas | 
    
    |  |  | 
 | 
    
    |  |  | 2. Render for CNN: Viewpoint Estimation in Images Using CNNs Trained With Rendered 3D Model Views | 
    
    |  |  | - Hao Su, Charles R. Qi, Yangyan Li, Leonidas J. Guibas | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Statistical Methods & Learning, Motion & Tracking, and Video Analysis | 
    
    |  |  | 
 | 
    
    |  |  | 2. DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving | 
    
    |  |  | - Chenyi Chen, Ari Seff, Alain Kornhauser, Jianxiong Xiao | 
    
    |  |  | 
 | 
    
    |  |  | 3. Active Transfer Learning With Zero-Shot Priors: Reusing Past Datasets for Future Tasks | 
    
    |  |  | - Efstratios Gavves, Thomas Mensink, Tatiana Tommasi, Cees G. M. Snoek, Tinne Tuytelaars | 
    
    |  |  | 
 | 
    
    |  |  | 4. HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition | 
    
    |  |  | - Zhicheng Yan, Hao Zhang, Robinson Piramuthu, Vignesh Jagadeesh, Dennis DeCoste, Wei Di, Yizhou Yu | 
    
    |  |  | 
 | 
    
    |  |  | 5. Learning the Structure of Deep Convolutional Networks | 
    
    |  |  | - Jiashi Feng, Trevor Darrell | 
    
    |  |  | 
 | 
    
    |  |  | 6. FlowNet: Learning Optical Flow With Convolutional Networks | 
    
    |  |  | - Alexey Dosovitskiy, Philipp Fischer, Eddy Ilg, Philip Häusser, Caner Hazırbaş, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, Thomas Brox | 
    
    |  |  | 
 | 
    
    |  |  | 10. Unsupervised Learning of Visual Representations Using Videos | 
    
    |  |  | - Xiaolong Wang, Abhinav Gupta | 
    
    |  |  | 
 | 
    
    |  |  | 11. A Nonparametric Bayesian Approach Toward Stacked Convolutional Independent Component Analysis | 
    
    |  |  | - Sotirios P. Chatzis, Dimitrios Kosmopoulos | 
    
    |  |  | 
 | 
    
    |  |  | 14. Robust Optimization for Deep Regression | 
    
    |  |  | - Vasileios Belagiannis, Christian Rupprecht, Gustavo Carneiro, Nassir Navab | 
    
    |  |  | 
 | 
    
    |  |  | 16. Maximum-Margin Structured Learning With Deep Networks for 3D Human Pose Estimation | 
    
    |  |  | - Sijin Li, Weichen Zhang, Antoni B. Chan | 
    
    |  |  | 
 | 
    
    |  |  | 17. An Exploration of Parameter Redundancy in Deep Networks With Circulant Projections | 
    
    |  |  | - Yu Cheng, Felix X. Yu, Rogerio S. Feris, Sanjiv Kumar, Alok Choudhary, Shi-Fu Chang | 
    
    |  |  | 
 | 
    
    |  |  | 19. Understanding Deep Features With Computer-Generated Imagery | 
    
    |  |  | - Mathieu Aubry, Bryan C. Russell | 
    
    |  |  | 
 | 
    
    |  |  | 21. Context-Aware CNNs for Person Head Detection | 
    
    |  |  | - Tuan-Hung Vu, Anton Osokin, Ivan Laptev | 
    
    |  |  | 
 | 
    
    |  |  | 23. Highly-Expressive Spaces of Well-Behaved Transformations: Keeping It Simple | 
    
    |  |  | - Oren Freifeld, Søren Hauberg, Kayhan Batmanghelich, John W. Fisher III | 
    
    |  |  | 
 | 
    
    |  |  | 26. PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization | 
    
    |  |  | - Alex Kendall, Matthew Grimes, Roberto Cipolla | 
    
    |  |  | 
 | 
    
    |  |  | 27. Predicting Multiple Structured Visual Interpretations | 
    
    |  |  | - Debadeepta Dey, Varun Ramakrishna, Martial Hebert, J. Andrew Bagnell | 
    
    |  |  | 
 | 
    
    |  |  | 28. Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks | 
    
    |  |  | - Chunshui Cao, Xianming Liu, Yi Yang, Yinan Yu, Jiang Wang, Zilei Wang, Yongzhen Huang, Liang Wang, Chang Huang, Wei Xu, Deva Ramanan, Thomas S. Huang | 
    
    |  |  | 
 | 
    
    |  |  | 29. Matrix Backpropagation for Deep Networks With Structured Layers | 
    
    |  |  | - Catalin Ionescu, Orestis Vantzos, Cristian Sminchisescu | 
    
    |  |  | 
 | 
    
    |  |  | 31. Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition | 
    
    |  |  | - Heechul Jung, Sihaeng Lee, Junho Yim, Sunjeong Park, Junmo Kim | 
    
    |  |  | 
 | 
    
    |  |  | 32. Direct Intrinsics: Learning Albedo-Shading Decomposition by Convolutional Regression | 
    
    |  |  | - Takuya Narihira, Michael Maire, Stella X. Yu | 
    
    |  |  | 
 | 
    
    |  |  | 33. Face Flow | 
    
    |  |  | - Patrick Snape, Anastasios Roussos, Yannis Panagakis, Stefanos Zafeiriou | 
    
    |  |  | 
 | 
    
    |  |  | 42. Hierarchical Convolutional Features for Visual Tracking | 
    
    |  |  | - Chao Ma, Jia-Bin Huang, Xiaokang Yang, Ming-Hsuan Yang | 
    
    |  |  | 
 | 
    
    |  |  | 44. Online Object Tracking With Proposal Selection | 
    
    |  |  | - Yang Hua, Karteek Alahari, Cordelia Schmid | 
    
    |  |  | 
 | 
    
    |  |  | 45. Understanding and Diagnosing Visual Tracking Systems | 
    
    |  |  | - Naiyan Wang, Jianping Shi, Dit-Yan Yeung, Jiaya Jia | 
    
    |  |  | 
 | 
    
    |  |  | 47. Visual Tracking With Fully Convolutional Networks | 
    
    |  |  | - Lijun Wang, Wanli Ouyang, Xiaogang Wang, Huchuan Lu | 
    
    |  |  | 
 | 
    
    |  |  | 48. Multiple Feature Fusion via Weighted Entropy for Visual Tracking | 
    
    |  |  | - Lin Ma, Jiwen Lu, Jianjiang Feng, Jie Zhou | 
    
    |  |  | 
 | 
    
    |  |  | 49. Pedestrian Travel Time Estimation in Crowded Scenes | 
    
    |  |  | - Shuai Yi, Hongsheng Li, Xiaogang Wang | 
    
    |  |  | 
 | 
    
    |  |  | 52. Learning to Track for Spatio-Temporal Action Localization | 
    
    |  |  | - Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid | 
    
    |  |  | 
 | 
    
    |  |  | 53. Unsupervised Object Discovery and Tracking in Video Collections | 
    
    |  |  | - Suha Kwak, Minsu Cho, Ivan Laptev, Jean Ponce, Cordelia Schmid | 
    
    |  |  | 
 | 
    
    |  |  | 54. Car That Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models | 
    
    |  |  | - Ashesh Jain, Hema S. Koppula, Bharad Raghavan, Shane Soh, Ashutosh Saxena | 
    
    |  |  | 
 | 
    
    |  |  | 58. P-CNN: Pose-Based CNN Features for Action Recognition | 
    
    |  |  | - Guilhem Chéron, Ivan Laptev, Cordelia Schmid | 
    
    |  |  | 
 | 
    
    |  |  | 59. Fully Connected Object Proposals for Video Segmentation | 
    
    |  |  | - Federico Perazzi, Oliver Wang, Markus Gross, Alexander Sorkine-Hornung | 
    
    |  |  | 
 | 
    
    |  |  | 60. Video Segmentation With Just a Few Strokes | 
    
    |  |  | - Naveen Shankar Nagaraja, Frank R. Schmidt, Thomas Brox | 
    
    |  |  | 
 | 
    
    |  |  | 61. Actionness-Assisted Recognition of Actions | 
    
    |  |  | - Ye Luo, Loong-Fah Cheong, An Tran | 
    
    |  |  | 
 | 
    
    |  |  | 66. RGB-W: When Vision Meets Wireless | 
    
    |  |  | - Alexandre Alahi, Albert Haque, Li Fei-Fei | 
    
    |  |  | 
 | 
    
    |  |  | 68. Simultaneous Foreground Detection and Classification With Hybrid Features | 
    
    |  |  | - Jaemyun Kim, Adín Ramírez Rivera, Byungyong Ryu, Oksam Chae | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Vision & People | 
    
    |  |  | 
 | 
    
    |  |  | 1. Training a Feedback Loop for Hand Pose Estimation | 
    
    |  |  | - Markus Oberweger, Paul Wohlhart, Vincent Lepetit | 
    
    |  |  | 
 | 
    
    |  |  | 2. Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose | 
    
    |  |  | - Danhang Tang, Jonathan Taylor, Pushmeet Kohli, Cem Keskin, Tae-Kyun Kim, Jamie Shotton | 
    
    |  |  | 
 | 
    
    |  |  | 4. Where to Buy It: Matching Street Clothing Photos in Online Shops | 
    
    |  |  | - M. Hadi Kiapour, Xufeng Han, Svetlana Lazebnik, Alexander C. Berg, Tamara L. Berg | 
    
    |  |  | 
 | 
    
    |  |  | 5. Multi-Task Recurrent Neural Network for Immediacy Prediction | 
    
    |  |  | - Xiao Chu, Wanli Ouyang, Wei Yang, Xiaogang Wang | 
    
    |  |  | 
 | 
    
    |  |  | 6. Learning Complexity-Aware Cascades for Deep Pedestrian Detection | 
    
    |  |  | - Zhaowei Cai, Mohammad Saberian, Nuno Vasconcelos | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Computational Photography, Face & Gesture, and Vision for X | 
    
    |  |  | 
 | 
    
    |  |  | 4. TransCut: Transparent Object Segmentation From a Light-Field Image | 
    
    |  |  | - Yichao Xu, Hajime Nagahara, Atsushi Shimada, Rin-ichiro Taniguchi | 
    
    |  |  | 
 | 
    
    |  |  | 7. Learning Data-Driven Reflectance Priors for Intrinsic Image Decomposition | 
    
    |  |  | - Tinghui Zhou, Philipp Krähenbühl, Alexei A. Efros | 
    
    |  |  | 
 | 
    
    |  |  | 12. Intrinsic Depth: Improving Depth Transfer With Intrinsic Images | 
    
    |  |  | - Naejin Kong, Michael J. Black | 
    
    |  |  | 
 | 
    
    |  |  | 23. Selective Encoding for Recognizing Unreliably Localized Faces | 
    
    |  |  | - Ang Li, Vlad Morariu, Larry S. Davis | 
    
    |  |  | 
 | 
    
    |  |  | 24. Confidence Preserving Machine for Facial Action Unit Detection | 
    
    |  |  | - Jiabei Zeng, Wen-Sheng Chu, Fernando De la Torre, Jeffrey F. Cohn, Zhang Xiong | 
    
    |  |  | 
 | 
    
    |  |  | 25. Learning Social Relation Traits From Face Images | 
    
    |  |  | - Zhanpeng Zhang, Ping Luo, Chen-Change Loy, Xiaoou Tang | 
    
    |  |  | 
 | 
    
    |  |  | 26. Robust Heart Rate Measurement From Video Using Select Random Patches | 
    
    |  |  | - Antony Lam, Yoshinori Kuno | 
    
    |  |  | 
 | 
    
    |  |  | 28. Robust Facial Landmark Detection Under Significant Head Poses and Occlusion | 
    
    |  |  | - Yue Wu, Qiang Ji | 
    
    |  |  | 
 | 
    
    |  |  | 29. Conditional Convolutional Neural Network for Modality-Aware Face Recognition | 
    
    |  |  | - Chao Xiong, Xiaowei Zhao, Danhang Tang, Karlekar Jayashree, Shuicheng Yan, Tae-Kyun Kim | 
    
    |  |  | 
 | 
    
    |  |  | 30. From Facial Parts Responses to Face Detection: A Deep Learning Approach | 
    
    |  |  | - Shuo Yang, Ping Luo, Chen-Change Loy, Xiaoou Tang | 
    
    |  |  | 
 | 
    
    |  |  | 32. Pose-Invariant 3D Face Alignment | 
    
    |  |  | - Amin Jourabloo, Xiaoming Liu | 
    
    |  |  | 
 | 
    
    |  |  | 33. From Emotions to Action Units With Hidden and Semi-Hidden-Task Learning | 
    
    |  |  | - Adrià Ruiz, Joost Van de Weijer, Xavier Binefa | 
    
    |  |  | 
 | 
    
    |  |  | 36. Deep Learning Face Attributes in the Wild | 
    
    |  |  | - Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang | 
    
    |  |  | 
 | 
    
    |  |  | 37. Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification | 
    
    |  |  | - Chi Su, Fan Yang, Shiliang Zhang, Qi Tian, Larry S. Davis, Wen Gao | 
    
    |  |  | 
 | 
    
    |  |  | 38. Regressing a 3D Face Shape From a Single Image | 
    
    |  |  | - Sergey Tulyakov, Nicu Sebe | 
    
    |  |  | 
 | 
    
    |  |  | 45. A Spatio-Temporal Appearance Representation for Video-Based Pedestrian Re-Identification | 
    
    |  |  | - Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang | 
    
    |  |  | 
 | 
    
    |  |  | 48. Discriminative Pose-Free Descriptors for Face and Object Matching | 
    
    |  |  | - Soubhik Sanyal, Sivaram Prasad Mudunuri, Soma Biswas | 
    
    |  |  | 
 | 
    
    |  |  | 49. Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation | 
    
    |  |  | - Meina Kan, Shiguang Shan, Xilin Chen | 
    
    |  |  | 
 | 
    
    |  |  | 51. Person Recognition in Personal Photo Collections | 
    
    |  |  | - Seong Joon Oh, Rodrigo Benenson, Mario Fritz, Bernt Schiele | 
    
    |  |  | 
 | 
    
    |  |  | 56. Learning to Predict Saliency on Face Images | 
    
    |  |  | - Mai Xu, Yun Ren, Zulin Wang | 
    
    |  |  | 
 | 
    
    |  |  | 57. Group Membership Prediction | 
    
    |  |  | - Ziming Zhang, Yuting Chen, Venkatesh Saligrama | 
    
    |  |  | 
 | 
    
    |  |  | 59. Robust RGB-D Odometry Using Point and Line Features | 
    
    |  |  | - Yan Lu, Dezhen Song | 
    
    |  |  | 
 | 
    
    |  |  | 60. Learning a Discriminative Model for the Perception of Realism in Composite Images | 
    
    |  |  | - Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, Alexei A. Efros | 
    
    |  |  | 
 | 
    
    |  |  | 61. What Makes Tom Hanks Look Like Tom Hanks | 
    
    |  |  | - Supasorn Suwajanakorn, Steven M. Seitz, Ira Kemelmacher-Shlizerman | 
    
    |  |  | 
 | 
    
    |  |  | 63. Personalized Age Progression With Aging Dictionary | 
    
    |  |  | - Xiangbo Shu, Jinhui Tang, Hanjiang Lai, Luoqi Liu, Shuicheng Yan | 
    
    |  |  | 
 | 
    
    |  |  | 64. FaceDirector: Continuous Control of Facial Performance in Video | 
    
    |  |  | - Charles Malleson, Jean-Charles Bazin, Oliver Wang, Derek Bradley, Thabo Beeler, Adrian Hilton, Alexander Sorkine-Hornung | 
    
    |  |  | 
 | 
    
    |  |  | 65. Synthesizing Illumination Mosaics From Internet Photo-Collections | 
    
    |  |  | - Dinghuang Ji, Enrique Dunn, Jan-Michael Frahm | 
    
    |  |  | 
 | 
    
    |  |  | 66. Hot or Not: Exploring Correlations Between Appearance and Temperature | 
    
    |  |  | - Daniel Glasner, Pascal Fua, Todd Zickler, Lihi Zelnik-Manor | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Motion & Correspondence | 
    
    |  |  | 
 | 
    
    |  |  | 3. Dense Semantic Correspondence Where Every Pixel is a Classifier | 
    
    |  |  | - Hilton Bristow, Jack Valmadre, Simon Lucey | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Statistical Methods & Learning, Motion & Tracking, and Video Analysis II | 
    
    |  |  | 
 | 
    
    |  |  | 1. Differential Recurrent Neural Networks for Action Recognition | 
    
    |  |  | - Vivek Veeriah, Naifan Zhuang, Guo-Jun Qi | 
    
    |  |  | 
 | 
    
    |  |  | 4. Simultaneous Deep Transfer Across Domains and Tasks | 
    
    |  |  | - Eric Tzeng, Judy Hoffman, Trevor Darrell, Kate Saenko | 
    
    |  |  | 
 | 
    
    |  |  | 5. Low Dimensional Explicit Feature Maps | 
    
    |  |  | - Ondřej Chum | 
    
    |  |  | 
 | 
    
    |  |  | 6. Unsupervised Learning of Spatiotemporally Coherent Metrics | 
    
    |  |  | - Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann LeCun | 
    
    |  |  | 
 | 
    
    |  |  | 7. Multi-Label Cross-Modal Retrieval | 
    
    |  |  | - Viresh Ranjan, Nikhil Rasiwasia, C. V. Jawahar | 
    
    |  |  | 
 | 
    
    |  |  | 10. Unsupervised Domain Adaptation With Imbalanced Cross-Domain Data | 
    
    |  |  | - Tzu Ming Harry Hsu, Wei Yu Chen, Cheng-An Hou, Yao-Hung Hubert Tsai, Yi-Ren Yeh, Yu-Chiang Frank Wang | 
    
    |  |  | 
 | 
    
    |  |  | 12. Geometry-Aware Deep Transform | 
    
    |  |  | - Jiaji Huang, Qiang Qiu, Robert Calderbank, Guillermo Sapiro | 
    
    |  |  | 
 | 
    
    |  |  | 15. Zero-Shot Learning via Semantic Similarity Embedding | 
    
    |  |  | - Ziming Zhang, Venkatesh Saligrama | 
    
    |  |  | 
 | 
    
    |  |  | 18. Multi-View Domain Generalization for Visual Recognition | 
    
    |  |  | - Li Niu, Wen Li, Dong Xu | 
    
    |  |  | 
 | 
    
    |  |  | 19. Infinite Feature Selection | 
    
    |  |  | - Giorgio Roffo, Simone Melzi, Marco Cristani | 
    
    |  |  | 
 | 
    
    |  |  | 20. Semi-Supervised Zero-Shot Classification With Label Representation Learning | 
    
    |  |  | - Xin Li, Yuhong Guo, Dale Schuurmans | 
    
    |  |  | 
 | 
    
    |  |  | 24. Predicting Deep Zero-Shot Convolutional Neural Networks Using Textual Descriptions | 
    
    |  |  | - Jimmy Lei Ba, Kevin Swersky, Sanja Fidler, Ruslan Salakhutdinov | 
    
    |  |  | 
 | 
    
    |  |  | 25. Structured Feature Selection | 
    
    |  |  | - Tian Gao, Ziheng Wang, Qiang Ji | 
    
    |  |  | 
 | 
    
    |  |  | 26. Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning | 
    
    |  |  | - Yan Huang, Wei Wang, Liang Wang | 
    
    |  |  | 
 | 
    
    |  |  | 27. Learning Image and User Features for Recommendation in Social Networks | 
    
    |  |  | - Xue Geng, Hanwang Zhang, Jingwen Bian, Tat-Seng Chua | 
    
    |  |  | 
 | 
    
    |  |  | 28. Dual-Feature Warping-Based Motion Model Estimation | 
    
    |  |  | - Shiwei Li, Lu Yuan, Jian Sun, Long Quan | 
    
    |  |  | 
 | 
    
    |  |  | 29. An Adaptive Data Representation for Robust Point-Set Registration and Merging | 
    
    |  |  | - Dylan Campbell, Lars Petersson | 
    
    |  |  | 
 | 
    
    |  |  | 31. Learning Spatially Regularized Correlation Filters for Visual Tracking | 
    
    |  |  | - Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg | 
    
    |  |  | 
 | 
    
    |  |  | 32. SpeDo: 6 DOF Ego-Motion Sensor Using Speckle Defocus Imaging | 
    
    |  |  | - Kensei Jo, Mohit Gupta, Shree K. Nayar | 
    
    |  |  | 
 | 
    
    |  |  | 35. Recurrent Network Models for Human Dynamics | 
    
    |  |  | - Katerina Fragkiadaki, Sergey Levine, Panna Felsen, Jitendra Malik | 
    
    |  |  | 
 | 
    
    |  |  | 36. Contour Flow: Middle-Level Motion Estimation by Combining Motion Segmentation and Contour Alignment | 
    
    |  |  | - Huijun Di, Qingxuan Shi, Feng Lv, Ming Qin, Yao Lu | 
    
    |  |  | 
 | 
    
    |  |  | 39. Minimizing Human Effort in Interactive Tracking by Incremental Learning of Model Parameters | 
    
    |  |  | - Arridhana Ciptadi, James M. Rehg | 
    
    |  |  | 
 | 
    
    |  |  | 40. A Novel Representation of Parts for Accurate 3D Object Detection and Tracking in Monocular Images | 
    
    |  |  | - Alberto Crivellaro, Mahdi Rad, Yannick Verdie, Kwang Moo Yi, Pascal Fua, Vincent Lepetit | 
    
    |  |  | 
 | 
    
    |  |  | 41. Linearization to Nonlinear Learning for Visual Tracking | 
    
    |  |  | - Bo Ma, Hongwei Hu, Jianbing Shen, Yuping Zhang, Fatih Porikli | 
    
    |  |  | 
 | 
    
    |  |  | 42. Self-Occlusions and Disocclusions in Causal Video Object Segmentation | 
    
    |  |  | - Yanchao Yang, Ganesh Sundaramoorthi, Stefano Soatto | 
    
    |  |  | 
 | 
    
    |  |  | 43. Large Displacement 3D Scene Flow With Occlusion Reasoning | 
    
    |  |  | - Andrei Zanfir, Cristian Sminchisescu | 
    
    |  |  | 
 | 
    
    |  |  | 46. Category-Blind Human Action Recognition: A Practical Recognition System | 
    
    |  |  | - Wenbo Li, Longyin Wen, Mooi Choo Chuah, Siwei Lyu | 
    
    |  |  | 
 | 
    
    |  |  | 48. Weakly-Supervised Alignment of Video With Text | 
    
    |  |  | - Piotr Bojanowski, Rémi Lajugie, Edouard Grave, Francis Bach, Ivan Laptev, Jean Ponce, Cordelia Schmid | 
    
    |  |  | 
 | 
    
    |  |  | 49. Learning Temporal Embeddings for Complex Video Analysis | 
    
    |  |  | - Vignesh Ramanathan, Kevin Tang, Greg Mori, Li Fei-Fei | 
    
    |  |  | 
 | 
    
    |  |  | 50. Unsupervised Semantic Parsing of Video Collections | 
    
    |  |  | - Ozan Sener, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena | 
    
    |  |  | 
 | 
    
    |  |  | 51. Learning Spatiotemporal Features With 3D Convolutional Networks | 
    
    |  |  | - Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri | 
    
    |  |  | 
 | 
    
    |  |  | 52. Temporal Perception and Prediction in Ego-Centric Video | 
    
    |  |  | - Yipin Zhou, Tamara L. Berg | 
    
    |  |  | 
 | 
    
    |  |  | 53. Describing Videos by Exploiting Temporal Structure | 
    
    |  |  | - Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville | 
    
    |  |  | 
 | 
    
    |  |  | 55. Storyline Representation of Egocentric Videos With an Applications to Story-Based Search | 
    
    |  |  | - Bo Xiong, Gunhee Kim, Leonid Sigal | 
    
    |  |  | 
 | 
    
    |  |  | 56. Sequence to Sequence – Video to Text | 
    
    |  |  | - Subhashini Venugopalan, Marcus Rohrbach, Jeffrey Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko | 
    
    |  |  | 
 | 
    
    |  |  | 58. Action Recognition by Hierarchical Mid-Level Action Elements | 
    
    |  |  | - Tian Lan, Yuke Zhu, Amir Roshan Zamir, Silvio Savarese | 
    
    |  |  | 
 | 
    
    |  |  | 63. Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks | 
    
    |  |  | - Lin Sun, Kui Jia, Dit-Yan Yeung, Bertram E. Shi | 
    
    |  |  | 
 | 
    
    |  |  | 66. Love Thy Neighbors: Image Annotation by Exploiting Image Metadata | 
    
    |  |  | - Justin Johnson, Lamberto Ballan, Li Fei-Fei | 
    
    |  |  | 
 | 
    
    |  |  | 67. Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-Encoders | 
    
    |  |  | - Huan Yang, Baoyuan Wang, Stephen Lin, David Wipf, Minyi Guo, Baining Guo | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | 
 | 
    
    |  |  | ## Video -- Actions, Surveillance & Tracking | 
    
    |  |  | 
 | 
    
    |  |  | 1. Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations From Surveillance Videos | 
    
    |  |  | - Elisa Ricci, Jagannadan Varadarajan, Ramanathan Subramanian, Samuel Rota Bulò, Narendra Ahuja, Oswald Lanz | 
    
    |  |  | 
 | 
    
    |  |  | 2. Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off! | 
    
    |  |  | - Bilge Soran, Ali Farhadi, Linda Shapiro | 
    
    |  |  | 
 | 
    
    |  |  | 3. Partial Person Re-Identification | 
    
    |  |  | - Wei-Shi Zheng, Xiang Li, Tao Xiang, Shengcai Liao, Jianhuang Lai, Shaogang Gong | 
    
    |  |  | 
 | 
    
    |  |  | 5. Multiple Hypothesis Tracking Revisited | 
    
    |  |  | - Chanho Kim, Fuxin Li, Arridhana Ciptadi, James M. Rehg | 
    
    |  |  | 
 | 
    
    |  |  | 6. Learning to Track: Online Multi-Object Tracking by Decision Making | 
    
    |  |  | - Yu Xiang, Alexandre Alahi, Silvio Savarese | 
    
    |  |  | 
 | 
    
    |  |  | 
 |