|
|
|
|
## Vision & Language |
|
|
|
|
|
- Ask Your Neurons: A Neural-Based Approach to Answering Questions About Images |
|
|
- Mateusz Malinowski, Marcus Rohrbach, Mario Fritz |
|
|
|
|
|
- Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books |
|
|
- Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler |
|
|
|
|
|
- Learning Query and Image Similarities With Ranking Canonical Correlation Analysis |
|
|
- Ting Yao, Tao Mei, Chong-Wah Ngo |
|
|
|
|
|
## Recognition, Low-Level Vision, and Biomedical Image Analysis |
|
|
|
|
|
1. Learning to See by Moving |
|
|
- Pulkit Agrawal, Joao Carreira, Jitendra Malik |
|
|
- scene recognition, object recognition, visual odometry, keypoint matching -- representation (feature) learning |
|
|
|
|
|
2. Convolutional Channel Features |
|
|
- Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li |
|
|
- pedestrian detection, face detection, edge detection, object proposal generation -- representation learning |
|
|
|
|
|
3. Local Convolutional Features With Unsupervised Training for Image Retrieval |
|
|
- Mattis Paulin, Matthijs Douze, Zaid Harchaoui, Julien Mairal, Florent Perronnin, Cordelia Schmid |
|
|
- patch descriptor learning, image retrieval |
|
|
|
|
|
4. Discriminative Learning of Deep Convolutional Feature Point Descriptors |
|
|
- Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Pascal Fua, Francesc Moreno-Noguer |
|
|
- patch-level feature learning |
|
|
|
|
|
5. SALICON: Reducing the Semantic Gap in Saliency Prediction by Adapting Deep Neural Networks [[Paper]](http://www.xunhuang.org/research) |
|
|
- Xun Huang, Chengyao Shen, Xavier Boix, Qi Zhao |
|
|
- saliency detection |
|
|
|
|
|
6. Deep Networks for Image Super-Resolution With Sparse Prior |
|
|
- Zhaowen Wang, Ding Liu, Jianchao Yang, Wei Han, Thomas Huang |
|
|
- image super-resolution (SR) |
|
|
|
|
|
7. Learning Ordinal Relationships for Mid-Level Vision |
|
|
- Daniel Zoran, Phillip Isola, Dilip Krishnan, William T. Freeman |
|
|
- intrinsic image decomposition, depth from single image |
|
|
|
|
|
8. Deep Colorization |
|
|
- Zezhou Cheng, Qingxiong Yang, Bin Sheng |
|
|
- image colorization |
|
|
|
|
|
9. High-for-Low and Low-for-High: Efficient Boundary Detection From Deep Object Features and its Applications to High-Level Vision |
|
|
- Gedas Bertasius, Jianbo Shi, Lorenzo Torresani |
|
|
- **boundary detection**, semantic boundary labeling, semantic segmentation |
|
|
|
|
|
10. Video Super-Resolution via Deep Draft-Ensemble Learning |
|
|
- Renjie Liao, Xin Tao, Ruiyu Li, Ziyang Ma, Jiaya Jia |
|
|
- video super-resolution (SR) |
|
|
|
|
|
11. Compression Artifacts Reduction by a Deep Convolutional Network |
|
|
- Chao Dong, Yubin Deng, Chen Change Loy, Xiaoou Tang |
|
|
- JPEG artifact reduction |
|
|
|
|
|
|
|
|
|
|
|
## Recognition and 3D Computer Vision |
|
|
|
|
|
1. Semantic Pose Using Deep Networks Trained on Synthetic RGB-D |
|
|
- Jeremie Papon, Markus Schoeler |
|
|
- indoor scene understanding from RGB-D |
|
|
|
|
|
2. Learning Informative Edge Maps for Indoor Scene Layout Prediction |
|
|
- Arun Mallya, Svetlana Lazebnik |
|
|
- edge map prediction, indoor scene layout prediction |
|
|
|
|
|
3. Multi-View Convolutional Neural Networks for 3D Shape Recognition |
|
|
- Hang Su, Subhransu Maji, Evangelos Kalogerakis, Erik Learned-Miller |
|
|
- 3D shape classification and retrieval, 3D shape descriptor |
|
|
|
|
|
4. Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images |
|
|
- Alexander Krull, Eric Brachmann, Frank Michel, Michael Ying Yang, Stefan Gumhold, Carsten Rother |
|
|
- 6D pose estimation |
|
|
|
|
|
5. A Deep Visual Correspondence Embedding Model for Stereo Matching Costs [[KITTI-submission](http://www.cvlibs.net/datasets/kitti/eval_stereo_flow_detail.php?benchmark=stereo&error=2&eval=all&result=810169a667c1d8f712ce4c82969a5e9b8b4956c8)] |
|
|
- Zhuoyuan Chen, Xun Sun, Liang Wang, Yinan Yu, Chang Huang |
|
|
|
|
|
6. Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation |
|
|
- Xin Lu, Zhe Lin, Xiaohui Shen, Radomír Měch, James Z. Wang |
|
|
- image style recognition, aesthetic quality categorization, image quality estimation |
|
|
|
|
|
7. Improving Image Classification With Location Context |
|
|
- Kevin Tang, Manohar Paluri, Li Fei-Fei, Rob Fergus, Lubomir Bourdev |
|
|
- image (scene) classification |
|
|
|
|
|
8. HICO: A Benchmark for Recognizing Human-Object Interactions in Images |
|
|
- Yu-Wei Chao, Zhan Wang, Yugeng He, Jiaxuan Wang, Jia Deng |
|
|
- benchmark paper |
|
|
|
|
|
9. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification |
|
|
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun |
|
|
- **ImageNet Classification** |
|
|
|
|
|
10. Cross-Domain Image Retrieval With a Dual Attribute-Aware Ranking Network |
|
|
- Junshi Huang, Rogerio S. Feris, Qiang Chen, Shuicheng Yan |
|
|
- clothing detection/retrieval |
|
|
|
|
|
11. Contextual Action Recognition With R*CNN |
|
|
- Georgia Gkioxari, Ross Girshick, Jitendra Malik |
|
|
- action recognition |
|
|
|
|
|
46. What Makes an Object Memorable? |
|
|
- Rachit Dubey, Joshua Peterson, Aditya Khosla, Ming-Hsuan Yang, Bernard Ghanem |
|
|
|
|
|
49. Scalable Person Re-Identification: A Benchmark |
|
|
- Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, Qi Tian |
|
|
|
|
|
50. MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition |
|
|
- Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham |
|
|
|
|
|
51. Object Detection via a Multi-Region and Semantic Segmentation-Aware CNN Model |
|
|
- Spyros Gidaris, Nikos Komodakis |
|
|
|
|
|
52. Neural Activation Constellations: Unsupervised Part Model Discovery With Convolutional Networks |
|
|
- Marcel Simon, Erik Rodner |
|
|
|
|
|
53. Cascaded Sparse Spatial Bins for Efficient and Effective Generic Object Detection |
|
|
- David Novotny, Jiří Matas |
|
|
|
|
|
56. Task-Driven Feature Pooling for Image Classification |
|
|
- Guo-Sen Xie, Xu-Yao Zhang, Xiangbo Shu, Shuicheng Yan, Cheng-Lin Liu |
|
|
|
|
|
57. Cutting Edge: Soft Correspondences in Multimodal Scene Parsing |
|
|
- Sarah Taghavi Namin, Mohammad Najafi, Mathieu Salzmann, Lars Petersson |
|
|
|
|
|
58. One Shot Learning via Compositions of Meaningful Patches |
|
|
- Alex Wong, Alan L. Yuille |
|
|
|
|
|
59. FASText: Efficient Unconstrained Scene Text Detector |
|
|
- Michal Bušta, Lukáš Neumann, Jiří Matas |
|
|
|
|
|
60. Multi-Scale Recognition With DAG-CNNs |
|
|
- Songfan Yang, Deva Ramanan |
|
|
|
|
|
62. Im2Calories: Towards an Automated Mobile Vision Food Diary |
|
|
- Austin Meyers, Nick Johnston, Vivek Rathod, Anoop Korattikara, Alex Gorban, Nathan Silberman, Sergio Guadarrama, George Papandreou, Jonathan Huang, Kevin P. Murphy |
|
|
|
|
|
66. Aggregating Local Deep Features for Image Retrieval |
|
|
- Artem Babenko, Victor Lempitsky |
|
|
|
|
|
67. Learning Deep Object Detectors From 3D Models |
|
|
- Xingchao Peng, Baochen Sun, Karim Ali, Kate Saenko |
|
|
|
|
|
68. Harvesting Discriminative Meta Objects With Deep CNN Features for Scene Classification |
|
|
- Ruobing Wu, Baoyuan Wang, Wenping Wang, Yizhou Yu |
|
|
|
|
|
69. Scalable Nonlinear Embeddings for Semantic Category-Based Image Retrieval |
|
|
- Gaurav Sharma, Bernt Schiele |
|
|
|
|
|
71. Unsupervised Generation of a Viewpoint Annotated Car Dataset From Videos |
|
|
- Nima Sedaghat, Thomas Brox |
|
|
|
|
|
|
|
|
|
|
|
## 3D Vision (Oral) |
|
|
|
|
|
1. Structured Indoor Modeling |
|
|
- Satoshi Ikehata, Hang Yang, Yasutaka Furukawa |
|
|
|
|
|
2. 3D Time-Lapse Reconstruction From Internet Photos |
|
|
- Ricardo Martin-Brualla, David Gallup, Steven M. Seitz |
|
|
|
|
|
3. Global, Dense Multiscale Reconstruction for a Billion Points |
|
|
- Benjamin Ummenhofer, Thomas Brox |
|
|
|
|
|
4. On the Visibility of Point Clouds |
|
|
- Sagi Katz, Ayellet Tal |
|
|
|
|
|
|
|
|
## Segmentation, Edges and Saliency |
|
|
|
|
|
2. Piecewise Flat Embedding for Image Segmentation |
|
|
- Yizhou Yu, Chaowei Fang, Zicheng Liao |
|
|
|
|
|
3. Semantic Image Segmentation via Deep Parsing Network |
|
|
- Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen-Change Loy, Xiaoou Tang |
|
|
|
|
|
4. Human Parsing With Contextualized Convolutional Neural Network |
|
|
- Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan |
|
|
|
|
|
5. Holistically-Nested Edge Detection |
|
|
- Saining Xie, Zhuowen Tu |
|
|
|
|
|
6. Minimum Barrier Salient Object Detection at 80 FPS |
|
|
- Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian Price, Radomír Měch |
|
|
|
|
|
|
|
|
|
|
|
## Learning Representations & Attributes |
|
|
|
|
|
1. Learning Image Representations Tied to Ego-Motion |
|
|
- Dinesh Jayaraman, Kristen Grauman |
|
|
|
|
|
2. Unsupervised Visual Representation Learning by Context Prediction |
|
|
- Carl Doersch, Abhinav Gupta, Alexei A. Efros |
|
|
|
|
|
3. Webly Supervised Learning of Convolutional Networks |
|
|
- Xinlei Chen, Abhinav Gupta |
|
|
|
|
|
4. Fast R-CNN |
|
|
- Ross Girshick |
|
|
|
|
|
5. Bilinear CNN Models for Fine-Grained Visual Recognition |
|
|
- Tsung-Yu Lin, Aruni RoyChowdhury, Subhransu Maji |
|
|
|
|
|
6. Discovering the Spatial Extent of Relative Attributes |
|
|
- Fanyi Xiao, Yong Jae Lee |
|
|
|
|
|
|
|
|
## Statistical Methods & Learning |
|
|
|
|
|
1. Deep Neural Decision Forests |
|
|
- Peter Kontschieder, Madalina Fiterau, Antonio Criminisi, Samuel Rota Bulò |
|
|
|
|
|
2. Deep Fried Convnets |
|
|
- Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alex Smola, Le Song, Ziyu Wang |
|
|
|
|
|
3. Semantic Component Analysis |
|
|
- Calvin Murdock, Fernando De la Torre |
|
|
|
|
|
6. Learning Discriminative Reconstructions for Unsupervised Outlier Removal |
|
|
- Yan Xia, Xudong Cao, Fang Wen, Gang Hua, Jian Sun |
|
|
|
|
|
|
|
|
|
|
|
## Optimization, Segmentation, and Recognition |
|
|
|
|
|
1. Learning Deconvolution Network for Semantic Segmentation |
|
|
- Hyeonwoo Noh, Seunghoon Hong, Bohyung Han |
|
|
|
|
|
2. Conditional Random Fields as Recurrent Neural Networks |
|
|
- Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr |
|
|
|
|
|
4. Boosting Object Proposals: From Pascal to COCO |
|
|
- Jordi Pont-Tuset, Luc Van Gool |
|
|
|
|
|
7. Joint Object and Part Segmentation Using Deep Learned Potentials |
|
|
- Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan L. Yuille |
|
|
|
|
|
9. BodyPrint: Pose Invariant 3D Shape Matching of Human Bodies |
|
|
- Jiangping Wang, Kai Ma, Vivek Kumar Singh, Thomas Huang, Terrence Chen |
|
|
|
|
|
11. Contour Guided Hierarchical Model for Shape Matching |
|
|
- Yuanqi Su, Yuehu Liu, Bonan Cuan, Nanning Zheng |
|
|
|
|
|
12. Robust Image Segmentation Using Contour-Guided Color Palettes |
|
|
- Xiang Fu, Chien-Yi Wang, Chen Chen, Changhu Wang, C.-C. Jay Kuo |
|
|
|
|
|
14. BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation |
|
|
- Jifeng Dai, Kaiming He, Jian Sun |
|
|
|
|
|
15. Detection and Segmentation of 2D Curved Reflection Symmetric Structures |
|
|
- Ching L. Teo, Cornelia Fermüller, Yiannis Aloimonos |
|
|
|
|
|
16. Unsupervised Tube Extraction Using Transductive Learning and Dense Trajectories |
|
|
- Mihai Marian Puscas, Enver Sangineto, Dubravko Culibrk, Nicu Sebe |
|
|
|
|
|
17. Compositional Hierarchical Representation of Shape Manifolds for Classification of Non-Manifold Shapes |
|
|
- Mete Ozay, Umit Rusen Aktas, Jeremy L. Wyatt, Aleš Leonardis |
|
|
|
|
|
19. Learning to Combine Mid-Level Cues for Object Proposal Generation |
|
|
- Tom Lee, Sanja Fidler, Sven Dickinson |
|
|
|
|
|
20. Enhancing Road Maps by Parsing Aerial Images Around the World |
|
|
- Gellért Máttyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun |
|
|
|
|
|
24. StereoSnakes: Contour Based Consistent Object Extraction For Stereo Images |
|
|
- Ran Ju, Tongwei Ren, Gangshan Wu |
|
|
|
|
|
25. Semantic Segmentation of RGBD Images With Mutex Constraints |
|
|
- Zhuo Deng, Sinisa Todorovic, Longin Jan Latecki |
|
|
|
|
|
26. Weakly- and Semi-Supervised Learning of a Deep Convolutional Network for Semantic Image Segmentation |
|
|
- George Papandreou, Liang-Chieh Chen, Kevin P. Murphy, Alan L. Yuille |
|
|
|
|
|
28. Parsimonious Labeling |
|
|
- Puneet K. Dokania, M. Pawan Kumar |
|
|
|
|
|
32. Constrained Convolutional Neural Networks for Weakly Supervised Segmentation |
|
|
- Deepak Pathak, Philipp Krähenbühl, Trevor Darrell |
|
|
|
|
|
35. Convolutional Sparse Coding for Image Super-Resolution |
|
|
- Shuhang Gu, Wangmeng Zuo, Qi Xie, Deyu Meng, Xiangchu Feng, Lei Zhang |
|
|
|
|
|
40. Depth-Based Hand Pose Estimation: Data, Methods, and Challenges |
|
|
- James S. Supančič III, Grégory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan |
|
|
|
|
|
43. Learning Deep Representation With Large-Scale Attributes |
|
|
- Wanli Ouyang, Hongyang Li, Xingyu Zeng, Xiaogang Wang |
|
|
|
|
|
44. Deep Learning Strong Parts for Pedestrian Detection |
|
|
- Yonglong Tian, Ping Luo, Xiaogang Wang, Xiaoou Tang |
|
|
|
|
|
45. Flowing ConvNets for Human Pose Estimation in Videos |
|
|
- Tomas Pfister, James Charles, Andrew Zisserman |
|
|
|
|
|
47. BubbLeNet: Foveated Imaging for Visual Discovery |
|
|
- Kevin Matzen, Noah Snavely |
|
|
|
|
|
49. Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions |
|
|
- Sven Bambach, Stefan Lee, David J. Crandall, Chen Yu |
|
|
|
|
|
53. Relaxing From Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging |
|
|
- Jianlong Fu, Yue Wu, Tao Mei, Jinqiao Wang, Hanqing Lu, Yong Rui |
|
|
|
|
|
54. Visual Phrases for Exemplar Face Detection |
|
|
- Vijay Kumar, Anoop Namboodiri, C. V. Jawahar |
|
|
|
|
|
55. Spatial Semantic Regularisation for Large Scale Object Detection |
|
|
- Damian Mrowca, Marcus Rohrbach, Judy Hoffman, Ronghang Hu, Kate Saenko, Trevor Darrell |
|
|
|
|
|
56. Human Pose Estimation in Videos |
|
|
- Dong Zhang, Mubarak Shah |
|
|
|
|
|
57. Contour Box: Rejecting Object Proposals Without Explicit Closed Contours |
|
|
- Cewu Lu, Shu Liu, Jiaya Jia, Chi-Keung Tang |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
## Recognition and 3D CV |
|
|
|
|
|
2. Joint Camera Clustering and Surface Segmentation for Large-Scale Multi-View Stereo |
|
|
- Runze Zhang, Shiwei Li, Tian Fang, Siyu Zhu, Long Quan |
|
|
|
|
|
4. Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition |
|
|
- Torsten Sattler, Michal Havlena, Filip Radenović, Konrad Schindler, Marc Pollefeys |
|
|
|
|
|
5. Globally Optimal 2D-3D Registration From Points or Lines Without Correspondences |
|
|
- Mark Brown, David Windridge, Jean-Yves Guillemaut |
|
|
|
|
|
10. Semantically-Aware Aerial Reconstruction From Multi-Modal Data |
|
|
- Randi Cabezas, Julian Straub, John W. Fisher III |
|
|
|
|
|
15. Exploiting Object Similarity in 3D Reconstruction |
|
|
- Chen Zhou, Fatma Güney, Yizhou Wang, Andreas Geiger |
|
|
|
|
|
16. You Are Here: Mimicking the Human Thinking Process in Reading Floor-Plans |
|
|
- Hang Chu, Dong Ki Kim, Tsuhan Chen |
|
|
|
|
|
24. The Likelihood-Ratio Test and Efficient Robust Estimation |
|
|
- Andrea Cohen, Christopher Zach |
|
|
|
|
|
35. Real-Time Pose Estimation Piggybacked on Object Detection |
|
|
- Roman Juránek, Adam Herout, Markéta Dubská, Pavel Zemčík |
|
|
|
|
|
36. Understanding and Predicting Image Memorability at a Large Scale |
|
|
- Aditya Khosla, Akhil S. Raju, Antonio Torralba, Aude Oliva |
|
|
|
|
|
37. Multiple Granularity Descriptors for Fine-Grained Categorization |
|
|
- Dequan Wang, Zhiqiang Shen, Jie Shao, Wei Zhang, Xiangyang Xue, Zheng Zhang |
|
|
|
|
|
38. Guiding the Long-Short Term Memory Model for Image Caption Generation |
|
|
- Xu Jia, Efstratios Gavves, Basura Fernando, Tinne Tuytelaars |
|
|
|
|
|
39. Just Noticeable Differences in Visual Attributes |
|
|
- Aron Yu, Kristen Grauman |
|
|
|
|
|
40. VQA: Visual Question Answering |
|
|
- Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh |
|
|
|
|
|
41. Localize Me Anywhere, Anytime: A Multi-Task Point-Retrieval Approach |
|
|
- Guoyu Lu, Yan Yan, Li Ren, Jingkuan Song, Nicu Sebe, Chandra Kambhamettu |
|
|
|
|
|
42. Dense Optical Flow Prediction From a Static Image |
|
|
- Jacob Walker, Abhinav Gupta, Martial Hebert |
|
|
|
|
|
44. Visual Madlibs: Fill in the Blank Description Generation and Question Answering |
|
|
- Licheng Yu, Eunbyung Park, Alexander C. Berg, Tamara L. Berg |
|
|
|
|
|
45. Actions and Attributes From Wholes and Parts |
|
|
- Georgia Gkioxari, Ross Girshick, Jitendra Malik |
|
|
|
|
|
46. DeepBox: Learning Objectness With Convolutional Networks |
|
|
- Weicheng Kuo, Bharath Hariharan, Jitendra Malik |
|
|
|
|
|
47. Active Object Localization With Deep Reinforcement Learning |
|
|
- Juan C. Caicedo, Svetlana Lazebnik |
|
|
|
|
|
48. Scene-Domain Active Part Models for Object Representation |
|
|
- Zhou Ren, Chaohui Wang, Alan L. Yuille |
|
|
|
|
|
49. A Unified Multiplicative Framework for Attribute Learning |
|
|
- Kongming Liang, Hong Chang, Shiguang Shan, Xilin Chen |
|
|
|
|
|
50. Contractive Rectifier Networks for Nonlinear Maximum Margin Classification |
|
|
- Senjian An, Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel |
|
|
|
|
|
51. Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization |
|
|
- Zhe Xu, Shaoli Huang, Ya Zhang, Dacheng Tao |
|
|
|
|
|
52. Learning Like a Child: Fast Novel Visual Concept Learning From Sentence Descriptions of Images |
|
|
- Junhua Mao, Xu Wei, Yi Yang, Jiang Wang, Zhiheng Huang, Alan L. Yuille |
|
|
|
|
|
53. Learning Common Sense Through Visual Abstraction |
|
|
- Ramakrishna Vedantam, Xiao Lin, Tanmay Batra, C. Lawrence Zitnick, Devi Parikh |
|
|
|
|
|
54. Domain Generalization for Object Recognition With Multi-Task Autoencoders |
|
|
- Muhammad Ghifary, W. Bastiaan Kleijn, Mengjie Zhang, David Balduzzi |
|
|
|
|
|
55. Square Localization for Efficient and Accurate Object Detection |
|
|
- Cewu Lu, Yongyi Lu, Hao Chen, Chi-Keung Tang |
|
|
|
|
|
56. Box Aggregation for Proposal Decimation: Last Mile of Object Detection |
|
|
- Shu Liu, Cewu Lu, Jiaya Jia |
|
|
|
|
|
57. DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers |
|
|
- Amir Ghodrati, Ali Diba, Marco Pedersoli, Tinne Tuytelaars, Luc Van Gool |
|
|
|
|
|
58. Semantic Segmentation With Object Clique Potential |
|
|
- Xiaojuan Qi, Jianping Shi, Shu Liu, Renjie Liao, Jiaya Jia |
|
|
|
|
|
59. Automatic Concept Discovery From Parallel Text and Visual Corpora |
|
|
- Chen Sun, Chuang Gan, Ram Nevatia |
|
|
|
|
|
61. Monocular Object Instance Segmentation and Depth Ordering With CNNs |
|
|
- Ziyu Zhang, Alexander G. Schwing, Sanja Fidler, Raquel Urtasun |
|
|
|
|
|
62. Multimodal Convolutional Neural Networks for Matching Image and Sentence |
|
|
- Lin Ma, Zhengdong Lu, Lifeng Shang, Hang Li |
|
|
|
|
|
64. Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models |
|
|
- Bryan A. Plummer, Liwei Wang, Chris M. Cervantes, Juan C. Caicedo, Julia Hockenmaier, Svetlana Lazebnik |
|
|
|
|
|
65. Predicting Depth, Surface Normals and Semantic Labels With a Common Multi-Scale Convolutional Architecture |
|
|
- David Eigen, Rob Fergus |
|
|
|
|
|
66. AttentionNet: Aggregating Weak Directions for Accurate Object Detection |
|
|
- Donggeun Yoo, Sunggyun Park, Joon-Young Lee, Anthony S. Paek, In So Kweon |
|
|
|
|
|
67. Common Subspace for Model and Similarity: Phrase Learning for Caption Generation From Images |
|
|
- Yoshitaka Ushiku, Masataka Yamaguchi, Yusuke Mukuta, Tatsuya Harada |
|
|
|
|
|
|
|
|
|
|
|
## Representations for Recognition & Localization |
|
|
|
|
|
1. 3D-Assisted Feature Synthesis for Novel Views of an Object |
|
|
- Hao Su, Fan Wang, Eric Yi, Leonidas J. Guibas |
|
|
|
|
|
2. Render for CNN: Viewpoint Estimation in Images Using CNNs Trained With Rendered 3D Model Views |
|
|
- Hao Su, Charles R. Qi, Yangyan Li, Leonidas J. Guibas |
|
|
|
|
|
|
|
|
|
|
|
## Statistical Methods & Learning, Motion & Tracking, and Video Analysis |
|
|
|
|
|
2. DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving |
|
|
- Chenyi Chen, Ari Seff, Alain Kornhauser, Jianxiong Xiao |
|
|
|
|
|
3. Active Transfer Learning With Zero-Shot Priors: Reusing Past Datasets for Future Tasks |
|
|
- Efstratios Gavves, Thomas Mensink, Tatiana Tommasi, Cees G. M. Snoek, Tinne Tuytelaars |
|
|
|
|
|
4. HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition |
|
|
- Zhicheng Yan, Hao Zhang, Robinson Piramuthu, Vignesh Jagadeesh, Dennis DeCoste, Wei Di, Yizhou Yu |
|
|
|
|
|
5. Learning The Structure of Deep Convolutional Networks |
|
|
- Jiashi Feng, Trevor Darrell |
|
|
|
|
|
6. FlowNet: Learning Optical Flow With Convolutional Networks |
|
|
- Alexey Dosovitskiy, Philipp Fischer, Eddy Ilg, Philip Häusser, Caner Hazırbaş, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, Thomas Brox |
|
|
|
|
|
10. Unsupervised Learning of Visual Representations Using Videos |
|
|
- Xiaolong Wang, Abhinav Gupta |
|
|
|
|
|
11. A Nonparametric Bayesian Approach Toward Stacked Convolutional Independent Component Analysis |
|
|
- Sotirios P. Chatzis, Dimitrios Kosmopoulos |
|
|
|
|
|
14. Robust Optimization for Deep Regression |
|
|
- Vasileios Belagiannis, Christian Rupprecht, Gustavo Carneiro, Nassir Navab |
|
|
|
|
|
16. Maximum-Margin Structured Learning With Deep Networks for 3D Human Pose Estimation |
|
|
- Sijin Li, Weichen Zhang, Antoni B. Chan |
|
|
|
|
|
17. An Exploration of Parameter Redundancy in Deep Networks With Circulant Projections |
|
|
- Yu Cheng, Felix X. Yu, Rogerio S. Feris, Sanjiv Kumar, Alok Choudhary, Shi-Fu Chang |
|
|
|
|
|
19. Understanding Deep Features With Computer-Generated Imagery |
|
|
- Mathieu Aubry, Bryan C. Russell |
|
|
|
|
|
21. Context-Aware CNNs for Person Head Detection |
|
|
- Tuan-Hung Vu, Anton Osokin, Ivan Laptev |
|
|
|
|
|
23. Highly-Expressive Spaces of Well-Behaved Transformations: Keeping It Simple |
|
|
- Oren Freifeld, Søren Hauberg, Kayhan Batmanghelich, John W. Fisher III |
|
|
|
|
|
26. PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization |
|
|
- Alex Kendall, Matthew Grimes, Roberto Cipolla |
|
|
|
|
|
27. Predicting Multiple Structured Visual Interpretations |
|
|
- Debadeepta Dey, Varun Ramakrishna, Martial Hebert, J. Andrew Bagnell |
|
|
|
|
|
28. Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks |
|
|
- Chunshui Cao, Xianming Liu, Yi Yang, Yinan Yu, Jiang Wang, Zilei Wang, Yongzhen Huang, Liang Wang, Chang Huang, Wei Xu, Deva Ramanan, Thomas S. Huang |
|
|
|
|
|
29. Matrix Backpropagation for Deep Networks With Structured Layers |
|
|
- Catalin Ionescu, Orestis Vantzos, Cristian Sminchisescu |
|
|
|
|
|
31. Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition |
|
|
- Heechul Jung, Sihaeng Lee, Junho Yim, Sunjeong Park, Junmo Kim |
|
|
|
|
|
32. Direct Intrinsics: Learning Albedo-Shading Decomposition by Convolutional Regression |
|
|
- Takuya Narihira, Michael Maire, Stella X. Yu |
|
|
|
|
|
33. Face Flow |
|
|
- Patrick Snape, Anastasios Roussos, Yannis Panagakis, Stefanos Zafeiriou |
|
|
|
|
|
42. Hierarchical Convolutional Features for Visual Tracking |
|
|
- Chao Ma, Jia-Bin Huang, Xiaokang Yang, Ming-Hsuan Yang |
|
|
|
|
|
44. Online Object Tracking With Proposal Selection |
|
|
- Yang Hua, Karteek Alahari, Cordelia Schmid |
|
|
|
|
|
45. Understanding and Diagnosing Visual Tracking Systems |
|
|
- Naiyan Wang, Jianping Shi, Dit-Yan Yeung, Jiaya Jia |
|
|
|
|
|
47. Visual Tracking With Fully Convolutional Networks |
|
|
- Lijun Wang, Wanli Ouyang, Xiaogang Wang, Huchuan Lu |
|
|
|
|
|
48. Multiple Feature Fusion via Weighted Entropy for Visual Tracking |
|
|
- Lin Ma, Jiwen Lu, Jianjiang Feng, Jie Zhou |
|
|
|
|
|
49. Pedestrian Travel Time Estimation in Crowded Scenes |
|
|
- Shuai Yi, Hongsheng Li, Xiaogang Wang |
|
|
|
|
|
52. Learning to Track for Spatio-Temporal Action Localization |
|
|
- Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid |
|
|
|
|
|
53. Unsupervised Object Discovery and Tracking in Video Collections |
|
|
- Suha Kwak, Minsu Cho, Ivan Laptev, Jean Ponce, Cordelia Schmid |
|
|
|
|
|
54. Car That Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models |
|
|
- Ashesh Jain, Hema S. Koppula, Bharad Raghavan, Shane Soh, Ashutosh Saxena |
|
|
|
|
|
58. P-CNN: Pose-Based CNN Features for Action Recognition |
|
|
- Guilhem Chéron, Ivan Laptev, Cordelia Schmid |
|
|
|
|
|
59. Fully Connected Object Proposals for Video Segmentation |
|
|
- Federico Perazzi, Oliver Wang, Markus Gross, Alexander Sorkine-Hornung |
|
|
|
|
|
60. Video Segmentation With Just a Few Strokes |
|
|
- Naveen Shankar Nagaraja, Frank R. Schmidt, Thomas Brox |
|
|
|
|
|
61. Actionness-Assisted Recognition of Actions |
|
|
- Ye Luo, Loong-Fah Cheong, An Tran |
|
|
|
|
|
66. RGB-W: When Vision Meets Wireless |
|
|
- Alexandre Alahi, Albert Haque, Li Fei-Fei |
|
|
|
|
|
68. Simultaneous Foreground Detection and Classification With Hybrid Features |
|
|
- Jaemyun Kim, Adín Ramírez Rivera, Byungyong Ryu, Oksam Chae |
|
|
|
|
|
|
|
|
|
|
|
## Vision & People |
|
|
|
|
|
1. Training a Feedback Loop for Hand Pose Estimation |
|
|
- Markus Oberweger, Paul Wohlhart, Vincent Lepetit |
|
|
|
|
|
2. Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose |
|
|
- Danhang Tang, Jonathan Taylor, Pushmeet Kohli, Cem Keskin, Tae-Kyun Kim, Jamie Shotton |
|
|
|
|
|
4. Where to Buy It: Matching Street Clothing Photos in Online Shops |
|
|
- M. Hadi Kiapour, Xufeng Han, Svetlana Lazebnik, Alexander C. Berg, Tamara L. Berg |
|
|
|
|
|
5. Multi-Task Recurrent Neural Network for Immediacy Prediction |
|
|
- Xiao Chu, Wanli Ouyang, Wei Yang, Xiaogang Wang |
|
|
|
|
|
6. Learning Complexity-Aware Cascades for Deep Pedestrian Detection |
|
|
- Zhaowei Cai, Mohammad Saberian, Nuno Vasconcelos |
|
|
|
|
|
|
|
|
## Computational Photography, Face & Gesture, and Vision for X |
|
|
|
|
|
4. TransCut: Transparent Object Segmentation From a Light-Field Image |
|
|
- Yichao Xu, Hajime Nagahara, Atsushi Shimada, Rin-ichiro Taniguchi |
|
|
|
|
|
7. Learning Data-Driven Reflectance Priors for Intrinsic Image Decomposition |
|
|
- Tinghui Zhou, Philipp Krähenbühl, Alexei A. Efros |
|
|
|
|
|
12. Intrinsic Depth: Improving Depth Transfer With Intrinsic Images |
|
|
- Naejin Kong, Michael J. Black |
|
|
|
|
|
23. Selective Encoding for Recognizing Unreliably Localized Faces |
|
|
- Ang Li, Vlad Morariu, Larry S. Davis |
|
|
|
|
|
24. Confidence Preserving Machine for Facial Action Unit Detection |
|
|
- Jiabei Zeng, Wen-Sheng Chu, Fernando De la Torre, Jeffrey F. Cohn, Zhang Xiong |
|
|
|
|
|
25. Learning Social Relation Traits From Face Images |
|
|
- Zhanpeng Zhang, Ping Luo, Chen-Change Loy, Xiaoou Tang |
|
|
|
|
|
26. Robust Heart Rate Measurement From Video Using Select Random Patches |
|
|
- Antony Lam, Yoshinori Kuno |
|
|
|
|
|
28. Robust Facial Landmark Detection Under Significant Head Poses and Occlusion |
|
|
- Yue Wu, Qiang Ji |
|
|
|
|
|
29. Conditional Convolutional Neural Network for Modality-Aware Face Recognition |
|
|
- Chao Xiong, Xiaowei Zhao, Danhang Tang, Karlekar Jayashree, Shuicheng Yan, Tae-Kyun Kim |
|
|
|
|
|
30. From Facial Parts Responses to Face Detection: A Deep Learning Approach |
|
|
- Shuo Yang, Ping Luo, Chen-Change Loy, Xiaoou Tang |
|
|
|
|
|
32. Pose-Invariant 3D Face Alignment |
|
|
- Amin Jourabloo, Xiaoming Liu |
|
|
|
|
|
33. From Emotions to Action Units With Hidden and Semi-Hidden-Task Learning |
|
|
- Adrià Ruiz, Joost Van de Weijer, Xavier Binefa |
|
|
|
|
|
36. Deep Learning Face Attributes in the Wild |
|
|
- Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang |
|
|
|
|
|
37. Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification |
|
|
- Chi Su, Fan Yang, Shiliang Zhang, Qi Tian, Larry S. Davis, Wen Gao |
|
|
|
|
|
38. Regressing a 3D Face Shape From a Single Image |
|
|
- Sergey Tulyakov, Nicu Sebe |
|
|
|
|
|
45. A Spatio-Temporal Appearance Representation for Video-Based Pedestrian Re-Identification |
|
|
- Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang |
|
|
|
|
|
48. Discriminative Pose-Free Descriptors for Face and Object Matching |
|
|
- Soubhik Sanyal, Sivaram Prasad Mudunuri, Soma Biswas |
|
|
|
|
|
49. Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation |
|
|
- Meina Kan, Shiguang Shan, Xilin Chen |
|
|
|
|
|
51. Person Recognition in Personal Photo Collections |
|
|
- Seong Joon Oh, Rodrigo Benenson, Mario Fritz, Bernt Schiele |
|
|
|
|
|
56. Learning to Predict Saliency on Face Images |
|
|
- Mai Xu, Yun Ren, Zulin Wang |
|
|
|
|
|
57. Group Membership Prediction |
|
|
- Ziming Zhang, Yuting Chen, Venkatesh Saligrama |
|
|
|
|
|
59. Robust RGB-D Odometry Using Point and Line Features |
|
|
- Yan Lu, Dezhen Song |
|
|
|
|
|
60. Learning a Discriminative Model for the Perception of Realism in Composite Images |
|
|
- Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, Alexei A. Efros |
|
|
|
|
|
61. What Makes Tom Hanks Look Like Tom Hanks |
|
|
- Supasorn Suwajanakorn, Steven M. Seitz, Ira Kemelmacher-Shlizerman |
|
|
|
|
|
63. Personalized Age Progression With Aging Dictionary |
|
|
- Xiangbo Shu, Jinhui Tang, Hanjiang Lai, Luoqi Liu, Shuicheng Yan |
|
|
|
|
|
64. FaceDirector: Continuous Control of Facial Performance in Video |
|
|
- Charles Malleson, Jean-Charles Bazin, Oliver Wang, Derek Bradley, Thabo Beeler, Adrian Hilton, Alexander Sorkine-Hornung |
|
|
|
|
|
65. Synthesizing Illumination Mosaics From Internet Photo-Collections |
|
|
- Dinghuang Ji, Enrique Dunn, Jan-Michael Frahm |
|
|
|
|
|
66. Hot or Not: Exploring Correlations Between Appearance and Temperature |
|
|
- Daniel Glasner, Pascal Fua, Todd Zickler, Lihi Zelnik-Manor |
|
|
|
|
|
|
|
|
|
|
|
## Motion & Correspondence |
|
|
|
|
|
3. Dense Semantic Correspondence Where Every Pixel is a Classifier |
|
|
- Hilton Bristow, Jack Valmadre, Simon Lucey |
|
|
|
|
|
|
|
|
|
|
|
## Statistical Methods & Learning, Motion & Tracking, and Video Analysis II |
|
|
|
|
|
1. Differential Recurrent Neural Networks for Action Recognition |
|
|
- Vivek Veeriah, Naifan Zhuang, Guo-Jun Qi |
|
|
|
|
|
4. Simultaneous Deep Transfer Across Domains and Tasks |
|
|
- Eric Tzeng, Judy Hoffman, Trevor Darrell, Kate Saenko |
|
|
|
|
|
5. Low Dimensional Explicit Feature Maps |
|
|
- Ondřej Chum |
|
|
|
|
|
6. Unsupervised Learning of Spatiotemporally Coherent Metrics |
|
|
- Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann LeCun |
|
|
|
|
|
7. Multi-Label Cross-Modal Retrieval |
|
|
- Viresh Ranjan, Nikhil Rasiwasia, C. V. Jawahar |
|
|
|
|
|
10. Unsupervised Domain Adaptation With Imbalanced Cross-Domain Data |
|
|
- Tzu Ming Harry Hsu, Wei Yu Chen, Cheng-An Hou, Yao-Hung Hubert Tsai, Yi-Ren Yeh, Yu-Chiang Frank Wang |
|
|
|
|
|
12. Geometry-Aware Deep Transform |
|
|
- Jiaji Huang, Qiang Qiu, Robert Calderbank, Guillermo Sapiro |
|
|
|
|
|
15. Zero-Shot Learning via Semantic Similarity Embedding |
|
|
- Ziming Zhang, Venkatesh Saligrama |
|
|
|
|
|
18. Multi-View Domain Generalization for Visual Recognition |
|
|
- Li Niu, Wen Li, Dong Xu |
|
|
|
|
|
19. Infinite Feature Selection |
|
|
- Giorgio Roffo, Simone Melzi, Marco Cristani |
|
|
|
|
|
20. Semi-Supervised Zero-Shot Classification With Label Representation Learning |
|
|
- Xin Li, Yuhong Guo, Dale Schuurmans |
|
|
|
|
|
24. Predicting Deep Zero-Shot Convolutional Neural Networks Using Textual Descriptions |
|
|
- Jimmy Lei Ba, Kevin Swersky, Sanja Fidler, Ruslan Salakhutdinov |
|
|
|
|
|
25. Structured Feature Selection |
|
|
- Tian Gao, Ziheng Wang, Qiang Ji |
|
|
|
|
|
26. Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning |
|
|
- Yan Huang, Wei Wang, Liang Wang |
|
|
|
|
|
27. Learning Image and User Features for Recommendation in Social Networks |
|
|
- Xue Geng, Hanwang Zhang, Jingwen Bian, Tat-Seng Chua |
|
|
|
|
|
28. Dual-Feature Warping-Based Motion Model Estimation |
|
|
- Shiwei Li, Lu Yuan, Jian Sun, Long Quan |
|
|
|
|
|
29. An Adaptive Data Representation for Robust Point-Set Registration and Merging |
|
|
- Dylan Campbell, Lars Petersson |
|
|
|
|
|
31. Learning Spatially Regularized Correlation Filters for Visual Tracking |
|
|
- Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg |
|
|
|
|
|
32. SpeDo: 6 DOF Ego-Motion Sensor Using Speckle Defocus Imaging
- Kensei Jo, Mohit Gupta, Shree K. Nayar
|
|
|
|
|
35. Recurrent Network Models for Human Dynamics |
|
|
- Katerina Fragkiadaki, Sergey Levine, Panna Felsen, Jitendra Malik |
|
|
|
|
|
36. Contour Flow: Middle-Level Motion Estimation by Combining Motion Segmentation and Contour Alignment |
|
|
- Huijun Di, Qingxuan Shi, Feng Lv, Ming Qin, Yao Lu |
|
|
|
|
|
39. Minimizing Human Effort in Interactive Tracking by Incremental Learning of Model Parameters |
|
|
- Arridhana Ciptadi, James M. Rehg |
|
|
|
|
|
40. A Novel Representation of Parts for Accurate 3D Object Detection and Tracking in Monocular Images |
|
|
- Alberto Crivellaro, Mahdi Rad, Yannick Verdie, Kwang Moo Yi, Pascal Fua, Vincent Lepetit |
|
|
|
|
|
41. Linearization to Nonlinear Learning for Visual Tracking |
|
|
- Bo Ma, Hongwei Hu, Jianbing Shen, Yuping Zhang, Fatih Porikli |
|
|
|
|
|
42. Self-Occlusions and Disocclusions in Causal Video Object Segmentation |
|
|
- Yanchao Yang, Ganesh Sundaramoorthi, Stefano Soatto |
|
|
|
|
|
43. Large Displacement 3D Scene Flow With Occlusion Reasoning |
|
|
- Andrei Zanfir, Cristian Sminchisescu |
|
|
|
|
|
46. Category-Blind Human Action Recognition: A Practical Recognition System |
|
|
- Wenbo Li, Longyin Wen, Mooi Choo Chuah, Siwei Lyu |
|
|
|
|
|
48. Weakly-Supervised Alignment of Video With Text |
|
|
- Piotr Bojanowski, Rémi Lajugie, Edouard Grave, Francis Bach, Ivan Laptev, Jean Ponce, Cordelia Schmid |
|
|
|
|
|
49. Learning Temporal Embeddings for Complex Video Analysis |
|
|
- Vignesh Ramanathan, Kevin Tang, Greg Mori, Li Fei-Fei |
|
|
|
|
|
50. Unsupervised Semantic Parsing of Video Collections |
|
|
- Ozan Sener, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena |
|
|
|
|
|
51. Learning Spatiotemporal Features With 3D Convolutional Networks |
|
|
- Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri |
|
|
|
|
|
52. Temporal Perception and Prediction in Ego-Centric Video |
|
|
- Yipin Zhou, Tamara L. Berg |
|
|
|
|
|
53. Describing Videos by Exploiting Temporal Structure |
|
|
- Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville |
|
|
|
|
|
55. Storyline Representation of Egocentric Videos With an Application to Story-Based Search
|
|
- Bo Xiong, Gunhee Kim, Leonid Sigal |
|
|
|
|
|
56. Sequence to Sequence – Video to Text |
|
|
- Subhashini Venugopalan, Marcus Rohrbach, Jeffrey Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko |
|
|
|
|
|
58. Action Recognition by Hierarchical Mid-Level Action Elements |
|
|
- Tian Lan, Yuke Zhu, Amir Roshan Zamir, Silvio Savarese |
|
|
|
|
|
63. Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks |
|
|
- Lin Sun, Kui Jia, Dit-Yan Yeung, Bertram E. Shi |
|
|
|
|
|
66. Love Thy Neighbors: Image Annotation by Exploiting Image Metadata |
|
|
- Justin Johnson, Lamberto Ballan, Li Fei-Fei |
|
|
|
|
|
67. Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-Encoders |
|
|
- Huan Yang, Baoyuan Wang, Stephen Lin, David Wipf, Minyi Guo, Baining Guo |
|
|
|
|
|
|
|
|
|
|
|
## Video -- Actions, Surveillance & Tracking |
|
|
|
|
|
1. Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations From Surveillance Videos |
|
|
- Elisa Ricci, Jagannadan Varadarajan, Ramanathan Subramanian, Samuel Rota Bulò, Narendra Ahuja, Oswald Lanz |
|
|
|
|
|
2. Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off!
- Bilge Soran, Ali Farhadi, Linda Shapiro
|
|
|
|
|
3. Partial Person Re-Identification
- Wei-Shi Zheng, Xiang Li, Tao Xiang, Shengcai Liao, Jianhuang Lai, Shaogang Gong
|
|
|
|
|
5. Multiple Hypothesis Tracking Revisited
- Chanho Kim, Fuxin Li, Arridhana Ciptadi, James M. Rehg
|
|
|
|
|
6. Learning to Track: Online Multi-Object Tracking by Decision Making
- Yu Xiang, Alexandre Alahi, Silvio Savarese
|
|
|
|
|
|