Skip to content

Weichern/Paper-Collection

 
 

Repository files navigation

Object Detection in Image:

✅ RCNN [Paper] [Note] [Code]

  • Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik, Rich feature hierarchies for accurate object detection and semantic segmentation

✅ Spatial pyramid pooling in deep convolutional networks for visual recognition [[Paper]] (http://arxiv.org/abs/1406.4729) [Note] [Code]

  • He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 2015, 37(9): 1904-1916.

✅ Fast R-CNN [[Paper]] (http://arxiv.org/pdf/1504.08083) [Note] [Code]

  • Ross Girshick, Fast R-CNN, arXiv:1504.08083.

✅ Faster R-CNN, Microsoft Research [[Paper]] (http://arxiv.org/pdf/1506.01497) [Note] [Code] [Python Code]

  • Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, arXiv:1506.01497.

✅ End-to-end people detection in crowded scenes [[Paper]] (http://arxiv.org/abs/1506.04878) [Note] [Code]

  • Russell Stewart, Mykhaylo Andriluka, End-to-end people detection in crowded scenes, arXiv:1506.04878.

✅ You Only Look Once: Unified, Real-Time Object Detection [[Paper]] (http://arxiv.org/abs/1506.02640) [Note] [Code]

  • Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi, You Only Look Once: Unified, Real-Time Object Detection, arXiv:1506.02640

✅ Adaptive Object Detection Using Adjacency and Zoom Prediction [[Paper]] (http://arxiv.org/abs/1512.07711) [Note]

  • Lu Y, Javidi T, Lazebnik S. Adaptive Object Detection Using Adjacency and Zoom Prediction[J]. arXiv:1512.07711, 2015.

✅ Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [Paper] [Note]

  • Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick. arXiv:1512.04143, 2015.

✅ G-CNN: an Iterative Grid Based Object Detector [Paper]

  • Mahyar Najibi, Mohammad Rastegari, Larry S. Davis. arXiv:1512.07729, 2015.

  • SSD [Paper]

    • Liu W, Anguelov D, Erhan D, et al. SSD: Single Shot MultiBox Detector[J]. arXiv preprint arXiv:1512.02325, 2015.
  • Deep Residual Learning for Image Recognition [Paper]

    • Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Deep Residual Learning for Image Recognition. arXiv:1512.03385, 2015
  • Diagnosing error in object detectors [Paper]

    • Hoiem D, Chodpathumwan Y, Dai Q. Diagnosing error in object detectors[M]//Computer Vision–ECCV 2012. Springer Berlin Heidelberg, 2012: 340-353.

Object Detection in Video:

✅ Seq-NMS for Video Object Detection [Paper] [Note]

  • Wei Han, Pooya Khorrami, Tom Le Paine, Prajit Ramachandran, Mohammad Babaeizadeh, Honghui Shi, Jianan Li, Shuicheng Yan, Thomas S. Huang. Seq-NMS for Video Object Detection. arXiv preprint arXiv:1602.08465, 2016

Image Caption:

✅ Exploring Nearest Neighbor Approaches for Image Captioning [Paper]

  • Devlin J, Gupta S, Girshick R, et al. Exploring Nearest Neighbor Approaches for Image Captioning[J]. arXiv preprint arXiv:1505.04467, 2015.

Theories

  • Efficient Back Prop [Paper]
    • LeCun Y A, Bottou L, Orr G B, et al. Efficient backprop[M]//Neural networks: Tricks of the trade. Springer Berlin Heidelberg, 2012: 9-48.
  • Batch Normalization [Paper]
    • Ioffe S, Szegedy C. Batch normalization: Accelerating deep network training by reducing internal covariate shift[J]. arXiv preprint arXiv:1502.03167, 2015.

Others

  • Practical recommendations for gradient-based training of deep architectures [Paper]
    • Bengio Y. Practical recommendations for gradient-based training of deep architectures[M]//Neural Networks: Tricks of the Trade. Springer Berlin Heidelberg, 2012: 437-478.

Open Courses

  • CS231n: Convolutional Neural Networks for Visual Recognition [Course Page]
  • CS224d: Deep Learning for Natural Language Processing [Course Page]

Online Books

  • Deep Learning by Ian Goodfellow, Yoshua Bengio and Aaron Courville

Mathmatics

  • Introduction to Probability Models, Sheldon M. Ross

About

A track of papers I read and papers I collect to read in future

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published