GitHub - Weichern/Paper-Collection: A track of papers I read and papers I collect to read in future

Object Detection in Image:

Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik, Rich feature hierarchies for accurate object detection and semantic segmentation

✅ Spatial pyramid pooling in deep convolutional networks for visual recognition [[Paper]] (http://arxiv.org/abs/1406.4729) [Note] [Code]

He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 2015, 37(9): 1904-1916.

✅ Fast R-CNN [[Paper]] (http://arxiv.org/pdf/1504.08083) [Note] [Code]

Ross Girshick, Fast R-CNN, arXiv:1504.08083.

✅ Faster R-CNN, Microsoft Research [[Paper]] (http://arxiv.org/pdf/1506.01497) [Note] [Code] [Python Code]

Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, arXiv:1506.01497.

✅ End-to-end people detection in crowded scenes [[Paper]] (http://arxiv.org/abs/1506.04878) [Note] [Code]

Russell Stewart, Mykhaylo Andriluka, End-to-end people detection in crowded scenes, arXiv:1506.04878.

✅ You Only Look Once: Unified, Real-Time Object Detection [[Paper]] (http://arxiv.org/abs/1506.02640) [Note] [Code]

Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi, You Only Look Once: Unified, Real-Time Object Detection, arXiv:1506.02640

✅ Adaptive Object Detection Using Adjacency and Zoom Prediction [[Paper]] (http://arxiv.org/abs/1512.07711) [Note]

Lu Y, Javidi T, Lazebnik S. Adaptive Object Detection Using Adjacency and Zoom Prediction[J]. arXiv:1512.07711, 2015.

✅ Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [Paper] [Note]

Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick. arXiv:1512.04143, 2015.

✅ G-CNN: an Iterative Grid Based Object Detector [Paper]

Mahyar Najibi, Mohammad Rastegari, Larry S. Davis. arXiv:1512.07729, 2015.
SSD [Paper]
- Liu W, Anguelov D, Erhan D, et al. SSD: Single Shot MultiBox Detector[J]. arXiv preprint arXiv:1512.02325, 2015.
Deep Residual Learning for Image Recognition [Paper]
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Deep Residual Learning for Image Recognition. arXiv:1512.03385, 2015
Diagnosing error in object detectors [Paper]
- Hoiem D, Chodpathumwan Y, Dai Q. Diagnosing error in object detectors[M]//Computer Vision–ECCV 2012. Springer Berlin Heidelberg, 2012: 340-353.

Object Detection in Video:

✅ Seq-NMS for Video Object Detection [Paper] [Note]

Wei Han, Pooya Khorrami, Tom Le Paine, Prajit Ramachandran, Mohammad Babaeizadeh, Honghui Shi, Jianan Li, Shuicheng Yan, Thomas S. Huang. Seq-NMS for Video Object Detection. arXiv preprint arXiv:1602.08465, 2016

Image Caption:

✅ Exploring Nearest Neighbor Approaches for Image Captioning [Paper]

Devlin J, Gupta S, Girshick R, et al. Exploring Nearest Neighbor Approaches for Image Captioning[J]. arXiv preprint arXiv:1505.04467, 2015.

Theories

Efficient Back Prop [Paper]
- LeCun Y A, Bottou L, Orr G B, et al. Efficient backprop[M]//Neural networks: Tricks of the trade. Springer Berlin Heidelberg, 2012: 9-48.
Batch Normalization [Paper]
- Ioffe S, Szegedy C. Batch normalization: Accelerating deep network training by reducing internal covariate shift[J]. arXiv preprint arXiv:1502.03167, 2015.

Others

Practical recommendations for gradient-based training of deep architectures [Paper]
- Bengio Y. Practical recommendations for gradient-based training of deep architectures[M]//Neural Networks: Tricks of the Trade. Springer Berlin Heidelberg, 2012: 437-478.

Open Courses

CS231n: Convolutional Neural Networks for Visual Recognition [Course Page]
CS224d: Deep Learning for Natural Language Processing [Course Page]

Online Books

Deep Learning by Ian Goodfellow, Yoshua Bengio and Aaron Courville

Mathmatics

Introduction to Probability Models, Sheldon M. Ross

Name		Name	Last commit message	Last commit date
Latest commit History 62 Commits
["images"]		["images"]
images		images
AZNet.md		AZNet.md
End-to-end-people-detection-in-crowded-scenes.md		End-to-end-people-detection-in-crowded-scenes.md
Fast-RCNN.md		Fast-RCNN.md
Faster R-CNN.md		Faster R-CNN.md
ITPTNotes.md		ITPTNotes.md
Inside-Outside-Net.md		Inside-Outside-Net.md
RCNN.md		RCNN.md
README.md		README.md
SPPNet.md		SPPNet.md
Seq-NMS.md		Seq-NMS.md
YOLO.md		YOLO.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Object Detection in Image:

Object Detection in Video:

Image Caption:

Theories

Others

Open Courses

Online Books

Mathmatics

About

Uh oh!

Releases

Packages

Weichern/Paper-Collection

Folders and files

Latest commit

History

Repository files navigation

Object Detection in Image:

Object Detection in Video:

Image Caption:

Theories

Others

Open Courses

Online Books

Mathmatics

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages