SSD: Single Shot MultiBox Detector, Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg, 2016European Conference on Computer Vision (ECCV)DOI: 10.1007/978-3-319-46448-0_2 - The original paper introducing the Single Shot MultiBox Detector (SSD) architecture, detailing its multi-scale feature maps, default boxes, and training strategy for fast object detection.
Focal Loss for Dense Object Detection, Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollár, 2017IEEE International Conference on Computer Vision (ICCV)DOI: 10.48550/arXiv.1708.02002 - This paper introduces RetinaNet, a single-stage detector, and the Focal Loss function, designed to address the severe class imbalance problem inherent in dense object detection.
Feature Pyramid Networks for Object Detection, Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie, 2017IEEE Conference on Computer Vision and Pattern Recognition (CVPR)DOI: 10.48550/arXiv.1612.03144 - Describes the Feature Pyramid Network (FPN) architecture, which constructs a multi-scale feature pyramid from a single-resolution input, a component often employed with RetinaNet to improve detection across scales.