Object detection

R-CNN

ConvNet Region ConvNet Object Detection

R-CNN (Region-based CNN) detects objects by first generating region proposals using Selective Search, then classifies each using a shared CNN. It combines region warping, feature extraction, and per-region classification with Non-Maximum Suppression.

Girshick, Ross, et al. “Rich feature hierarchies for accurate object detection and semantic segmentation.” Proceedings of the IEEE conference on computer vision and pattern recognition (2014): 580-587.

Name

Model

Input Shape

Parameter Count

FLOPs

R-CNN

RCNN

\((N,C_{in},H,W)\)

\(\mathcal{O}(P_{\text{cnn}} + F^2 + F \cdot K)\)

\(\mathcal{O}\left(P_{\text{cnn}} + N \cdot (FHW + F^2 + FK)\right)\)

To be implemented…🔮