EfficientDet: Scalable and Efficient Object Detection

Source: https://openaccess.thecvf.com/content_CVPR_2020/papers/Tan_EfficientDet_Scalable_and_Efficient_Object_Detection_CVPR_2020_paper.pdf


EfficientDet is better than Yolo v3, it has achieved 52.2% accuracy on the COCO dataset for the object detection tasks.

EfficientDet is a family of models which are more efficient, faster, and accurate in object detection.

Architecture:

It uses EfficientNet-B3 as the backbone and bi-directional feature pyramid network (BiFPN).

BiFPN

BiFPN takes input as P3, P4, P5, P6, P7. these Pn are features of the input image, the formula for finding the size of Pn is (Images Size / 2^n)

P3 = 1024/2^3 = 128

P4 = 1024/2^4 = 64

P5 = 1024/2^5 = 32

P6 = 1024/2^6 = 16

P7 = 1024/2^7 = 8

These features are resized first then aggregates, resize is either done by upsampling or downsampling.

Aggregation of these features is not like normal summation but it is weighted summation, and weights are learnable parameters. this method is called Fast Normalized Fusion, it is 1.26 to 1.31 times faster than softmax, still, give the same accuracy.

COCO Dataset

Source: https://cocodataset.org/#home

COCO is a large-scale dataset. this dataset is used for the following task.

  • Object Detection
  • Object segmentation
  • Keypoint Detection
  • Recognition in context
  • Superpixel stuff segmentation

Building a CLI-Based People Tracking and Dwell Time Analytics System Using YOLOv8 and DeepSORT

  Introduction Tracking people across video frames and analyzing their behavior (like  dwell time ) is a crucial task for many real-world ap...