Open Access
Table 3
Overview of object detection models used in the study.
Model | Backbone | Layers | Parameters | Speed (FPS) | Pretrained dataset |
---|---|---|---|---|---|
SSD | MobileNet v2 | 324 | 4627367 | 4.55 | COCO 2017 |
Faster R-CNN | ResNet50 V1 | 275 | 28278415 | 14.29 | COCO 2017 |
YOLOv5 | CSP-Darknet | 270 | 8923647 | 100 | COCO 2017 |
EfficientDet D1 | EfficientNet B1 | 849 | 4991810 | 1.72 | COCO 2017 |
Non-pretrained CNN | custom Conv2D layers(a) | 21 | 810523 | 11.11 | – |
Notes. (a)See Sect. 4.5. Layers refer to the total number of layers and parameters are the total number of trainable parameters. Speed corresponds to detection speed in frame per second. The values differ from the defaults as the models were modified to fit our task.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.