Vehicle and pedestrian detection (VaPD) is one of the most critical tasks in an advanced driver assistance system which help the driver to drive safely and save the pedestrian life. VaPD is a typical object detection problem that requires a trade-off among accuracy, speed, and memory consumption. Most existing methods focus on improving detection accuracy, while ignoring VaPD requires real-time detection speed with limited computational resources. Thus, it is of primary importance to study light-weight and real-time VaPD methods for embedded devices, that is hardware platforms with limited computation and memory resources. To deal with these issues, this paper proposes a low-rank (LR) Tiny YOLO v3 architecture that meets the requirements of real-time VaPD on embedded systems. The architecture has been developed starting from Tiny YOLO v3 adopting a convolutional neural network compression technique based on Tucker tensor decomposition, able to reduce the computational complexity of the network. A wide experimentation has been carried out on two embedded platforms, Raspberry Pi 4 and NVIDIA Jetson Nano 2 GB, and two datasets commonly used for VaPD, PASCAL VOC and KITTI dataset, showing the superiority of the LR Tiny YOLO v3 with respect to the state-of-the-art networks in obtaining the best compromise between inference time, accuracy and memory occupancy. Moreover, the proposed architecture meets the requirements of VaPD on embedded systems using only 22% of the memory required by the baseline Tiny YOLO v3 Darknet, and always providing better inference time (36.46 FPS) with only a marginal decrease in accuracy ( $\sim$ 2%).

Embedded Real-Time Vehicle and Pedestrian Detection Using a Compressed Tiny YOLO v3 Architecture / Falaschetti, Laura; Manoni, Lorenzo; Palma, Lorenzo; Pierleoni, Paola; Turchetti, Claudio. - In: IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS. - ISSN 1524-9050. - ELETTRONICO. - (2024). [Epub ahead of print] [10.1109/tits.2024.3447453]

Embedded Real-Time Vehicle and Pedestrian Detection Using a Compressed Tiny YOLO v3 Architecture

Falaschetti, Laura
;
Manoni, Lorenzo;Palma, Lorenzo;Pierleoni, Paola;Turchetti, Claudio
2024-01-01

Abstract

Vehicle and pedestrian detection (VaPD) is one of the most critical tasks in an advanced driver assistance system which help the driver to drive safely and save the pedestrian life. VaPD is a typical object detection problem that requires a trade-off among accuracy, speed, and memory consumption. Most existing methods focus on improving detection accuracy, while ignoring VaPD requires real-time detection speed with limited computational resources. Thus, it is of primary importance to study light-weight and real-time VaPD methods for embedded devices, that is hardware platforms with limited computation and memory resources. To deal with these issues, this paper proposes a low-rank (LR) Tiny YOLO v3 architecture that meets the requirements of real-time VaPD on embedded systems. The architecture has been developed starting from Tiny YOLO v3 adopting a convolutional neural network compression technique based on Tucker tensor decomposition, able to reduce the computational complexity of the network. A wide experimentation has been carried out on two embedded platforms, Raspberry Pi 4 and NVIDIA Jetson Nano 2 GB, and two datasets commonly used for VaPD, PASCAL VOC and KITTI dataset, showing the superiority of the LR Tiny YOLO v3 with respect to the state-of-the-art networks in obtaining the best compromise between inference time, accuracy and memory occupancy. Moreover, the proposed architecture meets the requirements of VaPD on embedded systems using only 22% of the memory required by the baseline Tiny YOLO v3 Darknet, and always providing better inference time (36.46 FPS) with only a marginal decrease in accuracy ( $\sim$ 2%).
File in questo prodotto:
File Dimensione Formato  
Embedded_Real-Time_Vehicle_and_Pedestrian_Detection_Using_a_Compressed_Tiny_YOLO_v3_Architecture.pdf

accesso aperto

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza d'uso: Creative commons
Dimensione 1.69 MB
Formato Adobe PDF
1.69 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11566/335112
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact