基于改进YOLOv5的轻量级航拍图像目标检测方法

赵贺; 白天平; 冯越

基于改进YOLOv5的轻量级航拍图像目标检测方法

Lightweight Object Detection Algorithm Based on Improved YOLOv5 for UAV Aerial Images

摘要

摘要: 目前无人机航拍图像的目标检测算法存在模型体积大、网络参数量大、检测精度低等问题，导致其难以部署应用在计算资源有限和功耗较低的嵌入式或者边缘设备上。针对以上问题，本文提出一种基于改进YOLOv5的轻量级目标检测模型GSC-YOLO。首先，通过在主干特征提取网络中引入幻影卷积，降低检测网络的参数量和模型体积。其次，添加简单注意力模块，利用特征图中的全局信息来增强局部特征的表征能力，提高网络对图像中关键信息的提取能力。最后，优化后处理模块中的非极大值抑制算法，采用Cluster_NMS算法更加高效地删除冗余的检测框，同时增加得分惩罚机制和中心点距离机制，进一步提升模型的检测性能。在无人机航拍数据集VisDrone2019进行实验，结果表明，相较于基线模型YOLOv5s，GSC-YOLO模型平均精度提高了3.1%，模型参数量下降了82.1%，模型体积降低了77.4%，对比其他YOLO主流模型和同类轻量级模型，GSC-YOLO模型有更优的模型体积和检测性能，更有利于在嵌入式和边缘设备上部署，在KV260部署验证后，能够较好完成检测任务。

Abstract: Object detection in UAV aerial imagery often faces challenges such as large model size, high parameter count, and limited detection accuracy, which restrict its deployment on embedded and edge devices with constrained computing power and low energy budgets. To address these issues, GSC-YOLO, a lightweight object detection model based on an improved YOLOv5 architecture is proposed. Specifically, ghost convolution is introduced into the backbone network to significantly reduce the number of parameters and overall model size. A simple attention mechanism is also integrated to enhance the network’s ability to extract critical features by leveraging global contextual information. Furthermore, the traditional non-maximum suppression （NMS） is replaced by the more efficient Cluster-NMS, and additional improvements such as score penalty and center-distance mechanisms are applied to further refine detection results. Experiments on the VisDrone2019 dataset demonstrate that GSC-YOLO achieves a 3.1% improvement in mean average precision （mAP）, with an 82.1% reduction in parameters and a 77.4% decrease in model size compared to the baseline YOLOv5s. These results show that GSC-YOLO offers a better trade-off between accuracy and efficiency than other mainstream YOLO variants and lightweight models. Its successful deployment on the KV260 platform further validates its practicality for real-time UAV applications on embedded and edge devices.

HTML全文

参考文献(36)

施引文献

资源附件(0)