面向复杂海况和轻量化特性的SAP−YOLOv8船舶目标检测算法

曹阳; 高涵; 沈琴琴; 包银鑫; 施佺

doi:10.19693/j.issn.1673-3185.04880

面向复杂海况和轻量化特性的SAP−YOLOv8船舶目标检测算法

SAP−YOLOv8 ship target detection algorithm for complex sea conditions and lightweight features

摘要

摘要:
目的 YOLOv8算法凭借其高效的处理能力和优异的检测效果在目标检测领域广受认可，但在船舶目标检测中仍存在抗干扰能力不足、多尺度特征提取效果不佳及参数量偏大等问题，为此综合运用空间深度转换分组膨胀（SDGD）卷积、注意力同尺度特征交互（AIFI）、部分卷积（PConv）与坐标注意力机制（CA），提出一种特征增强的轻量级SAP−YOLOv8目标检测算法。
方法首先，在普通卷积中融入空间深度转换卷积与膨胀卷积，构建SDGD模块，提升对海杂波干扰的抑制能力以及多尺度特征的提取能力；然后，引入RT−DETR中的AIFI模块，取代SPPF模块以增强复杂海况下的上下文建模能力；最后，为优化算法的计算效率，基于PConv与CA的原理，构建C3k2_PCCA模块以降低参数量和复杂度，同时提升算法在复杂海况下的轻量化性能和运行效率。
结果在公开数据集HRSID上进行的实验结果表明SAP−YOLOv8算法与原始算法的参数量几乎相同，但精确率、召回率以及平均精度均值分别提高了1.5%、0.7%、1.6%，且检测效果明显优于其他经典算法。
结论 SAP−YOLOv8算法具有更高的检测精度和运行效率，且能够在复杂海况下表现出更强的鲁棒性和实用价值。

Abstract:
Objectives The YOLOv8 algorithm has been widely recognized in the field of object detection due to its high computational efficiency and excellent detection performance. However, when applied to ship detection tasks in complex maritime environments, it still suffers from several limitations, including insufficient robustness to sea clutter and wake interference, inadequate capability for multi-scale feature extraction, and relatively large model parameters, which restrict its deployment on resource-constrained devices. These problems are particularly prominent in scenarios involving small targets, dense distributions, and complex backgrounds. To address these challenges, this paper proposes a lightweight and feature-enhanced ship detection algorithm, termed SAP-YOLOv8, aiming to improve detection accuracy, robustness, and efficiency under complex sea conditions while maintaining a good balance between performance and computational cost.
Methods First, to enhance feature representation capability, spatial depthwise convolution and dilated convolution are integrated into standard convolution to construct the SDGD module. By combining spatial channel decomposition with expanded receptive fields, the SDGD module effectively suppresses sea clutter interference while capturing richer contextual information, thereby significantly improving multi-scale feature extraction performance, especially for small and weak targets. In addition, this design enables the network to better adapt to variations in target scale and background complexity. Next, the AIFI module from the RT-DETR framework is introduced to replace the original SPPF module. Through attention-based same-scale feature interaction, this module strengthens global context modeling and enhances the ability of the network to perceive long-range dependencies and complex spatial relationships in challenging maritime environments. Finally, to further optimize computational efficiency, a novel C3k2_PCCA module is designed based on partial convolution (PConv) and coordinate attention (CA). By reducing redundant feature computation and introducing spatial coordinate information into channel attention, this module effectively decreases parameter redundancy and computational complexity, while maintaining discriminative feature representation, thus improving the lightweight performance and inference efficiency of the model.
Results Experimental results on the public HRSID dataset demonstrate that the proposed SAP-YOLOv8 achieves nearly the same parameter scale as the baseline YOLOv8 model, while improving precision, recall, and mean average precision by 1.5%, 0.7%, and 1.6%, respectively. Moreover, the proposed method shows more stable detection performance in complex maritime scenarios, particularly under strong background interference and significant scale variation, indicating its enhanced robustness and generalization capability compared with several representative classical detection algorithms. Furthermore, the proposed method maintains competitive efficiency while achieving these improvements, making it suitable for practical deployment.
Conclusions The proposed SAP-YOLOv8 algorithm not only improves detection accuracy and computational efficiency but also demonstrates stronger robustness and adaptability in complex sea environments. These advantages indicate that it has promising practical value for real-world ship detection tasks, especially in scenarios with limited computational resources and high environmental complexity.

HTML全文

参考文献(35)

施引文献

资源附件(0)