融合动态卷积与可变形注意力的钢材缺陷检测
DOI:
CSTR:
作者:
作者单位:

1.长春理工大学电子信息工程学院长春130022; 2.长春理工大学光电工程学院长春130022

作者简介:

通讯作者:

中图分类号:

TP391.41TH165

基金项目:

某部靶场测试专项规划项目(2T-2018-06)资助


A steel defect detection with fused dynamic convolution and deformable attention
Author:
Affiliation:

1.School of Electronic Information Engineering, Changchun University of Technology, Changchun 130022, China; 2.School of Optoelectronic Engineering, Changchun University of Technology, Changchun 130022, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    钢材表面缺陷的精准检测是工业质量控制的关键环节。尤其在机械制造、汽车工业、电子设备、航空航天及火炮深管等精密制造领域,其表面质量直接决定终端产品的安全性与可靠性。针对现有钢材表面缺陷检测方法中存在多尺度缺陷检测能力有限,小目标与低对比度缺陷漏检率高,边界框回归精度不佳等问题,提出了一种基于YOLOv11n改进的多尺度钢材表面缺陷检测方法。设计多尺度动态卷积,通过并行异构卷积与动态权重融合机制,增强模型对多尺度缺陷的捕捉能力;构建动态残差融合模块,以分组卷积与双重残差结构替换基线C3K2模块,在显著降低参数量的同时提升多尺度特征融合与梯度流通效率,缓解深层网络训练退化问题;改进可变形三元注意力机制,融合可变形卷积与跨维度交互,使注意力感受野随缺陷形态动态调整,精准聚焦微小、低对比度区域,抑制复杂背景干扰;采用Shape-IoU损失函数,通过引入形状与尺度因子优化边界框回归精度,解决传统CIoU在宽高比一致时惩罚失效的问题。在NEU-DET数据集上的实验结果表明:改进模型的mAP@0.5达到81.9%,相比基线YOLOv11n提升了6%,参数量仅为2.3 M,计算量降至5.9 GFLOPs,满足边缘设备部署需求。泛化实验在GC10-DET数据集上较基线模型提升4.1%。可视化分析与泛化性实验进一步验证了其在复杂工业场景下的鲁棒性与实用性。

    Abstract:

    Accurate detection of steel surface defects is a critical aspect of industrial quality control. Especially in precision manufacturing fields such as mechanical engineering, automotive industry, electronics, aerospace, and artillery barrel production, surface quality directly determines the safety and reliability of end products. To address the limitations of existing steel surface defect detection methods, including insufficient multi-scale defect detection capability, high missed detection rates for small and low-contrast defects, and suboptimal bounding box regression accuracy, this article proposes an improved multi-scale steel surface defect detection method based on YOLOv11n. A multi-scale dynamic convolution module is designed, which employs parallel heterogeneous convolutions and a dynamic weight fusion mechanism to enhance the model′s ability to capture multi-scale defects. A dynamic residual fusion module is formulated, replacing the baseline C3K2 module with grouped convolution and a dual-residual structure. This significantly reduces the parameter count while improving multi-scale feature fusion and gradient flow efficiency, alleviating the degradation issue in deep network training. The deformable triple attention mechanism is enhanced by integrating deformable convolution and cross-dimensional interaction, enabling the attention receptive field to dynamically adjust according to defect morphology, thereby precisely focusing on small, low-contrast regions and suppressing complex background interference. The Shape-IoU loss function is adopted, which incorporates shape and scale factors to optimize bounding box regression accuracy, addressing the failure of the penalty term in traditional CIoU when the aspect ratios are identical. Experimental results on the NEU-DET dataset show that the improved model achieves an mAP@0.5 of 81.9%, representing a 6% improvement over the baseline YOLOv11n. The parameter count is only 2.3 M, and the computational cost is reduced to 5.9 GFLOPs, meeting the requirements for deployment on edge devices. Generalization experiments on the GC10-DET dataset show a 4.1% improvement over the baseline model. Visualization analysis and generalization experiments further validate its robustness and practicality in complex industrial scenarios.

    参考文献
    相似文献
    引证文献
引用本文

赵海丽,狄子隆,景文博,宋明喆.融合动态卷积与可变形注意力的钢材缺陷检测[J].仪器仪表学报,2025,46(12):75-86

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-03-02
  • 出版日期:
文章二维码