多模态融合的输电线路部件多尺度检测方法
DOI:
CSTR:
作者:
作者单位:

华北电力大学控制与计算机工程学院 北京 102206

作者简介:

通讯作者:

中图分类号:

TP391.41;TM75;TN919.8

基金项目:

国家自然科学基金(62301220)项目资助


Multi-scale detection method for transmission line components based on multimodal fusion
Author:
Affiliation:

School of Control and Computer Engineering, North China Electric Power University,Beijing 102206, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    在输电线路无人机巡检航拍图像的关键部件检测任务中,针对单一模态检测方法精度低和小目标漏检率高的问题,提出了一种融合可见光图像和红外图像的多模态多尺度目标检测方法。首先,该网络构建了并行的双流特征提取主干,旨在同步处理可见光与红外图像,以充分利用前者丰富的色彩与纹理细节信息,以及后者卓越的成像稳定性与高对比度特性。其次,为实现跨模态信息的交互与互补,设计了多模态特征交互融合模块(MFIFM),该模块能动态地调整不同模态特征的融合权重,自适应地整合最具判别力的信息,有效缓解模态差异带来的信息冲突。此外,为提升对小目标部件的感知能力,提出了混合残差多尺度Transformer(HRMS Transformer)模块嵌入到双流主干中,通过多头窗口注意力机制,层级式特征重组以及与残差相结合的策略,增强全局上下文信息提取能力。实验结果表明,该模型精度mAP@50和mAP@50:95较现有单模态方法分别提升5.35%和4.48%。验证了多模态融合技术在输电线路检测领域的有效性和可用性。

    Abstract:

    In the key component detection task of UAV inspection of aerial images for transmission lines, a multimodal multi-scale target detection approach is proposed to address the challenges of accuracy degradation and high miss rates for small targets in single-modal detection methods. This approach integrates visible light and infrared images. First, the network constructs a parallel two-stream feature extraction backbone designed to simultaneously process visible light and infrared images. This design fully utilizes the rich color and texture detail information from the visible light images, along with the superior imaging stability and high contrast characteristics of the infrared images. Next, to facilitate cross-modal information interaction and complementarity, a Multimodal Feature Fusion Interactive Module (MFIFM) is developed. This module dynamically adjusts the fusion weights of features from different modalities, adaptively integrating the most discriminative information and effectively mitigating conflicts arising from modality differences. Additionally, to enhance the perception of small target components, a Hybrid Residual Multi-Scale Transformer (HRMS Transformer) module is incorporated into the dual-stream backbone. By utilizing a multi-head attention mechanism, hierarchical feature reorganization, and a residual-based strategy, the model′s ability to extract global context information is significantly strengthened. Experimental results demonstrate that the model′s mean Average Precision (mAP) at IoU thresholds of 0.50 and 0.50:0.95 improves by 5.35% and 4.48%, respectively, compared to existing single-modal methods. These findings confirm the effectiveness and applicability of multimodal fusion technology in transmission line inspection.

    参考文献
    相似文献
    引证文献
引用本文

周景,赵毅,刘心.多模态融合的输电线路部件多尺度检测方法[J].电子测量技术,2026,49(1):188-198

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-02-11
  • 出版日期:
文章二维码

重要通知公告

①《电子测量技术》期刊收款账户变更公告
×
《电子测量技术》
关于防范虚假编辑部邮件的郑重公告