基于改进 RTMPose 的水下光阵关键点检测
DOI:
CSTR:
作者:
作者单位:

哈尔滨工程大学智能科学与工程学院哈尔滨150001

作者简介:

通讯作者:

中图分类号:

TH39

基金项目:

国家自然科学基金面上项目(52271313)资助


The keypoint detection of underwater optical guidance arrays based on the enhanced RTMPose method
Author:
Affiliation:

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    视觉光学导引是无人潜航器(UUV)近端回收的主流方案,但其检测精度与鲁棒性易受机载设备算力限制及复杂水下环境干扰。针对刚体光阵目标结构特性与水下光学导引任务需求,提出一种基于改进RTMPose的水下光阵关键点实时检测方法。核心贡献在于:1)建立水下光阵骨架模型与水下光阵关键点检测数据集,将人体姿态估计自顶向下范式适配至光学导引任务;2)设计融合重参数化技术和坐标注意力机制的轻量化骨干网络,通过解耦训练-推理过程和嵌入分方向的位置信息融合机制,显著降低参数量和计算复杂度的同时,补偿轻量化带来的性能损失,提升模型对水下图像光学衰减退化、遮挡等干扰的鲁棒性。实验结果表明,本方法在自建数据集上平均精度(AP)达到90.8%,平均端点误差(EPE)为0.828 pixels;相比于基线模型,改进模型参数量与浮点运算数(Flops)分别降低约22.8%和26.6%;在Jetson AGX Orin(64 GB)嵌入式平台上,改进模型端到端推理延迟较基线降低了1.14 ms。泛化性实验表明,该方法能通过训练快速适配不同光阵形态与多种水域条件,具备良好的通用性与可靠性。本研究提出的水下光阵关键点检测方法在检测精度、速度以及鲁棒性这3方面取得良好平衡,满足水下光阵关键点检测任务精度及实时性要求,为UUV等机载设备上实现高鲁棒性光学导引提供有效的解决思路。

    Abstract:

    The visual optical guidance is a mainstream solution for the near-end recovery of unmanned underwater vehicles (UUVs), whose accuracy and robustness are susceptible to the limited computational power of onboard devices and interferences of complex underwater environments. In order to address the structural characteristics of rigid optical array targets and the requirements of underwater optical guidance tasks, this study proposes a real-time keypoint detection method for the underwater optical arrays based on the improved RTMPose method. The core contributions are as follows: 1) Establishing a skeleton model for the underwater optical arrays, constructing a dataset for the underwater optical array keypoint detection and adapting the top-down human pose estimation paradigm for the optical guidance tasks; 2) Designing a lightweight backbone network integrating re-parameterization technology and a coordinate attention mechanism. By decoupling the training-inference process and embedding the directional positional information fusion mechanism, the number of parameters and computational complexity are significantly reduced with the compensating ability of performance loss caused by lightweight design, thereby enhancing the model′s robustness against the underwater image degradation, occlusion, and other interferences. Experimental results show that the proposed method achieves an average precision (AP) of 90.8% and a mean pixels error of 0.828 with our dataset. Compared to the baseline model, the proposed model reduces the number of parameters and floating-point operations by approximately 22.8% and 26.6% with the Jetson AGX Orin (64 GB) embedded platform, the corresponding end-to-end inference latency is also reduced by 1.14 ms compared to the baseline. The generalization experiments demonstrate that the method can quickly adapt to different optical array configurations and various water conditions through training, which exhibits the fair versatility and reliability. In conclusion, the proposed underwater optical array keypoint detection method achieves a favorable balance among detection accuracy, speed, and robustness, which meets the precision and real-time requirements for the underwater optical array keypoint detection tasks and provides an effective solution for the high-robustness optical guidance on UUVs and other onboard devices.

    参考文献
    相似文献
    引证文献
引用本文

赵世泉,刘佳兴,王宇超.基于改进 RTMPose 的水下光阵关键点检测[J].仪器仪表学报,2026,47(3):158-169

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-05-22
  • 出版日期:
文章二维码