基于伪点云融合的多模态三维目标检测方法
DOI:
CSTR:
作者:
作者单位:

1.南京信息工程大学自动化院 南京 210044; 2.无锡学院物联网工程学院 无锡 214105

作者简介:

通讯作者:

中图分类号:

TN958.98

基金项目:

国家自然科学基金(42175157,42475151,42305158)、无锡市“太湖之光”科技攻关计划(基础研究)项目(K20231021)资助


Multi-modal 3D object detection based on pseudo point cloud fusion
Author:
Affiliation:

1.School of Automation, Nanjing University of Information Science and Technology,Nanjing 210044, China; 2.School of Internet Engineering, Wuxi University,Wuxi 214105, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对目前的纯激光雷达三维检测方法不可避免地受到点云稀疏性的影响,且激光雷达扫描得到的点云数据在远距离表现比近距离更加稀疏导致模型训练过程中正负样本不均衡的问题,提出一种新的基于伪点云融合的多模态框架MCA-VoxelNet,它由两个关键设计组成:利用深度补全产生的伪点云来解决点云稀疏性问题,并且通过距离感知采样模块丢弃大量附近的冗余体素来提高计算效率;利用多阶段级联注意力检测结构聚合多个检测阶段的目标特征,平衡正负样本数量并逐步完善RPN网络输出的区域建议。在权威的KITTI自动驾驶数据集上的实验结果表明,MCA-VoxelNet以17.54的FPS在简单、中等和困难三个难度类别上的汽车精度分别达到94.19%、85.93%和86.17%,比次优的方法分别高出2.64%、1.16%和1.91%。

    Abstract:

    To address the inevitable limitations of current LiDAR-only 3D detection methods, which are affected by point cloud sparsity—where LiDAR-scanned point clouds exhibit significantly higher sparsity at long range compared to short range, leading to imbalanced positive and negative samples during model training—we propose a novel multi-modal framework named MCA-VoxelNet, based on pseudo-point-cloud fusion.It consists of two key designs: the pseudo-point clouds generated by depth completion are utilized to solve the problem of point cloud sparsity, and a large number of nearby redundant voxels are discarded through the distance-aware sampling module to enhance computational efficiency; a multi-stage cascaded attention detection structure is employed to aggregate the target features of multiple detection stages, balance the number of positive and negative samples, and gradually improve the region proposals output by the Region Proposal Network. Experiments on the authoritative KITTI autonomous driving dataset demonstrate that MCA-VoxelNet achieves an inference speed of 17.54 FPS and attains car detection accuracies of 94.19%, 85.93%, and 86.17% on the easy, moderate, and hard difficulty levels, respectively. These results outperform the second-best method by 2.64%, 1.16%, and 1.91%.

    参考文献
    相似文献
    引证文献
引用本文

李旭,张永宏,朱灵龙,阚希.基于伪点云融合的多模态三维目标检测方法[J].电子测量技术,2026,49(1):121-132

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-02-11
  • 出版日期:
文章二维码

重要通知公告

①《电子测量技术》期刊收款账户变更公告
×
《电子测量技术》
关于防范虚假编辑部邮件的郑重公告