面向密集场景结合TC-YOLOX的小目标检测方法
DOI:
作者:
作者单位:

西安工程大学 西安 710048

作者简介:

通讯作者:

中图分类号:

TP391.4

基金项目:

2021年中国高校产学研创新基金(2021ALA02002)、2021年“纺织之光”中国纺织工业联合会高等教育教学改革研究项目(2021BKJGLX004)、西安工程大学2020年高等教育研究项目(20GJ05)资助


Small target detection method for dense scenes combined with TC-YOLOX
Author:
Affiliation:

Xi′an Polytechnic University,Xi′an 710048, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    密集场景下小目标的高效精确检测是目标检测领域的关键问题。为了解决环境的多样性和小目标自身复杂性存在着特征难以提取、检测精度低等问题,提出一种面向密集场景结合TCYOLOX的小目标检测方法。首先,通过在CSPNet中引入Transformer Encode模块,不断更新目标权重实现增强目标特征信息,提高网络的特征提取能力;其次,在特征金字塔网络中增加卷积注意力机制模块,关注重要特征并抑制不必要特征,提高不同尺度目标的检测准确度;然后,采用CIoU代替IoU作为回归损失函数,使得模型训练过程中网络收敛更快,性能更好;最后在PASCAL VOC 2007数据集上验证。实验结果表明,所设计的TCYOLOX模型能够有效的检测出多样化场景中正常、密集、稀疏、黑暗条件下的小目标物体,mAP和检测速度可以达到946%和38 fps,与原始模型相比提升了109%和1 fps,对多种密集场景下的小目标检测任务均具有较好的适用性。

    Abstract:

    Efficient and accurate detection of small targets in dense scenes is a key problem in the field of target detection. In order to solve the problems of diversity of environments and complexity of small targets, such as difficult feature extraction and low detection accuracy, a small target detection method for dense scenes combined with TCYOLOX is proposed. Firstly, by introducing Transformer Encode module into CSPNet, the target weight is continuously updated to enhance the target feature information and improve the network feature extraction capability. Secondly, the convolutional attention mechanism module is added to the feature pyramid network to focus on important features and suppress unnecessary features, so as to improve the detection accuracy of targets of different scales. Then, CIoU is used to replace IoU as the regression loss function, which makes the network converge faster and has better performance in the process of model training. Finally, it is verified on PASCAL VOC 2007 dataset. The experimental results show that the designed TCYOLOX model can effectively detect small target objects under normal, dense, sparse and dark conditions in diversified scenes. The mAP and detection speed can reach 946% and 38 fps, which is 109% and 1 fps higher than the original model. It has good applicability to small target detection tasks in multiple dense scenes.

    参考文献
    相似文献
    引证文献
引用本文

李翔宇,王伟,王峰萍,韩岩江.面向密集场景结合TC-YOLOX的小目标检测方法[J].电子测量技术,2023,46(15):133-142

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2024-01-08
  • 出版日期: