Journal of Northeastern University(Natural Science) ›› 2025, Vol. 46 ›› Issue (6): 8-15.DOI: 10.12068/j.issn.1005-3026.2025.20230341

• Information & Control • Previous Articles     Next Articles

X-ray Image Prohibited Item Detection Algorithm Based on X-ray-RTDETR

Li-zhen LI, Shu-hua MA, Ze-xu GUO, Xiao-chen CHE   

  1. School of Control Engineering,Northeastern University at Qinhuangdao,Qinhuangdao 066004,China. Corresponding author: LI Li-zhen,E-mail: lilizhen559@163. com
  • Received:2023-12-25 Online:2025-06-15 Published:2025-09-01

Abstract:

In response to the problem of low detection precision caused by inconsistent size, high background noise, and large-scale changes in X-ray image prohibited item, the optimization is performed based on RT-DETR-R18 and an X-ray image prohibited item detection algorithm named X-ray-RTDETR is proposed. Firstly, the algorithm employs CSPRepResNet embedded with efficient multi-scale attention as the backbone network to enhance feature extraction capabilities. Secondly, the simplified fast spatial pyramid pooling module is introduced after the three features maps output by the backbone network to improve the robustness and generalization ability of the model. Finally, the SPoolFormer encoder is applied to high-level feature maps with richer semantic concepts for intra-scale feature interaction. The experimental results show that the detection accuracy of X-ray-RTDETR achieves 74.6% on PIDray test set, surpassing RT-DETR-R18 by 8.5%, while reducing the number of parameters and nFLOP by 1.67×106 and 2.24×109, respectively. Compared to the state-of-the-art object detection algorithms at the same scale shows that X-ray-RTDETR not only has higher detection accuracy, but also has less number of parameters and nFLOP. At the same time, its inference speed reaches 85.47 frames per second on RTX2070 Max-Q GPU.

Key words: prohibited item detection, multi-scale attention, feature extraction, pyramid pooling, SPoolFormer encoder

CLC Number: