Journal of Northeastern University(Natural Science) ›› 2023, Vol. 44 ›› Issue (7): 953-963.DOI: 10.12068/j.issn.1005-3026.2023.07.006

• Mechanical Engineering • Previous Articles     Next Articles

Gesture Recognition in the Complex Environment Based on Gan-St-YOLOv5

HAO Bo1,2, YIN Xing-chao1, YAN Jun-wei1, ZHANG Li1,2   

  1. 1. Key Laboratory of Vibration and Control of Aero-Propulsion System of Ministry of Education, Northeastern University, Shenyang 110819, China; 2. School of Control Engineering, Northeastern University at Qinhuangdao, Qinhuangdao 066004, China.
  • Published:2023-07-13
  • Contact: YIN Xing-chao
  • About author:-
  • Supported by:
    -

Abstract: During the human-computer interaction of gesture recognition in the complex environment of intelligent industrial production, gesture features are affected by local occlusion, strong illumination and small distant targets, leading to the reduction of gesture features recognized in the process of target detection and recognition, and even classification errors. Given that improving the accuracy of gesture recognition in the complex environment has become an urgent problem to be solved in human-computer interaction tasks, an innovative Gan-St-YOLOv5 model is proposed. On the basis of YOLOv5, GAN and Swin Transformer modules are integrated into SENet channel attention mechanism, and Confluence detection box selection algorithm is used to enhance the accuracy of model detection. In order to verify the superiority of the model, the YOLOv5 model is used for comparison and it is concluded that the mAP_0.5 of Gan-St-YOLOv5 is up to 96.1% on the fully visible test set, as high as 92.3% in the intense illumination test set, as high as 86.6% in the partial occlusion test set, and as high as 96.4% in the remote small target test set, all of which are superior to the YOLOv5 target detection algorithm and achieve higher accuracy with less efficiency loss.

Key words: Gan-St-YOLOv5; gesture recognition; local occlusion; strong illumination; remote small target

CLC Number: