Journal of Northeastern University(Natural Science) ›› 2025, Vol. 46 ›› Issue (10): 44-50.DOI: 10.12068/j.issn.1005-3026.2025.20240079

• Information & Control • Previous Articles     Next Articles

3D Gesture Estimation Algorithm Based on Geometric Attention Mechanism

Hui ZOU, Li-huang SHE, Ye-han CHEN, Yi YUE   

  1. School of Computer Science & Engineering,Northeastern University,Shenyang 110169,China. Corresponding author: SHE Li-huang,E-mail: shelihuang@ise. neu. edu. cn
  • Received:2024-04-08 Online:2025-10-15 Published:2026-01-13

Abstract:

A gesture recognition network based on the coding and decoding infrastructure of Transformer was designed, and an optimized offset attention mechanism was introduced to extract hand features based on the self-attention mechanism. At the same time, in order to extract the local features of the hand structure better, a neighborhood aggregation strategy was designed. The three-dimensional (3D) complexity of the hand structure itself led to different levels of smoothness in different regions. When estimating gestures, ignoring this feature usually leads to the loss of local key information of the hand structure. In order to solve this problem, geometric decomposition of the hand structure was carried out, and sharp and flexible components were used to represent the sharp and flat regions of the hand structure, respectively. Different attention was paid to the characteristics of these two components through the attention mechanism. Experiments on MSRA, ICVL, and NYU datasets demonstrate that the accuracy of this algorithm is comparable to that of SOTA.

Key words: gesture recognition, 3D point cloud, attention mechanism, Transformer model, deep learning

CLC Number: