Journal of Northeastern University(Natural Science) ›› 2021, Vol. 42 ›› Issue (12): 1688-1695.DOI: 10.12068/j.issn.1005-3026.2021.12.003

• Information & Control • Previous Articles     Next Articles

Feature Selection Algorithm Based on LightGBM

LI Zhan-shan, YAO Xin, LIU Zhao-geng, ZHANG Jia-chen   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China.
  • Revised:2021-04-09 Accepted:2021-04-09 Published:2021-12-17
  • Contact: ZHANG Jia-chen
  • About author:-
  • Supported by:
    -

Abstract: In order to solve the shortcomings of the following two types of feature selection algorithms, filtering and wrapping based on evolutionary learning, a new wrapping feature selection algorithm LGBFS(LightGBM feature selection) was proposed. First, LightGBM was introduced to construct an gradient boosting tree model for the original features and measure the importance of features; then the proposed LR sequential forward search strategy LRSFFS was combined to select features; finally, the proposed algorithm was compared with nine algorithms in 21 standard datasets. The results show that 16 of the 21 standard data sets of LGBFS have achieved the best classification accuracy, and 18 standard data sets have achieved the best dimensionality reduction rate and the best CPU running time. In addition, time complexity analysis and significance test were carried out. The test shows that LGBFS is significantly different from the six comparison algorithms, and it also shows that LGBFS can balance the calculation efficiency and classification accuracy of feature subsets.

Key words: feature selection; LightGBM; boosting tree; wrapped method; sequential search

CLC Number: