Journal of Northeastern University Natural Science ›› 2020, Vol. 41 ›› Issue (11): 1550-1556.DOI: 10.12068/j.issn.1005-3026.2020.11.005

• Information & Control • Previous Articles     Next Articles

A Feature Selection Method Based on New Redundancy Measurement

LI Zhan-shan, LYU Ai-na   

  1. School of Computer Science and Technology, Jilin University, Changchun 130012, China.
  • Received:2019-12-16 Revised:2019-12-16 Online:2020-11-15 Published:2020-11-16
  • Contact: LYU Ai-na
  • About author:-
  • Supported by:
    -

Abstract: The current filter feature selection models use greedy strategy combined with mutual information to evaluate feature subsets, which are easy to fall into the local optimum trap. Considering the effect of label information on redundancy, an improved MIFS-U method is used to measure the redundancy under the condition of a given label. A decomposition-based multi-objective optimization framework combined with a differential evolution operator that introduces polynomial mutation is used for global search to avoid searching into local optimum. The l1 regularization is introduced to ensure the sparsity of the feature subset, and a new feature selection algorithm MOEA/D-DEFS is proposed. In the experimental stage, the knn-5 classifier is used to verify the learning effect, by the tests on multiple sets of data sets from different fields. The results show that considering feature selection as a multi-objective problem and using a global search strategy can provide better performance in terms of feature subset dimensions and classification accuracy.

Key words: feature selection, mutual information, multi-objective evolution algorithm, l1-regularization, redundancy

CLC Number: