Journal of Northeastern University Natural Science ›› 2019, Vol. 40 ›› Issue (1): 11-15.DOI: 10.12068/j.issn.1005-3026.2019.01.003

• Information & Control • Previous Articles     Next Articles

Random Forest Based Quality Analysis and Prediction Method for Hot-Rolled Strip

JI Ying-jun1, YONG Xiao-yue1, LIU Ying-lin2, LIU Shi-xin1   

  1. 1. School of Information Science & Engineering, Northeastern University, Shenyang 110819, China; 2. Big Data Department, Shanghai Baosight Software Co., Ltd., Shanghai 201203, China.
  • Received:2018-04-19 Revised:2018-04-19 Online:2019-01-15 Published:2019-01-28
  • Contact: LIU Shi-xin
  • About author:-
  • Supported by:
    -

Abstract: The process data of hot-rolled strips from an iron and steel enterprise were analyzed to find out the inherent relationship between process parameters and production quality by using an improved random forests algorithm. After critical features being extracted, a defect prediction model was built. According to the experiment, balancing operation can improve the prediction accuracy of the imbalanced data sets. Meanwhile, the combination of CART and C4.5 can further improve the prediction accuracy than each single method. Furthermore, in consideration of the characteristics whose features have high or low correlations with the response variable, mutual information was introduced as an evaluation criterion for feature selection. Mutual information makes great contribution to classification effect of random forest algorithm, and recognition rate of defects of hot-rolled strips is obviously improved by using three strategies.

Key words: hot-rolled strip, defect prediction, data driven, feature selection, random forests

CLC Number: