东北大学学报(自然科学版) ›› 2023, Vol. 44 ›› Issue (6): 889-897.DOI: 10.12068/j.issn.1005-3026.2023.06.017

• 管理科学 • 上一篇    下一篇

基于数据分布特征的分层无量纲化方法及其均衡性分析

易平涛, 袁建荣, 李伟伟   

  1. (东北大学 工商管理学院, 辽宁 沈阳110169)
  • 发布日期:2023-06-20
  • 通讯作者: 易平涛
  • 作者简介:易平涛(1981-),男,湖南永州人,东北大学教授,博士生导师.
  • 基金资助:
    国家自然科学基金资助项目(72171040,72171041); 中央高校基本科研业务费专项资金资助项目(N2006013).

Hierarchical Dimensionless Method Based on Data Distribution Characteristics and Its Equilibrium Analysis

YI Ping-tao, YUAN Jian-rong, LI Wei-wei   

  1. School of Business Administration, Northeastern University, Shenyang 110169, China.
  • Published:2023-06-20
  • Contact: YUAN Jian-rong
  • About author:-
  • Supported by:
    -

摘要: 分层无量纲化方法能够有效去除指标量纲影响的同时解决异常指标造成的数据分布不均衡、区分度低等问题.然而,该方法的使用需要人为指定区间数,使得无量纲化结果受人为因素的干扰,失去客观性.针对该问题,考虑原始数据的分布特征,提出了密度分层无量纲化方法.该方法按照数据分布的疏密程度进行区间划分,客观确定分层级数,同时兼顾分层无量纲化方法的优点,计算相对简单且减少了人为干扰.此外,通过随机模拟发现,该方法对于异常值具有较好的抗干扰性,且无量纲化结果的均衡性受原始数据规模影响.

关键词: 无量纲化方法;异常值;分层无量纲化方法;数据密度;客观分层

Abstract: The hierarchical dimensionless method can effectively remove the effect of different index dimensions, and solve imbalanced data distribution and low discrimination caused by anomalous index values. However, when using this method, it is necessary to artificially specify the number of partition intervals so that the dimensionless results are interfered by human factors and lose objectivity. To solve this problem, a dimensionless method of density hierarchy is proposed considering the distribution characteristics of raw data. This method divides the interval according to the density of data distribution, objectively determines the hierarchical series, and takes into account the advantages of the hierarchical dimensionless method. The calculation is comparatively simple and reduces human factors. In addition, through the stochastic simulation method, it is found that the method has good anti-interference to outliers, and the balance of dimensionless results is affected by the scale of raw data.

Key words: dimensionless method; outlier; hierarchical dimensionless method; data density; objective hierarchy

中图分类号: