Journal of Northeastern University(Natural Science) ›› 2022, Vol. 43 ›› Issue (5): 639-645.DOI: 10.12068/j.issn.1005-3026.2022.05.005

• Information & Control • Previous Articles     Next Articles

Interpolation of Missing Physiological Data of ICU Patients Based on Deep Embedded Clustering

LI Jian-hua1, ZHU Ze-yang1, XU Li-sheng1,2, SUN Guo-zhe3   

  1. 1. School of Medicine and Biological Information Engineering, Northeastern University, Shenyang 110169, China; 2. Neusoft Research of Intelligent Healthcare Technology, Co.,Ltd., Shenyang 110167, China; 3. Department of Cardiovascular Medicine, The First Hospital of China Medical University, Shenyang 110001, China.
  • Revised:2021-07-13 Accepted:2021-07-13 Published:2022-06-20
  • Contact: XU Li-sheng
  • About author:-
  • Supported by:
    -

Abstract: The data in electronic medical records are often missing, significantly affecting the analysis results. The ICU(intensive care unit)patients’ data in MIMIC database were analyzed for missing value interpolation, and the dataset consists of 23 groups of commonly used clinical physiological variables and 5260 samples without missing values. A K-nearest neighbor interpolation method was proposed based on deep embedded clustering. This method takes deep embedded clustering as the core, constructs the sample proximity matrix through multiple clustering, and then regards the average value of K-nearest neighbor of missing samples as the missing values. Compared with mean interpolation, median interpolation, a posteriori distribution estimation interpolation and conditional mean interpolation, the proposed method obtains higher similarity between the interpolated results and the original data, and better retains the differences between various samples.

Key words: intensive care unit(ICU); electronic medical record; missing value interpolation; deep embedded clustering; proximity matrix

CLC Number: