东北大学学报(自然科学版) ›› 2026, Vol. 47 ›› Issue (1): 31-41.DOI: 10.12068/j.issn.1005-3026.2026.20259019

• 智慧医疗专栏 • 上一篇    下一篇

一种基于电子健康记录的多尺度图表示学习模型

樊捷杰1, 班晓娟1(), 张志研2   

  1. 1.北京科技大学 人工智能学院,北京 100083
    2.北京科技大学 钢铁技术协同创新中心,北京 100083
  • 收稿日期:2025-09-14 出版日期:2026-01-15 发布日期:2026-03-17
  • 通讯作者: 班晓娟
  • 作者简介:樊捷杰(1985—),男,江西余干人,北京科技大学博士研究生.
  • 基金资助:
    国家自然科学基金资助项目(62106020);国家自然科学基金资助项目(62332017)

A Multi-scale Graph Representation Learning Model Based on Electronic Health Records

Jie-jie FAN1, Xiao-juan BAN1(), Zhi-yan ZHANG2   

  1. 1.School of Intelligence Science and Technology,University of Science and Technology Beijing,Beijing 100083,China
    2.Collaborative Innovation Center of Steel Technology,University of Science and Technology Beijing,Beijing 100083,China. cn
  • Received:2025-09-14 Online:2026-01-15 Published:2026-03-17
  • Contact: Xiao-juan BAN

摘要:

现有的电子健康记录(electronic health records, EHR)的图表示学习方法多依赖单个患者的局部信息,忽视了群体患者在疾病演化和诊疗路径上的潜在关联,从而限制了模型的泛化性与鲁棒性.针对这一问题,本文提出一种混合多层级图神经网络(hybrid multi-level graph neural network, H-MGNN)模型,并将其应用于重症监护室(intensive care unit, ICU)患者的死亡预测.该模型通过构建宏观层面的患者关系图(patient-patient graph, P-P)、微观层面的分类-笔记-词汇超图(taxonomy-note-word hypergraph, T-N-W),结合超图的时序依赖关系,实现多尺度上的患者特征融合.同时,本文设计了融合算法(hybrid embedding, Hybrid-E),用于提取和整合患者嵌入的潜在特征,以提升预测准确性.实验结果表明,H-MGNN在MIMIC-Ⅲ(medical information mart for intensive care Ⅲ)数据集上的住院死亡率预测等任务中显著优于现有方法,验证了其在复杂EHR数据挖掘中的有效性和先进性.

关键词: 电子健康记录, 多尺度, 超图, 图神经网络

Abstract:

Existing graph representation learning methods for electronic health records (EHR) primarily rely on local information of a single patient, overlooking potential associations among patients in disease progression and treatment pathways. This limits the models’ generalizability and robustness. To address this issue, a hybrid multi-level graph neural network (H-MGNN) model was proposed, and it was applied to mortality prediction for intensive care unit (ICU) patients. The model constructed a patient-patient graph (P-P) at the macroscopic level and a taxonomy-note-word hypergraph (T-N-W) at the microscopic level, while incorporating temporal dependencies within the hypergraph to achieve multi-scale fusion of patient features. Meanwhile, a hybrid embedding (Hybrid-E) algorithm was designed to extract and integrate latent patient features and improve the prediction accuracy. Experimental results demonstrate that H-MGNN on the medical information mart for intensive care Ⅲ (MIMIC-Ⅲ) dataset significantly outperforms existing methods in terms of in-hospital mortality prediction and other tasks, validating its effectiveness and superiority in complex EHR data mining.

Key words: electronic health record, multi-scale, hypergraph, graph neural network

中图分类号: