东北大学学报(自然科学版) ›› 2026, Vol. 47 ›› Issue (1): 89-98.DOI: 10.12068/j.issn.1005-3026.2026.20240234

• 信息与控制 • 上一篇    下一篇

航空事故领域的知识抽取方法研究与实现

刘军1(), 曹悦1, 刘向军2, 王宏艳1   

  1. 1.东北大学 计算机科学与工程学院,辽宁 沈阳 110169
    2.中软信息系统工程有限公司,北京 100081
  • 收稿日期:2024-12-24 出版日期:2026-01-15 发布日期:2026-03-17
  • 通讯作者: 刘军
  • 作者简介:刘 军(1969—),男,辽宁沈阳人,东北大学教授,博士生导师.
  • 基金资助:
    国家自然科学基金青年基金资助项目(62501133)

Research and Implementation of Knowledge Extraction in Aviation Accident Domain

Jun LIU1(), Yue CAO1, Xiang-jun LIU2, Hong-yan WANG1   

  1. 1.School of Computer Science & Engineering,Northeastern University,Shenyang 110169,China
    2.China Software Information System Engineering Co. Ltd. ,Beijing 100081,China. Corresponding author: LIU Jun,E-mail: liujun@cse. neu. edu. cn
  • Received:2024-12-24 Online:2026-01-15 Published:2026-03-17
  • Contact: Jun LIU

摘要:

随着航空运输业与信息技术的快速发展,航空应急管理给海量、异构的航空安全数据高效利用带来了挑战.本文针对航空事故知识图谱的知识抽取问题,即命名实体识别与关系抽取问题,提出以下方法:1) 提出基于BERT(bidirectional encoder representations from Transformers)的改进BiGRU-IDCNN-CRF模型,实现94.69%的命名实体识别精确率;2) 构建基于强化学习的聚类远程监督关系抽取模型,结合改进K均值聚类与远程监督标注降低数据噪声,并通过强化学习优化去噪过程,最终结合分段卷积神经网络(PCNN)与注意力机制,实现84.16%的关系抽取精确率.实验结果表明,本文方法有效提升了航空事故知识图谱的信息提取质量,为航空安全管理提供了精准的信息支撑.

关键词: 航空事故, 知识抽取, 命名实体识别, 关系抽取, 远程监督, 强化学习

Abstract:

In light of the rapid development of air transportation and information technology, the efficient utilization of massive and heterogeneous aviation safety data in aviation emergency management faces challenges. The problem of knowledge extraction for an aviation accident knowledge graph was studied, specifically named entity recognition and relation extraction, and the following methods were proposed: 1) An improved BiGRU-IDCNN-CRF model based on bidirectional encoder representations from Transformers (BERT) was presented, achieving a named entity recognition accuracy of 94.69%; 2) A reinforcement learning-based clustering distant supervision relation extraction model was constructed, in which data noise was reduced by integrating improved K-means clustering with distant supervision labeling, and the denoising process was optimized via reinforcement learning; a combination of piecewise convolutional neural network (PCNN) and an attention mechanism was applied to achieve a relation extraction accuracy of 84.16%. Experimental results indicate that the quality of information extraction for the aviation accident knowledge graph is effectively improved, providing accurate information support for aviation safety management.

Key words: aviation accident, knowledge extraction, named entity recognition, relation extraction, distant supervision, reinforcement learning

中图分类号: