[1]徐丽娜,李 燕,钟昕妤,等.基于Bert的中医方剂文本命名实体识别[J].医学信息,2023,36(04):32-37.[doi:10.3969/j.issn.1006-1959.2023.04.006]
 XU Li-na,LI Yan,ZHONG Xin-yu,et al.Named Entity Recognition of Traditional Chinese Medicine Prescription Based on Bert[J].Journal of Medical Information,2023,36(04):32-37.[doi:10.3969/j.issn.1006-1959.2023.04.006]
点击复制

基于Bert的中医方剂文本命名实体识别()

医学信息[ISSN:1006-1959/CN:61-1278/R]

卷:
36卷
期数:
2023年04期
页码:
32-37
栏目:
中医药信息学
出版日期:
2023-02-15

文章信息/Info

Title:
Named Entity Recognition of Traditional Chinese Medicine Prescription Based on Bert
文章编号:
1006-1959(2023)04-0032-06
作者:
徐丽娜李 燕钟昕妤
(甘肃中医药大学信息工程学院,甘肃 兰州 730000)
Author(s):
XU Li-naLI YanZHONG Xin-yuet al.
(Information Engineering Institution of Gansu University of Chinese Medicine,Lanzhou 730000,Gansu,China)
关键词:
深度学习中医方剂命名实体识别模型
Keywords:
Deep learningTraditional Chinese medicine prescriptionsNamed entity recognition models
分类号:
TP391.1;R289
DOI:
10.3969/j.issn.1006-1959.2023.04.006
文献标志码:
A
摘要:
针对中医药领域常用命名实体识别模型存在的边界模糊和歧义性等问题,本文提出基于大规模预处理中文语言模型(Bert)的中医方剂文本命名实体识别方法。通过Bert预训练模型接受其相对应的词向量,将预处理完成的词向量输入到长短期记忆(Bi-LSTM)模块中,完成对文本上下文语义信息的捕获,最后使用条件随机场(CRF)模块解码输出得到的预测标签排序,依次检索和排序各类中医方剂文本实体,从而完成整个实体识别步骤,结果显示出Bert对中医方剂各类实体识别具有较高的适用性,中医方剂各类实体识别的准确率得到显著提升。
Abstract:
Aiming at the boundary ambiguity and ambiguity of named entity recognition models commonly used in the field of traditional Chinese medicine, a named entity recognition method of TCM prescription text based on large-scale preprocessed Chinese language model (Bert) is proposed. The corresponding word vector is accepted by the Bert pre-training model, and the preprocessed word vector is input into the long-term and short-term memory (Bi-LSTM) module to capture the semantic information of the text context. Finally, the conditional random field (CRF) module is used to sort the predictive tags obtained from the output, and the text entities of all kinds of TCM prescriptions are retrieved and sorted in turn, so as to complete the whole entity recognition step. The results show that Bert has high applicability to all kinds of entity recognition of traditional Chinese medicine prescription, and the accuracy of entity recognition of traditional Chinese medicine prescription has been significantly improved.

参考文献/References:

[1]王丽娜,胡建鹏,范婧婧,等.方剂学理论体系形成与发展[J].中医药临床杂志,2017,29(12):2044-2047.[2]李振岳.中药方剂数据挖掘研究[D].广州:广东药学院,2010.[3]田瑾.基于复杂网络及关联规则的失眠用药中医临床数据挖掘研究[D].北京:北京中医药大学,2015.[4]周荣荣,闫润红,王丽萍,等.数据挖掘技术在中医方剂科学问题研究中的应用[J].中华中医药杂志,2018,33(9):4016-4020.[5]杨璐妍,聂付敏,金维捷,等.基于数据挖掘《中医方剂大辞典》治疗乳痈用药规律探讨[J].云南中医中药杂志,2022,43(2):24-28.[6]贾杨春,朱定局.基于深度学习的医疗命名实体识别[J].计算机系统应用,2022,31(9):70-81.[7]黄敏婷,赵静,于涛.基于医学大数据的预训练语言模型及其医学文本分类研究[J].中华医学图书情报杂志,2020,29(11):39-46.[8]Prottasha NJ,Sami AA,Kowsher M,et al.Transfer Learning for Sentiment Analysis Using BERT Based Supervised Fine-Tuning[J].Sensors (Basel),2022,22(11):4157. [9]Ronran C,Lee S.Effect of Character and Word Features in Bidirectional LSTM-CRF for NER[C]//2020 IEEE International Conference on Big Data and Smart Computing (BigComp).IEEE,2020.[10]刘苏文,邵一帆,钱龙华.基于联合学习的生物医学因果关系抽取[J].中文信息学报,2020,34(4):60-68.[11]肖瑞,胡冯菊,裴卫.基于BiLSTM-CRF的中医文本命名实体识别[J].世界科学技术-中医药现代化,2020,22(7):2504-2510.[12]张立.基于综合性临床诊断数据分析系统的医院导诊应用[J].医学信息学杂志,2020,41(12):68-72.[13]都丽婷,夏晨曦,赵冬,等.基于条件随机域的临床文本去识别研究[J].中国卫生信息管理杂志,2017,14(2):217-222.[14]屈倩倩,阚红星.基于Bert-BiLSTM-CRF的中医文本命名实体识别[J].电子设计工程,2021,29(19):40-43,48. [15]陈锦,常致全,许军.基于HMM的生物医学命名实体的识别与分类[J].计算机时代,2006(10):40-42.[16]李彦鹏,杨志豪,林鸿飞.基于条件随机域的生物医学命名实体识别[C]//第三届学生计算语言学研讨会论文集,2006.[17]王浩畅,赵铁军,刘延力,等.生物医学文本中命名实体识别的智能化方法[C]//2006年首届ICT大会信息、知识、智能及其转换理论第一次高峰论坛会议论文集,2006.[18]Crichton G,Pyysalo S,Chiu B,et al.A neural network multi-task learning approach to biomedical named entity recognition[J].BMC Bioinformatics,2017,18(1):368.[19]于尤婧.面向可解释性双向编码语言模型的文本分类研究[D].长春:吉林大学,2020.

相似文献/References:

[1]姜荣荣,高佳奕,杨 涛.基于司外揣内原理的中医“舌象-体质”深度学习模型构建研究[J].医学信息,2022,35(18):10.[doi:10.3969/j.issn.1006-1959.2022.18.003]
 JIANG Rong-rong,GAO Jia-yi,YANG Tao.Research on the Construction of Deep Learning Model of “Tongue Manifestation-Physical Constitution” in Traditional Chinese Medicine Based on the Principle of “Governing Exterior to Infer Interior”[J].Journal of Medical Information,2022,35(04):10.[doi:10.3969/j.issn.1006-1959.2022.18.003]
[2]富 坤,李佳宁.LncRNA-疾病关联预测方法研究[J].医学信息,2023,36(04):166.[doi:10.3969/j.issn.1006-1959.2023.04.036]
 FU Kun,LI Jia-ning.Research on LncRNA-disease Association Prediction Method[J].Journal of Medical Information,2023,36(04):166.[doi:10.3969/j.issn.1006-1959.2023.04.036]
[3]曾 晨,孔俊沣,钟 雯,等.深度学习骨龄智能评估系统在重庆三峡库区的初步验证[J].医学信息,2022,35(07):69.[doi:10.3969/j.issn.1006-1959.2022.07.017]
 ZENG Chen,KONG Jun-feng,ZHONG Wen,et al.Preliminary Verification of Deep Learning Bone Age Intelligent Assessment System in Chongqing Three Gorges Reservoir[J].Journal of Medical Information,2022,35(04):69.[doi:10.3969/j.issn.1006-1959.2022.07.017]
[4]段永飞,高继学,倪建鑫,等.基于人工智能学习模型在结节性硬化症相关肾血管平滑肌脂肪瘤诊疗中的研究[J].医学信息,2023,36(11):171.[doi:10.3969/j.issn.1006-1959.2023.11.037]
 DUAN Yong-fei,GAO Ji-xue,NI Jian-xin,et al.Research on the Diagnosis and Treatment of Tuberous Sclerosis Complex-associated Renal Angiomyolipoma Based on Artificial Intelligence Learning Model[J].Journal of Medical Information,2023,36(04):171.[doi:10.3969/j.issn.1006-1959.2023.11.037]
[5]黄辛迪,黄 慧,刘佳俊,等.决策树及支持向量机与深度学习模型在肝癌鉴别诊断中的比较研究[J].医学信息,2023,36(15):70.[doi:10.3969/j.issn.1006-1959.2023.15.012]
 HUANG Xin-di,HUANG Hui,LIU Jia-jun,et al.Comparative Study of Decision Tree, Support Vector Machine and Deep Learning Model in Differential Diagnosis of Liver Cancer[J].Journal of Medical Information,2023,36(04):70.[doi:10.3969/j.issn.1006-1959.2023.15.012]
[6]张亚洲,李智伟,农卫霞,等.基于深度学习的急性白血病流式细胞术检测报告文本资料自动分类研究[J].医学信息,2025,38(08):15.[doi:10.3969/j.issn.1006-1959.2025.08.003]
 ZHANG Yazhou,LI Zhiwei,NONG Weixia,et al.Automatic Classification of Text Data for Flow Cytometry Detection of Acute Leukemia Based on Deep Learning[J].Journal of Medical Information,2025,38(04):15.[doi:10.3969/j.issn.1006-1959.2025.08.003]
[7]李春林,周根鸿,谭 沁,等.基于卷积神经网络深度学习的病理图像自动诊断研究[J].医学信息,2025,38(18):7.[doi:10.3969/j.issn.1006-1959.2025.18.002]
 LI Chunlin,ZHOU Genhong,TAN Qin,et al.Research on Automatic Diagnosis of Pathological Images Basedon Convolutional Neural Network and Deep Learning[J].Journal of Medical Information,2025,38(04):7.[doi:10.3969/j.issn.1006-1959.2025.18.002]
[8]陈睿鹏,高 亚,刘崇梅.构建多组学融合模型预测ⅠA期肺腺癌术后复发转移风险并探究肿瘤发生发展机制[J].医学信息,2026,39(02):12.[doi:10.3969/j.issn.1006-1959.2026.02.003]
 CHEN Ruipeng,GAO Ya,LIU Chongmei.Construct a Multi-omics Fusion Model to Predict the Risk of Postoperative Recurrenceand Metastasis of Stage ⅠA Lung Adenocarcinoma and Explore its Mechanism ofTumor Development and Progression[J].Journal of Medical Information,2026,39(04):12.[doi:10.3969/j.issn.1006-1959.2026.02.003]

更新日期/Last Update: 1900-01-01