首页 | 本学科首页   官方微博 | 高级检索  
     检索      

阿尔茨海默病基因-疾病关联的知识挖掘
引用本文:王雪,武俊伟,陈观群,李燕琼,马路.阿尔茨海默病基因-疾病关联的知识挖掘[J].图书情报工作,2020,64(13):120-132.
作者姓名:王雪  武俊伟  陈观群  李燕琼  马路
作者单位:1.首都医科大学医学人文学院 北京 100069;2.首都医科大学宣武医院图书馆 北京 100053;3.中国人民解放军总医院医学信息室 北京 100853;4.首都医科大学宣武医院神经内科 北京 100053
基金项目:本文系首都医科大学宣武医院院级管理课题"基于科技影响力排行的医院重点学科影响力分析"(项目编号:XWGL-2019003)和首都医科大学宣武医院院级教学课题"基于元素养理论的医学生信息素养教学路径研究"(项目编号:2019XWJXGG-10)研究成果之一。
摘    要:目的/意义] 对阿尔茨海默病(AD)进行基因-疾病关联挖掘,以捕捉潜力研究方向。方法/过程] 基于LBD理论构建开放式知识发现架构,结合MeSH词表、DisGeNET等医学术语、组学数据对PubMed中AD文献进行知识挖掘,采用关联规则与算法排序等方法对部分基因重合的强关联主题共现疾病和优先候选基因进行筛选,结合时间切片和其他LBD工具对比加以验证。结果/结论] 对88 334篇AD文献进行基因-疾病识别,并与2 120种AD基因进行匹配;以XYZ分析视角对识别出的992种主题共现疾病及11 899种候选基因进行关联排序;精炼10种强关联疾病与25种优选候选基因,结合文献报道加以论述。通过LBD挖掘目标疾病-共现疾病-基因之间潜在关联,可快速捕捉潜力研究方向,缩小基因测序范围,为新研究假设的生成提供重要指导依据。

关 键 词:知识发现  基因组学  阿尔茨海默病  实体识别  数据挖掘  排序算法  时间分析  
收稿时间:2020-01-03
修稿时间:2020-02-28

Knowledge Mining of Alzheimer's Disease Gene-Disease Associations
Wang Xue,Wu Junwei,Chen Guanqun,Li Yanqiong,Ma Lu.Knowledge Mining of Alzheimer's Disease Gene-Disease Associations[J].Library and Information Service,2020,64(13):120-132.
Authors:Wang Xue  Wu Junwei  Chen Guanqun  Li Yanqiong  Ma Lu
Institution:1.Medical Humanities School, Capital Medical University, Beijing 100069;2.Department of Library, Xuanwu Hospital, Capital Medical University, Beijing 100053;3.Medical Information Section, Chinese PLA General Hospital, Beijing 100853;4.Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing 100053
Abstract:Purpose/significance] To explore the gene-disease association of Alzheimer's disease (AD) in order to capture the potential research directions.Method/process] An open knowledge discovery framework was constructed based on LBD theory. Combined with MeSH thesaurus, DisGeNET and other medical terms and group data, knowledge mining was carried out in AD literatures in PubMed. Association rules and algorithm sorting were used to screen strongly associated MeSH terms co-occurrence diseases and priority candidate genes for partial gene coincidence, results of time slicing and comparison with other LBD tools were used to verify them.Result/conclusion] 88 334 AD literatures were identified and matched with 2 120 AD genes, 11 899 candidate genes and 992 comorbidity genes were identified according to XYZ analysis, 10 strongly associated co-occurrence diseases and 25 preferred candidate genes were refined and discussed in combination with literature reports. Mining the potential associations between target disease, co-occurrence diseases and genes by LBD can quickly capture the potential research directions, narrow the scopes of gene sequencing, and provide important guidance for the generations of new research hypotheses.
Keywords:literature based discovery  genomics  Alzheimer's disease  entity recognition  data mining  sorting algorithm  time analysis  
点击此处可从《图书情报工作》浏览原始摘要信息
点击此处可从《图书情报工作》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号