首页 | 本学科首页   官方微博 | 高级检索  
     


An ensemble-based likelihood ratio approach for family-based genomic risk prediction
Authors:Hui An  Chang-shuai Wei  Oliver Wang  Da-hui Wang  Liang-wen Xu  Qing Lu  Cheng-yin Ye
Affiliation:1.Department of Health Management, School of Medicine,Hangzhou Normal University,Hangzhou,China;2.Department of Biostatistics and Epidemiology,University of North Texas Health Science Center,Fort Worth,USA;3.HBI Solutions Inc,Palo Alto,USA;4.Department of Preventive Medicine, School of Medicine,Hangzhou Normal University,Hangzhou,China;5.Department of Epidemiology and Biostatistics, College of Human Medicine,Michigan State University,East Lansing,USA
Abstract:

Objective

As one of the most popular designs used in genetic research, family-based design has been well recognized for its advantages, such as robustness against population stratification and admixture. With vast amounts of genetic data collected from family-based studies, there is a great interest in studying the role of genetic markers from the aspect of risk prediction. This study aims to develop a new statistical approach for family-based risk prediction analysis with an improved prediction accuracy compared with existing methods based on family history.

Methods

In this study, we propose an ensemble-based likelihood ratio (ELR) approach, Fam-ELR, for family-based genomic risk prediction. Fam-ELR incorporates a clustered receiver operating characteristic (ROC) curve method to consider correlations among family samples, and uses a computationally efficient tree-assembling procedure for variable selection and model building.

Results

Through simulations, Fam-ELR shows its robustness in various underlying disease models and pedigree structures, and attains better performance than two existing family-based risk prediction methods. In a real-data application to a family-based genome-wide dataset of conduct disorder, Fam-ELR demonstrates its ability to integrate potential risk predictors and interactions into the model for improved accuracy, especially on a genome-wide level.

Conclusions

By comparing existing approaches, such as genetic risk-score approach, Fam-ELR has the capacity of incorporating genetic variants with small or moderate marginal effects and their interactions into an improved risk prediction model. Therefore, it is a robust and useful approach for high-dimensional family-based risk prediction, especially on complex disease with unknown or less known disease etiology.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号