Preprint | Article | Version 1 | Preserved in Portico | This version is not peer-reviewed
LMM-22: An Enhanced Linear Mixed Model (LMM) Approach for Genome-Wide Association Studies (GWAS) for the Prediction of Diseases and Traits among Humans from Genomics Data
Version 1: Received: 8 May 2020 / Approved: 9 May 2020 / Online: 9 May 2020 (08:20:12 CEST)
How to cite:
Sharma, S. LMM-22: An Enhanced Linear Mixed Model (LMM) Approach for Genome-Wide Association Studies (GWAS) for the Prediction of Diseases and Traits among Humans from Genomics Data. Preprints 2020, 2020050154. https://doi.org/10.20944/preprints202005.0154.v1
APA Style
Sharma, S. (2020). LMM-22: An Enhanced Linear Mixed Model (LMM) Approach for Genome-Wide Association Studies (GWAS) for the Prediction of Diseases and Traits among Humans from Genomics Data. Preprints. https://doi.org/10.20944/preprints202005.0154.v1
Chicago/Turabian Style
Sharma, S. 2020 "LMM-22: An Enhanced Linear Mixed Model (LMM) Approach for Genome-Wide Association Studies (GWAS) for the Prediction of Diseases and Traits among Humans from Genomics Data" Preprints. https://doi.org/10.20944/preprints202005.0154.v1
Abstract
Genomics is increasingly used to predict specific traits and diseases (phenotypes) in humans. The wider availability of genomic data through research projects such as the International HapMap Project [1] and 1000 Genomes [2] has been a catalyst in that direction. With recent advances in machine learning and big data analysis, the computational resources and models needed for genomic data analysis are readily available. However, the prediction of traits and diseases still faces challenges: heavy computational requirements, statistical issues (for example, confounding variables), and limited quality of the collected data. Linear Mixed Models (LMMs), a type of linear regression model, are a common approach in Genome-Wide Association Studies (GWAS) for predicting common human traits from genomic data. This paper surveys the existing LMM-based approaches for GWAS, describes an experiment performed with the FaST-LMM approach from Microsoft Research, and then proposes an enhanced approach, LMM-22, to address the computational and statistical issues. LMM-22 focuses on parallelizing the LMM computations and executing them on general-purpose graphics processing units (GPUs) rather than CPUs to accelerate the LMM approach for GWAS.
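To make the parallelization argument concrete, the following is a minimal NumPy sketch of a single-SNP LMM scan. It is not the LMM-22 implementation and not the FaST-LMM API: the function name `lmm_scan`, its parameters, and the choice to hold the variance ratio `delta` fixed (real tools estimate it by maximum likelihood) are assumptions made for illustration. It uses the standard spectral trick of eigendecomposing the kinship matrix once, after which every SNP is tested by an independent weighted least-squares fit.

```python
import numpy as np

def lmm_scan(y, snps, K, delta=1.0):
    """Hypothetical single-SNP LMM scan (illustrative sketch only).

    Model: y = mu + x*beta + g + e, with g ~ N(0, s2*K) and
    e ~ N(0, s2*delta*I). delta is held fixed here as a simplifying
    assumption; it is normally estimated from the data.
    Returns one Wald statistic (beta / standard error) per SNP.
    """
    n, m = snps.shape
    S, U = np.linalg.eigh(K)        # K = U diag(S) U^T, computed once
    w = 1.0 / (S + delta)           # per-sample weights after rotation
    y_rot = U.T @ y                 # rotate phenotype
    X_rot = U.T @ snps              # rotate all SNPs in one matrix product
    ones_rot = U.T @ np.ones(n)     # rotated intercept column

    stats = np.empty(m)
    for j in range(m):              # iterations are independent of each other
        A = np.column_stack([ones_rot, X_rot[:, j]])
        AtW = A.T * w               # A^T W with W = diag(w)
        cov = np.linalg.inv(AtW @ A)
        beta = cov @ (AtW @ y_rot)  # generalized least-squares estimate
        resid = y_rot - A @ beta
        sigma2 = (w * resid**2).sum() / (n - 2)
        stats[j] = beta[1] / np.sqrt(sigma2 * cov[1, 1])
    return stats
```

Because no loop iteration depends on another, the per-SNP fits (and the dense rotation `U.T @ snps`) are the kind of work that can be spread across many cores or moved to a GPU.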
Keywords
GWAS Studies; Linear Mixed Models; GPU Acceleration
Subject
Computer Science and Mathematics, Analysis
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.