Yılmaz, Serhan and Taştan, Öznur and Çiçek, Ercüment (2019) SPADIS: an algorithm for selecting predictive and diverse SNPs in GWAS. IEEE/ACM Transactions on Computational Biology and Bioinformatics . ISSN 1545-5963 (Print) 1557-9964 (Online) Published Online First http://dx.doi.org/10.1109/TCBB.2019.2935437
There is a more recent version of this item available.
Official URL: http://dx.doi.org/10.1109/TCBB.2019.2935437
Abstract
Phenotypic heritability of complex traits and diseases is seldom explained by individual genetic variants identified in genome-wide association studies (GWAS). Many methods have been developed to select a subset of variant loci, which are associated with or predictive of the phenotype. Selecting connected SNPs on SNP-SNP networks have been proven successful in finding biologically interpretable and predictive SNPs. However, we argue that the connectedness constraint favors selecting redundant features that affect similar biological processes and therefore does not necessarily yield better predictive performance. In this paper, we propose a novel method called SPADIS that favors the selection of remotely located SNPs in order to account for their complementary effects in explaining a phenotype. SPADIS selects a diverse set of loci on a SNP-SNP network. This is achieved by maximizing a submodular set function with a greedy algorithm that ensures a constant factor approximation to the optimal solution. We compare SPADIS to the state-of-the-art method SConES, on a dataset of Arabidopsis Thaliana with continuous flowering time phenotypes. SPADIS has better average phenotype prediction performance in 15 out of 17 phenotypes when the same number of SNPs are selected and provides consistent improvements. Moreover, it identifies more candidate genes and runs faster.
Item Type: | Article |
---|---|
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Biological Sciences & Bio Eng. Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng. Faculty of Engineering and Natural Sciences |
Depositing User: | Öznur Taştan |
Date Deposited: | 29 Aug 2019 09:50 |
Last Modified: | 26 Apr 2022 10:11 |
URI: | https://research.sabanciuniv.edu/id/eprint/39135 |
Available Versions of this Item
-
SPADIS: an algorithm for selecting predictive and diverse SNPs in GWAS. (deposited 31 Jul 2019 22:23)
- SPADIS: an algorithm for selecting predictive and diverse SNPs in GWAS. (deposited 29 Aug 2019 09:50) [Currently Displayed]