PAMOGK: A pathway graph kernel based multi-omics clustering approach for discovering cancer patient subgroups

Tepeli, Yasin İlkağan and Ünal, Ali Burak and Akdemir, Mustafa Furkan and Taştan, Öznur (2020) PAMOGK: A pathway graph kernel based multi-omics clustering approach for discovering cancer patient subgroups. In: Satellite Workshop on Computational Cancer Biology (RECOMB-CCB) - Jointly with RECOMB 2020, Virtual

[thumbnail of PAMOGK_RECOMBCCB.pdf] PDF
PAMOGK_RECOMBCCB.pdf

Download (750kB)

Abstract

Accurate classification of patients into homogeneous molecular subgroups is critical for the developmentof effective therapeutics and for deciphering what drives these different subtypes to cancer. However, the extensivemolecular heterogeneity observed among cancer patients presents a challenge. The availability of multi-omic datacatalogs for large cohorts of cancer patients provides multiple views into the molecular biology of the tumorswith unprecedented resolution. In this work, we develop PAMOGK, which integrates multi-omics patient data andincorporates the existing knowledge on biological pathways. PAMOGK is well suited to deal with the sparsity ofalterations in assessing patient similarities. We develop a novel graph kernel which we denote as smoothed shortestpath graph kernel, which evaluates patient similarities based on a single molecular alteration type in the contextof pathway. To corroborate multiple views of patients evaluated by hundreds of pathways and molecular alterationcombinations, PAMOGK uses multi-view kernel clustering. We apply PAMOGK to find subgroups of kidney renalclear cell carcinoma (KIRC) patients, which results in four clusters with significantly different survival times (p-value =7.4e-10). The patient subgroups also differ with respect to other clinical parameters such as tumor stage andgrade, and primary tumor and metastasis tumor spreads. When we compare PAMOGK to 8 other state-of-the-artexisting multi-omics clustering methods, PAMOGK consistently outperforms these in terms of its ability to partitionpatients into groups with different survival distributions. PAMOGK enables extracting the relative importance ofpathways and molecular data types. PAMOGK is available at github.com/tastanlab/pamogk
Item Type: Papers in Conference Proceedings
Additional Information: The paper is 10 page and peer-reviewed. Only the selected papers are invited for a talk. Our work was selected as a talk. There is no proceedings since we opt out of the option for the journal publication (it was under review in another journal). In place of a proceeding we were required to put the work on a preprint server. The work was under revision in a journal at the time. The journal submission and the conference submission versions are different. We improved the work in the journal.
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Biological Sciences & Bio Eng.
Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.
Center of Excellence in Data Analytics
Faculty of Engineering and Natural Sciences
Depositing User: Öznur Taştan
Date Deposited: 27 Sep 2020 10:04
Last Modified: 26 Apr 2022 09:37
URI: https://research.sabanciuniv.edu/id/eprint/40977

Actions (login required)

View Item
View Item