A pathway graph kernel based multi-omics approach for patient clustering

Tepeli, Yasin İlkağan (2020) A pathway graph kernel based multi-omics approach for patient clustering. [Thesis]

[thumbnail of 10352109_Tepeli_Yasin_İlkagan.pdf] PDF

Download (5MB)


Accurate classification of patients into molecular subgroups is critical for the development of effective therapeutics and for deciphering the underlining mechanisms for these subgroups. The availability of multi-omics data catalogs for large cohorts of cancer patients provides multiple views into the molecular biology of the tumors and the alterations that take place in patient genes such as mutations and differential expression patterns. At the same time, the molecular interaction networks provide the biological context for these alterations. We develop PAMOGK (Pathway based Multi Omic Graph Kernel clustering framework) that integrates multi-omics patient data with existing biological knowledge on pathways. We use a novel graph kernel that evaluates patient similarities based on a single molecular alteration type in the context of a pathway. To corroborate multiple views of patients that are evaluated by hundreds of pathways and molecular alteration combinations, we use a multi-view kernel clustering approach. Applying PAMOGK to kidney renal clear cell carcinoma (KIRC) patients results in four clusters with significantly different survival times (p-value = 1.24e-11). When we compare PAMOGK to eight other state-of-the-art multi-omics clustering methods, PAMOGK consistently outperforms these in terms of its ability to partition KIRC patients into groups with different survival distributions. The discovered patient subgroups also differ with respect to other clinical parameters such as tumor stage and grade, and primary tumor and metastasis tumor spreads. The pathways identified as important are highly relevant to KIRC. We also extend our analysis to eight other cancer types with available mutation, protein and gene expression data. PAMOGK framework is available in github.com/tastanlab/pamogk
Item Type: Thesis
Uncontrolled Keywords: Cancer, Multi-view clustering. -- Kernel methods. -- Graph kernels. -- Pathways. -- Multi-omics data. -- Kanser, Çoklu-bakıs kümeleme. -- Çekirdek metodları. -- Çizge çekirdegi. -- Yolaklar. -- çoklu-omik verisi.
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800-8360 Electronics > TK7885-7895 Computer engineering. Computer hardware
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.
Faculty of Engineering and Natural Sciences
Depositing User: IC-Cataloging
Date Deposited: 03 Nov 2020 09:47
Last Modified: 26 Apr 2022 10:34
URI: https://research.sabanciuniv.edu/id/eprint/41212

Actions (login required)

View Item
View Item