Discovering cross-cancer patients with a semi-supervised deep clustering approach

Ay, Duygu (2020) Discovering cross-cancer patients with a semi-supervised deep clustering approach. [Thesis]

[thumbnail of 10354209_Ay_Duygu.pdf] PDF

Download (10MB)


In traditional medicine, the treatment decisions for a cancer patient are typically based on the patient’s cancer type. The availability of molecular profiles for a large cohort of multiple cancer patients opens up possibilities to characterize patients at the molecular level. There have been reports of cases where patients with different cancers bear similarities. Motivated from these observations, in this thesis, we specifically focus on developing a method to discover cross-cancer patients. We define cross-cancer patients as those who have molecular profiles that bear a high level of similarity to other patient(s) diagnosed with a different cancer type and are not representative of their cancer type. To find cross-cancer similar patients, we develop a framework where we identify patients that co-cluster frequently when clustered based on their transcriptomic profiles. To solve the clustering problem, we propose a semi-supervised deep learning clustering in which the clustering task is guided by the cancer types of the patients and the survival times. The deep representation obtained in the network is used in the clustering module of DeepCrossCancer. Applying the method to nine different cancers from The Cancer Genome Atlas project using patient tumor gene expression data, we discover twenty patients similar to a patient or multiple patients in another cancer type. We analyze these patients in light of other genomic alterations. Our results find significant similarities both in mutation and copy number variations of the crosscancer patients. The detection of cross-cancer patients opens up possibilities for transferring clinical decisions from one patient to another and expediting the investigation of novel cancer drivers shared among them. The method is available at
Item Type: Thesis
Uncontrolled Keywords: Cancer. -- Deep learning. -- Semi-supervised clustering. -- Patient similarity. -- Kanser. -- Derin Ögrenme. -- Yarı Gözetimli Öbekleme. -- Hasta Benzerliği.
Subjects: T Technology > T Technology (General) > T055.4-60.8 Industrial engineering. Management engineering
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Industrial Engineering
Faculty of Engineering and Natural Sciences
Depositing User: IC-Cataloging
Date Deposited: 24 Oct 2020 17:48
Last Modified: 26 Apr 2022 10:34

Actions (login required)

View Item
View Item