Generating explanations for complex biomedical queries

Öztok, Umut (2012) Generating explanations for complex biomedical queries. [Thesis]

[thumbnail of UmutOztok_440088.pdf] PDF
UmutOztok_440088.pdf

Download (1MB)

Abstract

Recent advances in health and life sciences have led to generation of a large amount of biomedical data. To facilitate access to its desired parts, such a big mass of data has been represented in structured forms, like databases and ontologies. On the other hand, representing these databases and ontologies in different formats, constructing them independently from each other, and storing them at different locations have brought about many challenges for answering queries about the knowledge represented in these ontologies and databases. One of the challenges for the users is to be able to represent such a biomedical query in a natural language, and get its answers in an understandable form. Another challenge is to extract relevant knowledge from different knowledge resources, and integrate them appropriately using also definitions, such as, chains of gene-gene interactions, cliques of genes based on gene-gene relations, or similarity/diversity of genes/drugs. Furthermore, once an answer is found for a complex query, the experts may need further explanations about the answer. The first two challenges have been addressed earlier using Answer Set Programming (ASP), with the development of a software system (called BIOQUERY-ASP). This thesis addresses the third challenge: explanation generation in ASP. In this thesis, we extend the earlier work on the first two challenges, to new forms of biomedical queries (e.g., about drug similarity) and to new biomedical knowledge resources. We introduce novel mathematical models and algorithms to generate (shortest or k different) explanations for queries in ASP, and provide a comprehensive theoretical analysis of these methods. We implement these algorithms and integrate them in BIOQUERY-ASP, and provide an experimental evaluation of our methods with some complex biomedical queries over the biomedical knowledge resources PHARMGKB, DRUGBANK, BIOGRID, CTD, SIDER, DISEASEONTOLOGY and ORPHADATA.
Item Type: Thesis
Uncontrolled Keywords: Answer set programming. -- Biomedical query answering. -- Explanation generation. -- Çözüm kümesi programlama. -- Sorgu cevaplama. -- Açıklama üretme.
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800-8360 Electronics > TK7885-7895 Computer engineering. Computer hardware
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.
Faculty of Engineering and Natural Sciences
Depositing User: IC-Cataloging
Date Deposited: 17 Jul 2018 09:45
Last Modified: 26 Apr 2022 10:24
URI: https://research.sabanciuniv.edu/id/eprint/35126

Actions (login required)

View Item
View Item