Detecting motifs for computational classification of dockerin and cohesin sequences

Şahin, Ebru (2013) Detecting motifs for computational classification of dockerin and cohesin sequences. [Thesis]

[thumbnail of EbruSahin_460115.pdf] PDF
EbruSahin_460115.pdf

Download (1MB)

Abstract

Cellulose is the most abundant biopolymer in nature. It has several usage areas in industry. The initial hydrolysis of cellulose is the rate determining step in cellulose degradation. Cellulosomes are the complex structures composed of non-catalytic units and enzymes that take part in cellulose degradation. Cellulosomal units are attached via the interaction between cohesin and dockerin domains which are divided into three subclasses; type I, type II and type III. Development and rational design of novel cohesin and dockerin domains to enhance synergistic actions is very important research topic for biotechnological applications. In this aspect, accurate classification of the subunits and identification of key interaction sites are of great importance for design purposes. In this thesis, we propose a multiple sequence alignment and information theory based classification method that identifies potential key interaction sites. Based on the multiple sequence alignments, the residues that are conserved only in one subclass are determined as the motifs. Classification performance of these motifs is determined using a majority voting based normalized scoring scheme. In addition, reduced amino acid alphabets are introduced to capture the similarities that are invisible in 20-letter alphabet. In this work, we classify cohesin sequences with 99% accuracy, 96% sensitivity and 97% specificity, on average. For dockerin, the sequences are classified with up to 95% accuracy. 76% sensitivity and 92% specificity are observed on average. Potential interaction sites between cohesins and dockerins are determined from the correlated mutation analysis
Item Type: Thesis
Uncontrolled Keywords: Cellulosome. -- Dockerin classification. -- Cohesin classification. -- Motif Detection. -- Reduced amino acid alphabets. -- Correlated mutation. -- Selülozom. -- Dockerin sınıflandırılması. -- Kohezin sınıflandırılması. -- Motif tespiti. -- İndirgenmiş aminoasit alfabeleri. -- İlintili mutasyon.
Subjects: T Technology > TA Engineering (General). Civil engineering (General) > TA164 Bioengineering
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Biological Sciences & Bio Eng.
Faculty of Engineering and Natural Sciences
Depositing User: IC-Cataloging
Date Deposited: 09 Mar 2016 12:27
Last Modified: 26 Apr 2022 10:06
URI: https://research.sabanciuniv.edu/id/eprint/29198

Actions (login required)

View Item
View Item