Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder

Yavuz, Ahmet Sinan and Sezerman, Uğur (2014) Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder. BMC Genomics, 15 (Supple). ISSN 1471-2164 (Print) 1471-2164 (Online)

Full text: BMCgenomics.pdf (PDF, 647kB). Listed in DOAJ as an open access journal.

Abstract

Background: Sumoylation, a reversible and dynamic post-translational modification, is one of the vital processes in a cell. Before a protein matures to perform its function, sumoylation may alter its localization, interactions, and possibly structural conformation. Aberrations in protein sumoylation have been linked with a variety of disorders and developmental anomalies. Experimental identification of sumoylation sites may be ineffective because sumoylation is dynamic and the experiments are laborious and costly. Computational approaches may therefore guide experimental identification of sumoylation sites and provide insight into the sumoylation mechanism.

Results: In this paper, the effectiveness of various sequence properties in predicting sumoylation sites was investigated with statistical analyses and a machine learning approach employing support vector machines. These sequence properties were derived from windows of size 7 and include position-specific amino acid composition, hydrophobicity, estimated sub-window volumes, predicted disorder, and conformational flexibility. 5-fold cross-validation on experimentally identified sumoylation sites showed that our method predicts sumoylation sites with a Matthews correlation coefficient, sensitivity, specificity, and accuracy of 0.66, 73%, 98%, and 97%, respectively. Additionally, we showed that our method compares favorably to existing prediction methods and to a basic regular expression scanner.

Conclusions: Using support vector machines, a new, robust method for sumoylation site prediction was introduced. In addition, the possible effects of predicted conformational flexibility and disorder on sumoylation site recognition were explored computationally, for the first time to our knowledge, as additional parameters that could aid sumoylation site prediction.
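As a rough illustration of three ingredients named in the abstract (size-7 sequence windows, a regular expression scanner as a baseline, and the four reported evaluation metrics), a minimal Python sketch follows. It is not the authors' implementation. The abstract does not say how windows are positioned, so centering each window on a candidate lysine and padding with "X" are assumptions. The pattern used for the scanner is the commonly cited consensus motif (hydrophobic residue, K, any residue, D or E), and the set of residues treated as hydrophobic is likewise an assumption. The function names are hypothetical.

```python
import math
import re

WINDOW = 7          # window size stated in the abstract
HALF = WINDOW // 2  # residues on each side of the central position

# Assumption: consensus motif psi-K-x-[DE]; the hydrophobic set is illustrative.
MOTIF = re.compile(r"(?<=[AVLIMFP])K(?=.[DE])")


def lysine_windows(seq, pad="X"):
    """Yield (index, window) for every lysine, with the window centered on it.

    Centering and padding are assumptions; the abstract only gives the size.
    """
    padded = pad * HALF + seq + pad * HALF
    for i, residue in enumerate(seq):
        if residue == "K":
            yield i, padded[i:i + WINDOW]


def motif_sites(seq):
    """Return 0-based indices of lysines matching the assumed consensus motif."""
    return [m.start() for m in MOTIF.finditer(seq)]


def evaluate(tp, fp, tn, fn):
    """Compute sensitivity, specificity, accuracy and MCC from confusion counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


if __name__ == "__main__":
    seq = "AVKQEGGLKTD"  # made-up sequence, not taken from the paper
    print(dict(lysine_windows(seq)))  # {2: 'XAVKQEG', 8: 'GGLKTDX'}
    print(motif_sites(seq))           # [2, 8]
    print(evaluate(tp=8, fp=2, tn=88, fn=2))
```

In a full pipeline, each window would be converted into numeric features (for example, position-specific composition or hydrophobicity) before being passed to a support vector machine. The counts given to `evaluate` here are made up and are unrelated to the results reported in the abstract.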
Item Type: Article
Additional Information: Article Number: S18
Uncontrolled Keywords: Sumoylation; SUMO; machine learning; support vector machines; post-translational modification
Subjects: Q Science > Q Science (General)
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Biological Sciences & Bio Eng.
Faculty of Engineering and Natural Sciences > Basic Sciences > Physics
Faculty of Engineering and Natural Sciences
Depositing User: Uğur Sezerman
Date Deposited: 12 Dec 2014 21:22
Last Modified: 26 Apr 2022 09:20
URI: https://research.sabanciuniv.edu/id/eprint/26224
