Statistical morphological disambiguation with application to disambiguation of pronunciations in Turkish /

Warning The system is temporarily closed to updates for reporting purpose.

Külekci, Oğuzhan M. (2006) Statistical morphological disambiguation with application to disambiguation of pronunciations in Turkish /. [Thesis]

[thumbnail of kulekcimuhammedoguzhan.pdf] PDF
kulekcimuhammedoguzhan.pdf

Download (471kB)

Abstract

The statistical morphological disambiguation of agglutinative languages suffers from data sparseness. In this study, we introduce the notion of distinguishing tag sets (DTS) to overcome the problem. The morphological analyses of words are modeled with DTS and the root major part-of-speech tags. The disambiguator based on the introduced representations performs the statistical morphological disambiguation of Turkish with a recall of as high as 95.69 percent. In text-to-speech systems and in developing transcriptions for acoustic speech data, the problem occurs in disambiguating the pronunciation of a token in context, so that the correct pronunciation can be produced or the transcription uses the correct set of phonemes. We apply the morphological disambiguator to this problem of pronunciation disambiguation and achieve 99.54 percent recall with 97.95 percent precision. Most text-to-speech systems perform phrase level accentuation based on content word/function word distinction. This approach seems easy and adequate for some right headed languages such as English but is not suitable for languages such as Turkish. We then use a a heuristic approach to mark up the phrase boundaries based on dependency parsing on a basis of phrase level accentuation for Turkish TTS synthesizers.
Item Type: Thesis
Uncontrolled Keywords: Statistical morphological disambiguation -- Pronunciation disambiguation -- Turkish phrase boundary detection -- Natural language processing in text-to-speech synthesis
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng.
Faculty of Engineering and Natural Sciences
Depositing User: IC-Cataloging
Date Deposited: 14 Apr 2008 15:47
Last Modified: 26 Apr 2022 09:47
URI: https://research.sabanciuniv.edu/id/eprint/8379

Actions (login required)

View Item
View Item