Dependency parsing of Turkish

Eryiğit, Gülşen and Oflazer, Kemal and Nivre, Joakim (2007) Dependency parsing of Turkish. (Accepted/In Press)

Warning: There is a more recent version of this item available.


The suitability of different parsing methods for different languages is an important topic in syntactic parsing. Lesser-studied languages in particular, typologically different from the languages for which methods were originally developed, pose interesting challenges in this respect. This article presents an investigation of data-driven dependency parsing of Turkish, an agglutinative free constituent order language that can be seen as the representative of a wider class of languages of similar type. Our investigations show that morphological structure plays an essential role in finding syntactic relations in such a language. In particular, we show that employing sublexical representations called inflectional groups, rather than word forms, as the basic parsing units improves parsing accuracy. We compare two different parsing methods, one based on a probabilistic model with beam search, the other based on discriminative classifiers and a deterministic parsing strategy, and show that the usefulness of sublexical units holds regardless of parsing method. We examine the impact of morphological and lexical information in detail and show that, properly used, this kind of information can improve parsing accuracy substantially. Applying the techniques presented in this article, we achieve the highest reported accuracy for parsing the Turkish Treebank.
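
As a rough illustration of what using inflectional groups rather than word forms as parsing units could look like, the sketch below splits a morphological analysis string at derivation boundaries. The `^DB` boundary marker, the tag names, the example analyses, and the function name `split_inflectional_groups` are all assumptions made for illustration; none of them is given in the abstract.

```python
from typing import List

# Assumed marker for a derivation boundary in the analysis string.
DERIVATION_BOUNDARY = "^DB"


def split_inflectional_groups(analysis: str) -> List[str]:
    """Split one word's morphological analysis into inflectional groups.

    Each returned string is one sublexical unit that a parser could
    treat as a node instead of the whole word form.
    """
    parts = analysis.split(DERIVATION_BOUNDARY)
    # Groups after a boundary start with "+"; drop it for readability.
    return [part.lstrip("+") for part in parts]


# Hypothetical analysis of a derived word: a noun root followed by
# a derivation into an adjective.
derived = "araba+Noun+A3sg+Pnon+Nom^DB+Adj+With"
units = split_inflectional_groups(derived)

# A word with no derivation yields a single unit, so word-based and
# group-based parsing coincide for it.
simple = split_inflectional_groups("ev+Noun+A3sg+Pnon+Nom")
```

Under these assumptions, a dependency link would attach to one specific group of a derived word rather than to the word as a whole.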

Item Type: Article
Subjects: Q Science > QA Mathematics > QA075 Electronic computers. Computer science
P Language and Literature > P Philology. Linguistics
Q Science > QA Mathematics > QA076 Computer software
ID Code: 6392
Deposited By: Kemal Oflazer
Deposited On: 25 Oct 2007 11:58
Last Modified: 01 Jul 2010 12:41
