Eryiğit, Gülşen and Nivre, Joakim and Oflazer, Kemal (2008) Dependency parsing of Turkish. Computational Linguistics, 34 (3). pp. 357-389. ISSN 0891-2017
This is the latest version of this item.
PDF (This is a RoMEO yellow publisher -- author can archive pre-print (ie pre-refereeing))
stvkaf01826.pdf
Download (430kB)
stvkaf01826.pdf
Download (430kB)
Official URL: http://dx.doi.org/10.1162/coli.2008.07-017-R1-06-83
Abstract
The suitability of different parsing methods for different languages is an important topic in syntactic parsing. Especially lesser-studied languages, typologically different from the languages for which methods have originally been developed, pose interesting challenges in this respect. This article presents an investigation of data-driven dependency parsing of Turkish, an agglutinative, free constituent order language that can be seen as the representative of a wider class of languages of similar type. Our investigations show that morphological structure plays an essential role in finding syntactic relations in such a language. In particular, we show that employing sublexical units called infectional groups, rather than word forms, as the basic parsing units improves parsing accuracy. We test our claim on two different parsing methods, one based on a probabilistic model with beam search and the other based on discriminative classifiers and a deterministic parsing strategy, and show that the usefulness of sublexical units holds regardless of the parsing method. We examine the impact of morphological and lexical information in detail and show that, properly used, this kind of information can improve parsing accuracy substantially. Applying the techniques presented in this article, we achieve the highest reported accuracy for parsing the Turkish Treebank.
Item Type: | Article |
---|---|
Subjects: | Q Science > QA Mathematics > QA075 Electronic computers. Computer science P Language and Literature > P Philology. Linguistics Q Science > QA Mathematics > QA076 Computer software |
Divisions: | Faculty of Engineering and Natural Sciences > Academic programs > Computer Science & Eng. Faculty of Engineering and Natural Sciences |
Depositing User: | Kemal Oflazer |
Date Deposited: | 01 Jul 2010 12:41 |
Last Modified: | 25 May 2011 14:21 |
URI: | https://research.sabanciuniv.edu/id/eprint/14098 |
Available Versions of this Item
-
Dependency parsing of Turkish. (deposited 19 Dec 2006 02:00)
-
Dependency parsing of Turkish. (deposited 25 Oct 2007 11:58)
- Dependency parsing of Turkish. (deposited 01 Jul 2010 12:41) [Currently Displayed]
-
Dependency parsing of Turkish. (deposited 25 Oct 2007 11:58)