Eryiğit, Gülşen and Oflazer, Kemal and Nivre, Joakim (2007) Dependency parsing of Turkish. (Accepted/In Press)
There is a more recent version of this item available.
PDF
stvkaf01826.pdf
Download (430kB)
stvkaf01826.pdf
Download (430kB)
Abstract
The suitability of different parsing methods for different languages is an important topic in
syntactic parsing. Especially lesser-studied languages, typologically different from the languages
for which methods have originally been developed, poses interesting challenges in this respect.
This article presents an investigation of data-driven dependency parsing of Turkish, an agglutinative
free constituent order language that can be seen as the representative of a wider class
of languages of similar type. Our investigations show that morphological structure plays an
essential role in finding syntactic relations in such a language. In particular, we show that
employing sublexical representations called inflectional groups, rather than word forms, as the
basic parsing units improves parsing accuracy. We compare two different parsing methods, one
based on a probabilistic model with beam search, the other based on discriminative classifiers and
a deterministic parsing strategy, and show that the usefulness of sublexical units holds regardless
of parsing method.We examine the impact of morphological and lexical information in detail and
show that, properly used, this kind of information can improve parsing accuracy substantially.
Applying the techniques presented in this article, we achieve the highest reported accuracy for
parsing the Turkish Treebank.
Item Type: | Article |
---|---|
Subjects: | Q Science > QA Mathematics > QA075 Electronic computers. Computer science P Language and Literature > P Philology. Linguistics Q Science > QA Mathematics > QA076 Computer software |
Divisions: | Faculty of Engineering and Natural Sciences |
Depositing User: | Kemal Oflazer |
Date Deposited: | 25 Oct 2007 11:58 |
Last Modified: | 01 Jul 2010 12:41 |
URI: | https://research.sabanciuniv.edu/id/eprint/6392 |
Available Versions of this Item
-
Dependency parsing of Turkish. (deposited 19 Dec 2006 02:00)
- Dependency parsing of Turkish. (deposited 25 Oct 2007 11:58) [Currently Displayed]