The incremental use of morphological information and lexicalization in data-driven dependency parsing

Eryiğit, Gülşen and Nivre, Joakim and Oflazer, Kemal (2006) The incremental use of morphological information and lexicalization in data-driven dependency parsing. In: 21st International Conference on the Computer Processing of Oriental Languages, Singapore

Abstract

Typological diversity among the natural languages of the world poses interesting challenges for the models and algorithms used in syntactic parsing. In this paper, we apply a data-driven dependency parser to Turkish, a language characterized by rich morphology and flexible constituent order, and study the effect of employing varying amounts of morpholexical information on parsing performance. The investigations show that accuracy can be improved by using representations based on inflectional groups rather than word forms, confirming earlier studies. In addition, lexicalization and the use of rich morphological features are found to have a positive effect. By combining all these techniques, we obtain the highest reported accuracy for parsing the Turkish Treebank.
Item Type: Papers in Conference Proceedings
Subjects: Q Science > QA Mathematics; P Language and Literature > P Philology. Linguistics
Divisions: Faculty of Engineering and Natural Sciences
Depositing User: Kemal Oflazer
Date Deposited: 19 Dec 2006 02:00
Last Modified: 26 Apr 2022 08:32
URI: https://research.sabanciuniv.edu/id/eprint/1170
