Akyol, Aydın and Erdoğan, Hakan (2004) Filler model based confidence measures for spoken dialogue systems: a case study for Turkish. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004 (ICASSP '04), Montreal, Quebec, Canada
PDF
burcu_2.pdf
Download (115kB)
Official URL: http://dx.doi.org/10.1109/ICASSP.2004.1326102
Abstract
Because of the inadequate performance of speech recognition systems, an accurate confidence scoring mechanism should be employed to understand user requests correctly. To determine a confidence score for a hypothesis, certain confidence features are combined. The performance of filler-model based confidence features has been investigated. Five types of filler model networks were defined: triphone network, phone network, phone-class network, 5-state catch-all model, and 3-state catch-all model. First, all models were evaluated in a Turkish speech recognition task in terms of their ability to correctly tag recognition hypotheses as either correct or recognition errors. The best performance was obtained from the triphone recognition network. Then, the performance of reliable combinations of these models was investigated, and it was observed that certain combinations of filler models could significantly improve the accuracy of the confidence annotation.
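The abstract does not give the exact confidence features, so the following is only a minimal sketch of one common way a filler-model score can be used, not the authors' method. It assumes the confidence feature is a frame-normalized log-likelihood ratio between the recognizer's hypothesis and a filler network, that several filler scores are combined by a weighted sum, and that a fixed threshold tags the hypothesis. The function names, the weights, and the threshold are all hypothetical.

```python
from typing import Sequence


def filler_confidence(hyp_loglik: float, filler_loglik: float, num_frames: int) -> float:
    """Frame-normalized log-likelihood ratio (assumed feature, not from the paper).

    hyp_loglik:    acoustic log-likelihood of the recognized hypothesis
    filler_loglik: log-likelihood of the same speech segment under a filler network
    num_frames:    number of acoustic frames in the segment
    """
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    return (hyp_loglik - filler_loglik) / num_frames


def combine_scores(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted sum of scores from several filler models (assumed combination rule)."""
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    return sum(w * s for w, s in zip(weights, scores))


def tag_hypothesis(score: float, threshold: float = 0.0) -> str:
    """Tag a hypothesis as 'correct' or 'recognition-error' (hypothetical threshold)."""
    return "correct" if score >= threshold else "recognition-error"


# Illustrative numbers only; these are not results from the paper.
score_a = filler_confidence(hyp_loglik=-100.0, filler_loglik=-120.0, num_frames=10)
score_b = filler_confidence(hyp_loglik=-150.0, filler_loglik=-120.0, num_frames=10)
combined = combine_scores([score_a, score_b], weights=[0.5, 0.5])
label = tag_hypothesis(combined)
```

In practice the threshold and the combination weights would be tuned on held-out data rather than fixed by hand.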
Item Type: | Papers in Conference Proceedings |
---|---|
Divisions: | Faculty of Engineering and Natural Sciences |
Depositing User: | Hakan Erdoğan |
Date Deposited: | 13 Apr 2009 09:49 |
Last Modified: | 26 Apr 2022 08:50 |
URI: | https://research.sabanciuniv.edu/id/eprint/11459 |