Görsel-işitsel tandem sınıflandırıcılar ve birleşimleri ile konuşma tanıma başarısını arttırma (Improving speech recognition with audio-visual tandem classifiers and their fusions)

Topkaya, İbrahim Saygın and Şen, Mehmet Umut and Yılmaz, Mustafa Berkay and Erdoğan, Hakan (2011) Görsel-işitsel tandem sınıflandırıcılar ve birleşimleri ile konuşma tanıma başarısını arttırma (Improving speech recognition with audio-visual tandem classifiers and their fusions). In: IEEE 19th Conference on Signal Processing and Communications Applications (SIU 2011), Kemer, Antalya

Full text not available from this repository. (Request a copy)

Abstract

“Tandem approach” is a method used in speech recognition to increase performance by using classifier posterior probabilities as observations in a hidden Markov model. In this work we study the effect of using multiple visual tandem features to improve audio-visual recognition accuracy. In addition, we investigate methods to combine outputs of several audio and visual tandem classifiers with a classifier fusion system to generate outputs using learned weights. Experiments show that both approaches help to improve audio-visual speech recognition with respect to regular audio-visual speech recognition especially in noisy environments.
Item Type: Papers in Conference Proceedings
Uncontrolled Keywords: audio-visual tandem classifiers , audiovisual recognition accuracy , hidden Markov model , multiple visual tandem features , posterior probabilities , speech recognition , tandem approach
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Electronics
Faculty of Engineering and Natural Sciences
Depositing User: Hakan Erdoğan
Date Deposited: 24 Dec 2011 22:15
Last Modified: 26 Apr 2022 09:05
URI: https://research.sabanciuniv.edu/id/eprint/18535

Actions (login required)

View Item
View Item