Augmenting conversations through context-aware multimedia retrieval based on speech recognition

Warning The system is temporarily closed to updates for reporting purpose.

Yüksel, Kamer Ali and Erçil, Aytül and Cansoy, Murat Çelik and Balcısoy, Selim (2011) Augmenting conversations through context-aware multimedia retrieval based on speech recognition. In: 5th Workshop on Human-Computer Interaction and Information Retrieval (HCIR 2011), Mountain view, CA

[thumbnail of augmenting_conversations_through_context-aware_multimedia_retrieval_based_on_speech_recognition_.2057.pdf] PDF
augmenting_conversations_through_context-aware_multimedia_retrieval_based_on_speech_recognition_.2057.pdf

Download (323kB)

Abstract

Future’s environments will be sensitive and responsive to the presence of people to support them carrying out their everyday life activities, tasks and rituals, in an easy and natural way. Such interactive spaces will use the information and communication technologies to bring the computation into the physical world, in order to enhance ordinary activities of their users. This paper describes a speech-based spoken multimedia retrieval system that can be used to present relevant video-podcast (vodcast) footage, in response to spontaneous speech and conversations during daily life activities. The proposed system allows users to search the spoken content of multimedia files rather than their associated meta-information and let them navigate to the right portion where queried words are spoken by facilitating within-medium searches of multimedia content through a bag-of-words approach. Finally, we have studied the proposed system on different scenarios by using vodcasts in English from various categories, as the targeted multimedia, and discussed how it would enhance people’s everyday life activities by different scenarios including education, entertainment, marketing, news and workplace.
Item Type: Papers in Conference Proceedings
Uncontrolled Keywords: Ambient intelligence, multimedia retrieval, speech recognition, spoken document retrieval, video-podcast
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Engineering and Natural Sciences > Academic programs > Electronics
Faculty of Engineering and Natural Sciences
Depositing User: Aytül Erçil
Date Deposited: 25 Nov 2011 12:39
Last Modified: 26 Apr 2022 09:02
URI: https://research.sabanciuniv.edu/id/eprint/17519

Actions (login required)

View Item
View Item