This article is part of the series Emotion and Mental State Recognition from Speech.

Open Access Research Article

Recognizing Uncertainty in Speech

Heather Pon-Barry* and Stuart M Shieber

Author Affiliations

School of Engineering and Applied Sciences, Harvard University, 33 Oxford Street, Cambridge, MA 02138, USA

For all author emails, please log on.

EURASIP Journal on Advances in Signal Processing 2011, 2011:251753 doi:10.1155/2011/251753


The electronic version of this article is the complete one and can be found online at: http://asp.eurasipjournals.com/content/2011/1/251753


Received:1 August 2010
Accepted:23 November 2010
Published:8 December 2010

© 2011 Heather Pon-Barry and Stuart M. Shieber.

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

We address the problem of inferring a speaker's level of certainty based on prosodic information in the speech signal, which has application in speech-based dialogue systems. We show that using phrase-level prosodic features centered around the phrases causing uncertainty, in addition to utterance-level prosodic features, improves our model's level of certainty classification. In addition, our models can be used to predict which phrase a person is uncertain about. These results rely on a novel method for eliciting utterances of varying levels of certainty that allows us to compare the utility of contextually-based feature sets. We elicit level of certainty ratings from both the speakers themselves and a panel of listeners, finding that there is often a mismatch between speakers' internal states and their perceived states, and highlighting the importance of this distinction.

Publisher note

To access the full article, please see PDF.