DETECTION OF SPEECH SECTIONS FROM ACOUSTIC SIGNALS

Accession number;04A0171210
Title;DETECTION OF SPEECH SECTIONS FROM ACOUSTIC SIGNALS
Author; TERASAWA H (Tokyo Metropolitan Univ., Tokyo, Jpn) YANAGISAWA M (Tokyo Metropolitan Univ., Tokyo, Jpn) TAGAWA N (Tokyo Metropolitan Univ., Tokyo, Jpn) MINAGAWA A (Tokyo Metropolitan Univ., Tokyo, Jpn)
Journal Title;IEIC Technical Report (Institute of Electronics, Information and Communication Engineers)
Journal Code:S0532B
ISSN:0913-5685
VOL.103;NO.539(IE2003 148-162);PAGE.7-11(2004)
Figure&Table&Reference;FIG.6, TBL.1, REF.5
Pub. Country;Japan
Language;English
Abstract;We study a method to detect speech sections from acoustic signals using lip image sequences. This method can reduce mistakes that the non-target person's speeches are recognized as those of the target person, since the lip images of the target person can be used. The proposed method employs the hidden Markov models to learn the characteristics of the speech and the non-speech sections in the lip image sequences. Especially, to reduce the ambiguities caused by the variations of the appearance of the lips in the images and of the brightness of the images, we examine the construction of the features, which are extracted from the images and are used to detect speech sections. (author abst.)