Video signal processing systems and methods utilizing automated speech analysis

A method of increasing the frame rate of an image of a speaking person comprises monitoring an audio signal indicative of utterances by the speaking person and the associated video signal. The audio signal corresponds to one or more fields or frames to be reconstructed, and individual portions of th...

Full description

Saved in:
Bibliographic Details
Main Author CHEN TSUHAN
Format Patent
LanguageEnglish
Published 11.12.2001
Edition7
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A method of increasing the frame rate of an image of a speaking person comprises monitoring an audio signal indicative of utterances by the speaking person and the associated video signal. The audio signal corresponds to one or more fields or frames to be reconstructed, and individual portions of the audio signal are associated with facial feature information. The facial information includes mouth formation and position information derived from phonemes or other speech-based criteria from which the position of a speaker's mouth may be reliably predicted. A field or frame of the image is reconstructed using image features extracted from the existing frame and by utilizing the facial feature information associated with a detected phoneme.
Bibliography:Application Number: US19940210529