Look Who's Talking: Active Speaker Detection in the Wild

In this work, we present a novel audio-visual dataset for active speaker detection in the wild. A speaker is considered active when his or her face is visible and the voice is audible simultaneously. Although active speaker detection is a crucial pre-processing step for many audio-visual tasks, ther...

Full description

Saved in:
Bibliographic Details
Published inarXiv.org
Main Authors You Jin Kim, Hee-Soo Heo, Choe, Soyeon, Soo-Whan Chung, Kwon, Yoohwan, Bong-Jin, Lee, Kwon, Youngki, Joon Son Chung
Format Paper
LanguageEnglish
Published Ithaca Cornell University Library, arXiv.org 17.08.2021
Subjects
Online AccessGet full text

Cover

Loading…