Look Who's Talking: Active Speaker Detection in the Wild
In this work, we present a novel audio-visual dataset for active speaker detection in the wild. A speaker is considered active when his or her face is visible and the voice is audible simultaneously. Although active speaker detection is a crucial pre-processing step for many audio-visual tasks, ther...
Saved in:
Published in | arXiv.org |
---|---|
Main Authors | , , , , , , , |
Format | Paper |
Language | English |
Published |
Ithaca
Cornell University Library, arXiv.org
17.08.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Be the first to leave a comment!