SPEAKER DIALIZATION METHOD, SYSTEM, AND COMPUTER PROGRAM USING VOICE ACTIVITY DETECTION BASED ON SPEAKER EMBEDDING

To provide a speaker dialization method, a system, and a computer program using voice activity detection based on speaker embedding.SOLUTION: A speaker dialization method includes stages of: extracting speaker embedding for each voice frame for a given voice file; and detecting voice segments that a...

Full description

Saved in:
Bibliographic Details
Main Authors HAN ICKSANG, HEO HEE SOO, KWON YOUNGKI, LEE BONG JIN, CHUNG JOON SON
Format Patent
LanguageEnglish
Japanese
Published 09.06.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:To provide a speaker dialization method, a system, and a computer program using voice activity detection based on speaker embedding.SOLUTION: A speaker dialization method includes stages of: extracting speaker embedding for each voice frame for a given voice file; and detecting voice segments that are speech activity regions based on the speaker embedding.SELECTED DRAWING: Figure 4 【課題】 話者埋め込みに基づく音声活動検出を利用した話者ダイアライゼーション方法、システム、およびコンピュータプログラムを提供する。【解決手段】 話者ダイアライゼーション方法は、与えられた音声ファイルに対して音声フレームごとに話者埋め込みを抽出する段階、および前記話者埋め込みに基づいて音声活動領域(speech activity region)である音声区間を検出する段階を含む。【選択図】図4
Bibliography:Application Number: JP20210014192