SPEAKER DIALIZATION METHOD, SYSTEM, AND COMPUTER PROGRAM USING VOICE ACTIVITY DETECTION BASED ON SPEAKER EMBEDDING
To provide a speaker dialization method, a system, and a computer program using voice activity detection based on speaker embedding.SOLUTION: A speaker dialization method includes stages of: extracting speaker embedding for each voice frame for a given voice file; and detecting voice segments that a...
Saved in:
Main Authors | , , , , |
---|---|
Format | Patent |
Language | English Japanese |
Published |
09.06.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | To provide a speaker dialization method, a system, and a computer program using voice activity detection based on speaker embedding.SOLUTION: A speaker dialization method includes stages of: extracting speaker embedding for each voice frame for a given voice file; and detecting voice segments that are speech activity regions based on the speaker embedding.SELECTED DRAWING: Figure 4
【課題】 話者埋め込みに基づく音声活動検出を利用した話者ダイアライゼーション方法、システム、およびコンピュータプログラムを提供する。【解決手段】 話者ダイアライゼーション方法は、与えられた音声ファイルに対して音声フレームごとに話者埋め込みを抽出する段階、および前記話者埋め込みに基づいて音声活動領域(speech activity region)である音声区間を検出する段階を含む。【選択図】図4 |
---|---|
Bibliography: | Application Number: JP20210014192 |