SPEAKER DIALIZATION METHOD, SYSTEM, AND COMPUTER PROGRAM USING VOICE ACTIVITY DETECTION BASED ON SPEAKER EMBEDDING

To provide a speaker dialization method, a system, and a computer program using voice activity detection based on speaker embedding.SOLUTION: A speaker dialization method includes stages of: extracting speaker embedding for each voice frame for a given voice file; and detecting voice segments that a...

Full description

Saved in:

Bibliographic Details
Main Authors	HAN ICKSANG, HEO HEE SOO, KWON YOUNGKI, LEE BONG JIN, CHUNG JOON SON
Format	Patent
Language	English Japanese
Published	09.06.2022
Subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online Access	Get full text

Cover

Loading…

More Information
Summary:	To provide a speaker dialization method, a system, and a computer program using voice activity detection based on speaker embedding.SOLUTION: A speaker dialization method includes stages of: extracting speaker embedding for each voice frame for a given voice file; and detecting voice segments that are speech activity regions based on the speaker embedding.SELECTED DRAWING: Figure 4 【課題】話者埋め込みに基づく音声活動検出を利用した話者ダイアライゼーション方法、システム、およびコンピュータプログラムを提供する。【解決手段】話者ダイアライゼーション方法は、与えられた音声ファイルに対して音声フレームごとに話者埋め込みを抽出する段階、および前記話者埋め込みに基づいて音声活動領域（ｓｐｅｅｃｈａｃｔｉｖｉｔｙｒｅｇｉｏｎ）である音声区間を検出する段階を含む。【選択図】図４
Bibliography:	Application Number: JP20210014192