SPEECH PROCESSING UNIT, METHOD, AND PROGRAM

Bibliographic Details
Main Authors	KIDA YUSUKE, DING NING, HIROHATA MAKOTO
Format	Patent
Language	English, Japanese
Published	30.03.2015
Subjects	ACOUSTICS; MUSICAL INSTRUMENTS; PHYSICS; SPEECH ANALYSIS OR SYNTHESIS; SPEECH OR AUDIO CODING OR DECODING; SPEECH OR VOICE PROCESSING; SPEECH RECOGNITION
Summary:	PROBLEM TO BE SOLVED: To provide a speech processing unit, a method, and a program capable of performing speaker clustering without loss of accuracy. SOLUTION: An acquisition unit 11 acquires speech. A division unit 12 divides the speech into a plurality of sections according to a predetermined rule. A calculation unit 13 calculates a speech similarity for every combination of sections. An estimation unit 14 estimates the arrival direction of the speech for each section. A correction unit 15 classifies sections whose arrival directions are close to each other into the same group and corrects the similarity of each combination of sections belonging to the same group. A clustering unit 16 clusters the sections using the corrected similarities. SELECTED DRAWING: Figure 2
Bibliography:	Application Number: JP20130192399
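
The processing chain described in the summary (similarity calculation, arrival-direction grouping, similarity correction, clustering) can be illustrated with a minimal sketch. The record does not say how each unit is implemented, so everything concrete here is an assumption made for illustration: cosine similarity between per-section feature vectors, greedy grouping with a 15-degree tolerance, an additive boost of 0.3 as the "correction", and single-link threshold clustering. All function and parameter names are hypothetical and do not come from the patent.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors (assumed similarity measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def similarity_matrix(features):
    """Similarity for every combination of sections (calculation unit 13)."""
    n = len(features)
    return [[cosine_similarity(features[i], features[j]) for j in range(n)]
            for i in range(n)]


def group_by_direction(directions_deg, tolerance_deg=15.0):
    """Put sections with close arrival directions in the same group.

    Assumption: greedy assignment to the first group whose representative
    direction is within tolerance_deg, with angles wrapped at 360 degrees.
    """
    representatives, labels = [], []
    for d in directions_deg:
        for g, rep in enumerate(representatives):
            if abs((d - rep + 180.0) % 360.0 - 180.0) <= tolerance_deg:
                labels.append(g)
                break
        else:
            representatives.append(d)
            labels.append(len(representatives) - 1)
    return labels


def correct_similarity(sim, groups, boost=0.3):
    """Raise the similarity of same-group pairs (correction unit 15).

    Assumption: the correction is an additive boost capped at 1.0.
    """
    corrected = [row[:] for row in sim]
    for i in range(len(sim)):
        for j in range(len(sim)):
            if i != j and groups[i] == groups[j]:
                corrected[i][j] = min(1.0, sim[i][j] + boost)
    return corrected


def cluster(sim, threshold=0.7):
    """Single-link clustering: merge any pair with similarity >= threshold."""
    parent = list(range(len(sim)))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for i in range(len(sim)):
        for j in range(i + 1, len(sim)):
            if sim[i][j] >= threshold:
                parent[find(j)] = find(i)
    ids = {}
    return [ids.setdefault(find(i), len(ids)) for i in range(len(sim))]


# Toy data (made up): four sections, two talkers at roughly 10 and 90 degrees.
features = [[1.0, 0.0, 0.0], [0.6, 0.8, 0.0], [0.0, 0.0, 1.0], [0.0, 0.6, 0.8]]
directions = [10.0, 14.0, 90.0, 95.0]

sim = similarity_matrix(features)
groups = group_by_direction(directions)
labels_plain = cluster(sim)
labels_corrected = cluster(correct_similarity(sim, groups))
```

With this toy data, sections 0 and 1 have an acoustic similarity of about 0.6, which is below the clustering threshold, so clustering on the uncorrected matrix leaves them in separate clusters. Because their arrival directions are close, the correction raises that pair's similarity and the two sections end up in the same cluster.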