Study on Quality Evaluation Method of Speech Datasets for Algorithm Model

With the maturity of intelligent voice technology and product application,the demand for high-quality voice datasets is increasing.There have been some researchers put effort on the quality evaluation of the structured data,but there are few standards appeared for the unstructured voice dataset.By a...

Full description

Saved in:
Bibliographic Details
Published inJi suan ji ke xue Vol. 49; p. 519
Main Authors Li, Sun, Cao, Feng, Liu, Zi-shan
Format Journal Article
LanguageChinese
Published Chongqing Guojia Kexue Jishu Bu 01.01.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:With the maturity of intelligent voice technology and product application,the demand for high-quality voice datasets is increasing.There have been some researchers put effort on the quality evaluation of the structured data,but there are few standards appeared for the unstructured voice dataset.By analyzing the construction principle of speech algorithm model and analyzing the construction demand of voice dataset,a unified quality assessment framework for the voice dataset is presented.The framework proposes to evaluate the dataset in terms of four dimensions,each of which subsumes a set of criteria:breadth coverage,anthology distinction,field depth and accuracy completeness.The criteria that are suitable to evaluate the quality dimensions are presented,each with the definition,measurement method,and the evaluation process for the voice dataset quality measurement.Experimental assessment and analysis results of the voice datasets in the vehicular application field are presented as the reference for evaluating
ISSN:1002-137X