Study on Quality Evaluation Method of Speech Datasets for Algorithm Model
Published in | Ji suan ji ke xue Vol. 49; p. 519 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | Chinese |
Published | Chongqing, Guojia Kexue Jishu Bu, 01.01.2022 |
Summary: | As intelligent voice technology and its product applications mature, the demand for high-quality voice datasets is increasing. Some researchers have worked on quality evaluation for structured data, but few standards exist for unstructured voice datasets. By analyzing the construction principles of speech algorithm models and the construction requirements of voice datasets, a unified quality assessment framework for voice datasets is presented. The framework evaluates a dataset along four dimensions, each of which subsumes a set of criteria: breadth coverage, anthology distinction, field depth, and accuracy completeness. Criteria suitable for evaluating each quality dimension are presented, each with its definition, measurement method, and evaluation process for voice dataset quality measurement. Experimental assessment and analysis results for voice datasets in the vehicular application field are presented as the reference for evaluating |
---|---|
ISSN: | 1002-137X |