Speech translation model modeling method and device based on speech synthesis data

The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural language processing. The problem that in the prior art, a speech translation model is small in training data size and insufficient in utilizati...

Full description

Saved in:

Bibliographic Details
Main Authors	YANG MURUN, DU QUAN
Format	Patent
Language	Chinese English
Published	21.03.2023
Subjects	ACOUSTICS CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online Access	Get full text

Cover

Loading…

Abstract	The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural language processing. The problem that in the prior art, a speech translation model is small in training data size and insufficient in utilization, and consequently the translation result is inaccurate is solved. The modeling method comprises the following steps: acquiring a general speech synthesis data set, and training to obtain a general speech synthesis model; acquiring a speech translation data set of the target domain; performing fine tuning on the universal speech synthesis model by using the speech translation data set to obtain a special speech synthesis model; inputting the source language annotation text into a special voice synthesis model, and generating a plurality of pieces of voice synthesis pseudo data according to a preset proportion to obtain a pseudo voice data set; and constructing an initial speech translation model, training the ini
AbstractList	The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural language processing. The problem that in the prior art, a speech translation model is small in training data size and insufficient in utilization, and consequently the translation result is inaccurate is solved. The modeling method comprises the following steps: acquiring a general speech synthesis data set, and training to obtain a general speech synthesis model; acquiring a speech translation data set of the target domain; performing fine tuning on the universal speech synthesis model by using the speech translation data set to obtain a special speech synthesis model; inputting the source language annotation text into a special voice synthesis model, and generating a plurality of pieces of voice synthesis pseudo data according to a preset proportion to obtain a pseudo voice data set; and constructing an initial speech translation model, training the ini
Author	YANG MURUN DU QUAN
Author_xml	– fullname: YANG MURUN – fullname: DU QUAN
BookMark	eNqNizsOwjAQBV1Awe8OywEoQogUShSBqCiAPlriB7bkrC3WQuL2IIUD0Mw0M1MzkiiYmPMlAZ2j_GTRwNlHoT5ahIFeHtQju2iJxZLFy3egGyssfUsdZn1LdlCvZDnz3IzvHBSLn2dmedhfm-MKKbbQxB0EuW1ORVHV63q7KXflP80HSjA5zA
ContentType	Patent
DBID	EVB
DatabaseName	esp@cenet
DatabaseTitleList
Database_xml	– sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine Chemistry Sciences Physics
DocumentTitleAlternate	一种基于语音合成数据的语音翻译模型建模方法和设备
ExternalDocumentID	CN115828943A
GroupedDBID	EVB
ID	FETCH-epo_espacenet_CN115828943A3
IEDL.DBID	EVB
IngestDate	Fri Jul 19 14:36:29 EDT 2024
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	Chinese English
LinkModel	DirectLink
MergedId	FETCHMERGED-epo_espacenet_CN115828943A3
Notes	Application Number: CN202211694653
OpenAccessLink	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230321&DB=EPODOC&CC=CN&NR=115828943A
ParticipantIDs	epo_espacenet_CN115828943A
PublicationCentury	2000
PublicationDate	20230321
PublicationDateYYYYMMDD	2023-03-21
PublicationDate_xml	– month: 03 year: 2023 text: 20230321 day: 21
PublicationDecade	2020
PublicationYear	2023
RelatedCompanies	SHENYANG YA TRANS NETWORK TECHNOLOGY CO., LTD
RelatedCompanies_xml	– name: SHENYANG YA TRANS NETWORK TECHNOLOGY CO., LTD
Score	3.5925064
Snippet	The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural...
SourceID	epo
SourceType	Open Access Repository
SubjectTerms	ACOUSTICS CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Title	Speech translation model modeling method and device based on speech synthesis data
URI	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230321&DB=EPODOC&locale=&CC=CN&NR=115828943A
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT8JAEJ4gPm-KGsVH1sT01gjtlpZDY2QLISYUgmi4kT6WgIdC3Bqjv96ZLYgXvfSw7W62k8x83-5-OwNw67qE4lZkyiR2TC7tuunJJDKR2hI-c4dL2u_ohY3uM38cO-MSvK7vwug8oR86OSJ6VIL-nut4vdxsYgVaW6nu4jk2Le47Iz8wVqtj5NO2VTeClt8e9IO-MITwRWiEQx-JD60tuP2wBdtEoynPfvulRbdSlr8hpXMIOwMcLcuPoPQ1q8C-WFdeq8Beb3XgXYFdrdBMFDauvFAdw_BpKWUyYznhTKFlY7qiTfFEMGJFYWgWZSlLJQUDRnCVMvxSFZ3VZ4bcT80VI5HoCdx02iPRNXGakx-bTES4-SP7FMrZIpNnwBrcmdZsBBzeqPHUasbNGEOaG7l26nmRE51D9e9xqv-9vIADsi_Jr6z6JZTzt3d5hXicx9fakN8EHY6R
link.rule.ids	230,309,786,891,25594,76906
linkProvider	European Patent Office
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3NT8IwFH9B_MCbokTxqyZmt0XYOjYOi5GOBRUGQTTcyD5qwMMgdsboX-9rB-JFLzt0a9O95L3fr-2v7wFc2bZEcSPUeRxZOuVmXXd4HOpIbSU-U4tyud_RCxqdJ3o_tsYFeF3dhVF5Qj9UckT0qBj9PVPxerHexPKUtlJcRzNsmt_4I9fTlqtj5NOmUde8ltse9L0-0xhzWaAFQxeJj1xbUPN2AzZtmZ1XUqfnlryVsvgNKf4ebA1wtDTbh8LXtAwltqq8Voad3vLAuwzbSqEZC2xceqE4gOHjgvN4SjKJM7mWjaiKNvkTwYjkhaFJmCYk4TIYEAlXCcEvRd5ZfKbI_cRMECkSPYRLvz1iHR2nOfmxyYQF6z8yK1BM5yk_AtKg1kvNRMChjRpNjGbUjDCk2aFtJo4TWuExVP8ep_rfywsodUa97qR7FzycwK60tZRiGfVTKGZv7_wMsTmLzpVRvwHYV5F-
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Speech+translation+model+modeling+method+and+device+based+on+speech+synthesis+data&rft.inventor=YANG+MURUN&rft.inventor=DU+QUAN&rft.date=2023-03-21&rft.externalDBID=A&rft.externalDocID=CN115828943A