Speech translation model modeling method and device based on speech synthesis data

The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural language processing. The problem that in the prior art, a speech translation model is small in training data size and insufficient in utilizati...

Full description

Saved in:
Bibliographic Details
Main Authors YANG MURUN, DU QUAN
Format Patent
LanguageChinese
English
Published 21.03.2023
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural language processing. The problem that in the prior art, a speech translation model is small in training data size and insufficient in utilization, and consequently the translation result is inaccurate is solved. The modeling method comprises the following steps: acquiring a general speech synthesis data set, and training to obtain a general speech synthesis model; acquiring a speech translation data set of the target domain; performing fine tuning on the universal speech synthesis model by using the speech translation data set to obtain a special speech synthesis model; inputting the source language annotation text into a special voice synthesis model, and generating a plurality of pieces of voice synthesis pseudo data according to a preset proportion to obtain a pseudo voice data set; and constructing an initial speech translation model, training the ini
AbstractList The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural language processing. The problem that in the prior art, a speech translation model is small in training data size and insufficient in utilization, and consequently the translation result is inaccurate is solved. The modeling method comprises the following steps: acquiring a general speech synthesis data set, and training to obtain a general speech synthesis model; acquiring a speech translation data set of the target domain; performing fine tuning on the universal speech synthesis model by using the speech translation data set to obtain a special speech synthesis model; inputting the source language annotation text into a special voice synthesis model, and generating a plurality of pieces of voice synthesis pseudo data according to a preset proportion to obtain a pseudo voice data set; and constructing an initial speech translation model, training the ini
Author YANG MURUN
DU QUAN
Author_xml – fullname: YANG MURUN
– fullname: DU QUAN
BookMark eNqNizsOwjAQBV1Awe8OywEoQogUShSBqCiAPlriB7bkrC3WQuL2IIUD0Mw0M1MzkiiYmPMlAZ2j_GTRwNlHoT5ahIFeHtQju2iJxZLFy3egGyssfUsdZn1LdlCvZDnz3IzvHBSLn2dmedhfm-MKKbbQxB0EuW1ORVHV63q7KXflP80HSjA5zA
ContentType Patent
DBID EVB
DatabaseName esp@cenet
DatabaseTitleList
Database_xml – sequence: 1
  dbid: EVB
  name: esp@cenet
  url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Chemistry
Sciences
Physics
DocumentTitleAlternate 一种基于语音合成数据的语音翻译模型建模方法和设备
ExternalDocumentID CN115828943A
GroupedDBID EVB
ID FETCH-epo_espacenet_CN115828943A3
IEDL.DBID EVB
IngestDate Fri Jul 19 14:36:29 EDT 2024
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language Chinese
English
LinkModel DirectLink
MergedId FETCHMERGED-epo_espacenet_CN115828943A3
Notes Application Number: CN202211694653
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230321&DB=EPODOC&CC=CN&NR=115828943A
ParticipantIDs epo_espacenet_CN115828943A
PublicationCentury 2000
PublicationDate 20230321
PublicationDateYYYYMMDD 2023-03-21
PublicationDate_xml – month: 03
  year: 2023
  text: 20230321
  day: 21
PublicationDecade 2020
PublicationYear 2023
RelatedCompanies SHENYANG YA TRANS NETWORK TECHNOLOGY CO., LTD
RelatedCompanies_xml – name: SHENYANG YA TRANS NETWORK TECHNOLOGY CO., LTD
Score 3.5925064
Snippet The invention relates to a speech translation model modeling method and device based on speech synthesis data, and belongs to the technical field of natural...
SourceID epo
SourceType Open Access Repository
SubjectTerms ACOUSTICS
CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
Title Speech translation model modeling method and device based on speech synthesis data
URI https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230321&DB=EPODOC&locale=&CC=CN&NR=115828943A
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT8JAEJ4gPm-KGsVH1sT01gjtlpZDY2QLISYUgmi4kT6WgIdC3Bqjv96ZLYgXvfSw7W62k8x83-5-OwNw67qE4lZkyiR2TC7tuunJJDKR2hI-c4dL2u_ohY3uM38cO-MSvK7vwug8oR86OSJ6VIL-nut4vdxsYgVaW6nu4jk2Le47Iz8wVqtj5NO2VTeClt8e9IO-MITwRWiEQx-JD60tuP2wBdtEoynPfvulRbdSlr8hpXMIOwMcLcuPoPQ1q8C-WFdeq8Beb3XgXYFdrdBMFDauvFAdw_BpKWUyYznhTKFlY7qiTfFEMGJFYWgWZSlLJQUDRnCVMvxSFZ3VZ4bcT80VI5HoCdx02iPRNXGakx-bTES4-SP7FMrZIpNnwBrcmdZsBBzeqPHUasbNGEOaG7l26nmRE51D9e9xqv-9vIADsi_Jr6z6JZTzt3d5hXicx9fakN8EHY6R
link.rule.ids 230,309,786,891,25594,76906
linkProvider European Patent Office
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3NT8IwFH9B_MCbokTxqyZmt0XYOjYOi5GOBRUGQTTcyD5qwMMgdsboX-9rB-JFLzt0a9O95L3fr-2v7wFc2bZEcSPUeRxZOuVmXXd4HOpIbSU-U4tyud_RCxqdJ3o_tsYFeF3dhVF5Qj9UckT0qBj9PVPxerHexPKUtlJcRzNsmt_4I9fTlqtj5NOmUde8ltse9L0-0xhzWaAFQxeJj1xbUPN2AzZtmZ1XUqfnlryVsvgNKf4ebA1wtDTbh8LXtAwltqq8Voad3vLAuwzbSqEZC2xceqE4gOHjgvN4SjKJM7mWjaiKNvkTwYjkhaFJmCYk4TIYEAlXCcEvRd5ZfKbI_cRMECkSPYRLvz1iHR2nOfmxyYQF6z8yK1BM5yk_AtKg1kvNRMChjRpNjGbUjDCk2aFtJo4TWuExVP8ep_rfywsodUa97qR7FzycwK60tZRiGfVTKGZv7_wMsTmLzpVRvwHYV5F-
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Speech+translation+model+modeling+method+and+device+based+on+speech+synthesis+data&rft.inventor=YANG+MURUN&rft.inventor=DU+QUAN&rft.date=2023-03-21&rft.externalDBID=A&rft.externalDocID=CN115828943A