Speech translation model modeling method and device based on speech synthesis data
Main Authors | YANG MURUN, DU QUAN |
---|---|
Format | Patent |
Language | Chinese; English |
Published | 21.03.2023 |
Abstract | The invention relates to a method and device for building a speech translation model from speech synthesis data, and belongs to the technical field of natural language processing. It addresses the problem that, in the prior art, speech translation models have little training data and make insufficient use of it, which leads to inaccurate translation results. The modeling method comprises the following steps: acquiring a general speech synthesis data set and training a general speech synthesis model on it; acquiring a speech translation data set for the target domain; fine-tuning the general speech synthesis model on the speech translation data set to obtain a domain-specific speech synthesis model; feeding the source-language annotation text into the domain-specific speech synthesis model and generating multiple pieces of synthesized pseudo speech data according to a preset proportion to obtain a pseudo speech data set; and constructing an initial speech translation model, training the ini |
---|---|
Author | YANG MURUN; DU QUAN |
DatabaseName | esp@cenet |
Discipline | Medicine Chemistry Sciences Physics |
DocumentTitleAlternate | 一种基于语音合成数据的语音翻译模型建模方法和设备 |
ExternalDocumentID | CN115828943A |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Notes | Application Number: CN202211694653 |
OpenAccessLink | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230321&DB=EPODOC&CC=CN&NR=115828943A |
PublicationDateYYYYMMDD | 2023-03-21 |
RelatedCompanies | SHENYANG YA TRANS NETWORK TECHNOLOGY CO., LTD |
SubjectTerms | ACOUSTICS; CALCULATING; COMPUTING; COUNTING; ELECTRIC DIGITAL DATA PROCESSING; MUSICAL INSTRUMENTS; PHYSICS; SPEECH ANALYSIS OR SYNTHESIS; SPEECH OR AUDIO CODING OR DECODING; SPEECH OR VOICE PROCESSING; SPEECH RECOGNITION |
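
The pseudo-data step described in the abstract can be illustrated with a small sketch. This is not the patented implementation: the record does not define the "preset proportion", so the code assumes it means the ratio of synthesized examples to real examples. The names `STExample`, `build_pseudo_dataset`, `pseudo_ratio`, and `toy_tts` are hypothetical, and `toy_tts` is a stand-in for the fine-tuned speech synthesis model, not a real synthesizer.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class STExample:
    """One speech translation example: audio, source text, target text."""
    audio: List[float]        # placeholder waveform representation
    source_text: str
    target_text: str
    is_pseudo: bool = False   # True when the audio was synthesized


# Hypothetical synthesizer interface: (text, seed) -> waveform.
Synthesizer = Callable[[str, int], List[float]]


def build_pseudo_dataset(
    real: Sequence[STExample],
    synthesize: Synthesizer,
    pseudo_ratio: float,
) -> List[STExample]:
    """Synthesize about len(real) * pseudo_ratio pseudo examples.

    ASSUMPTION: "preset proportion" is read as pseudo-to-real ratio.
    Source texts are reused cyclically; each call gets a distinct seed
    so a real synthesizer could vary voice or prosody per sample.
    """
    if pseudo_ratio < 0:
        raise ValueError("pseudo_ratio must be non-negative")
    n_pseudo = int(len(real) * pseudo_ratio)
    pseudo: List[STExample] = []
    for i in range(n_pseudo):
        ex = real[i % len(real)]
        pseudo.append(
            STExample(
                audio=synthesize(ex.source_text, i),
                source_text=ex.source_text,
                target_text=ex.target_text,  # translation label is reused
                is_pseudo=True,
            )
        )
    return pseudo


def toy_tts(text: str, seed: int) -> List[float]:
    """Stand-in for the fine-tuned synthesis model; NOT real audio."""
    return [float((ord(ch) + seed) % 7) for ch in text]


real_data = [
    STExample([0.0], f"source sentence {k}", f"target sentence {k}")
    for k in range(4)
]
pseudo_data = build_pseudo_dataset(real_data, toy_tts, pseudo_ratio=1.5)
training_pool = list(real_data) + pseudo_data
```

In this sketch the pseudo examples keep the original target-language text, so only the audio side is augmented; how the real and pseudo sets are then combined during training is not recoverable from the truncated abstract and is left open here.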