Ghost and iLPCnet-based Mongolian speech synthesis method
The invention discloses a ghost and iLPCnet-based Mongolian speech synthesis method, and the method comprises the steps: carrying out the alignment of a Mongolian phoneme information sequence based on a Bang pre-training model; on the basis of a ghost acoustic model, acoustic features are generated...
Saved in:
Main Authors | , , , , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
29.07.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | The invention discloses a ghost and iLPCnet-based Mongolian speech synthesis method, and the method comprises the steps: carrying out the alignment of a Mongolian phoneme information sequence based on a Bang pre-training model; on the basis of a ghost acoustic model, acoustic features are generated according to the phoneme sequence; the iLPCnet model is used as a vocoder, and conversion from acoustic features to voice waveforms is carried out. The Mongolian text is converted into phonemes by using the Encoder-Decoder model, then the phonemes are directly generated into the mel frequency spectrum by using the ghost-based acoustic model, and the mel frequency spectrum is directly converted into the voice waveform by the iLPCnet vocoder, so that the method can be seamlessly integrated to an end-to-end TTS system, the requirement on parameters is reduced, the speed of voice synthesis is improved, and the method is suitable for voice synthesis of small languages.
本发明公开一种基于ghost和iLPCnet的蒙古语语音合成方法,基于Bang预训练模型,对齐蒙古语音 |
---|---|
AbstractList | The invention discloses a ghost and iLPCnet-based Mongolian speech synthesis method, and the method comprises the steps: carrying out the alignment of a Mongolian phoneme information sequence based on a Bang pre-training model; on the basis of a ghost acoustic model, acoustic features are generated according to the phoneme sequence; the iLPCnet model is used as a vocoder, and conversion from acoustic features to voice waveforms is carried out. The Mongolian text is converted into phonemes by using the Encoder-Decoder model, then the phonemes are directly generated into the mel frequency spectrum by using the ghost-based acoustic model, and the mel frequency spectrum is directly converted into the voice waveform by the iLPCnet vocoder, so that the method can be seamlessly integrated to an end-to-end TTS system, the requirement on parameters is reduced, the speed of voice synthesis is improved, and the method is suitable for voice synthesis of small languages.
本发明公开一种基于ghost和iLPCnet的蒙古语语音合成方法,基于Bang预训练模型,对齐蒙古语音 |
Author | REN-QING DAOERJI SA HEYA DAI QIN ZHANG WENJING SIRLING GER RILE |
Author_xml | – fullname: DAI QIN – fullname: ZHANG WENJING – fullname: SIRLING GER RILE – fullname: REN-QING DAOERJI – fullname: SA HEYA |
BookMark | eNrjYmDJy89L5WSwdM_ILy5RSMxLUcj0CXDOSy3RTUosTk1R8M3PS8_PyUzMUyguSE1NzlAorswryUgtzixWyE0tychP4WFgTUvMKU7lhdLcDIpuriHOHrqpBfnxqcUFicmpQNPinf0MDU0sjIxMLMwdjYlRAwATozAi |
ContentType | Patent |
DBID | EVB |
DatabaseName | esp@cenet |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Chemistry Sciences Physics |
DocumentTitleAlternate | 基于ghost和iLPCnet的蒙古语语音合成方法 |
ExternalDocumentID | CN114822487A |
GroupedDBID | EVB |
ID | FETCH-epo_espacenet_CN114822487A3 |
IEDL.DBID | EVB |
IngestDate | Fri Jul 19 14:51:08 EDT 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | Chinese English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-epo_espacenet_CN114822487A3 |
Notes | Application Number: CN202210252979 |
OpenAccessLink | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220729&DB=EPODOC&CC=CN&NR=114822487A |
ParticipantIDs | epo_espacenet_CN114822487A |
PublicationCentury | 2000 |
PublicationDate | 20220729 |
PublicationDateYYYYMMDD | 2022-07-29 |
PublicationDate_xml | – month: 07 year: 2022 text: 20220729 day: 29 |
PublicationDecade | 2020 |
PublicationYear | 2022 |
RelatedCompanies | INNER MONGOLIA UNIVERSITY OF TECHNOLOGY |
RelatedCompanies_xml | – name: INNER MONGOLIA UNIVERSITY OF TECHNOLOGY |
Score | 3.542039 |
Snippet | The invention discloses a ghost and iLPCnet-based Mongolian speech synthesis method, and the method comprises the steps: carrying out the alignment of a... |
SourceID | epo |
SourceType | Open Access Repository |
SubjectTerms | ACOUSTICS CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING HANDLING RECORD CARRIERS MUSICAL INSTRUMENTS PHYSICS PRESENTATION OF DATA RECOGNITION OF DATA RECORD CARRIERS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION |
Title | Ghost and iLPCnet-based Mongolian speech synthesis method |
URI | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220729&DB=EPODOC&locale=&CC=CN&NR=114822487A |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3fS8MwED7m_PmmU9H5gwjSt-LWdW3zUMSlq0NcN2TK3kaXpm4-tMVURP96L3FzvuhrAuFy8OW73OW7AFy227GHtI34Fo4qMwrbpJTbZoOmrVYqmqnQKv5-5PQe7btxe1yBl6UWRvcJfdfNERFRHPFe6vO6WCWxAv22Ul5N5ziUX4cjPzAWt2PLUo2wjaDjd4eDYMAMxnwWGdGDr8J-ZCvPvVmDdQyjXYWG7lNHqVKK35QS7sLGEFfLyj2ofM5qsM2WP6_VYKu_KHjXYFO_0OQSBxcolPtAb5U0g8RZQub3Q4bGm4qMEoIAfc5V3oLIQgg-I_IjwwBPziX5_in6AC7C7oj1TLRm8rP1CYtWhrcOoZrlmTgConrYJ6k9FVYaKy1qnLi8kXjUEonTdLh7DPW_16n_N3kCO8qNKntp0VOolq9v4gxpt5yea399AeGNhfk |
link.rule.ids | 230,309,783,888,25576,76876 |
linkProvider | European Patent Office |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT8JAEJ4gPvCmKFF8rYnprRHa0tJDY2TbigqFGDTcmj62godC3Bqjv97ZFcSLXneTzewk336zM_vNAly0WlEbaRvxzUxRZmSGatuJoTbsTNcz1syYVPH3A7P7aNyNW-MSvCy1MLJP6LtsjoiIShDvhTyv56sklivfVvLLeIpDsyt_5LjK4nasaaIRtuJ2HG84cAdUodShgRI8OCLsR7ZqW9drsI4htiXQ4D11hCpl_ptS_B3YGOJqebELpc9JFSp0-fNaFbb6i4J3FTblC82E4-AChXwP7BshzSBRnpJpb0jReFWQUUoQoM8zkbcgfM5YMiH8I8cAj085-f4peh_OfW9EuypaE_5sPaTBynC9BuV8lrMDIKKHfZoZMdOySGhRo9RKGmnb1lhqNs3EOoT63-vU_5s8g0p31O-Fvdvg_gi2hUtFJlOzj6FcvL6xE6TgIj6VvvsCVYCI7A |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Ghost+and+iLPCnet-based+Mongolian+speech+synthesis+method&rft.inventor=DAI+QIN&rft.inventor=ZHANG+WENJING&rft.inventor=SIRLING+GER+RILE&rft.inventor=REN-QING+DAOERJI&rft.inventor=SA+HEYA&rft.date=2022-07-29&rft.externalDBID=A&rft.externalDocID=CN114822487A |