Ghost and iLPCnet-based Mongolian speech synthesis method

The invention discloses a ghost and iLPCnet-based Mongolian speech synthesis method, and the method comprises the steps: carrying out the alignment of a Mongolian phoneme information sequence based on a Bang pre-training model; on the basis of a ghost acoustic model, acoustic features are generated...

Full description

Saved in:
Bibliographic Details
Main Authors DAI QIN, ZHANG WENJING, SIRLING GER RILE, REN-QING DAOERJI, SA HEYA
Format Patent
LanguageChinese
English
Published 29.07.2022
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The invention discloses a ghost and iLPCnet-based Mongolian speech synthesis method, and the method comprises the steps: carrying out the alignment of a Mongolian phoneme information sequence based on a Bang pre-training model; on the basis of a ghost acoustic model, acoustic features are generated according to the phoneme sequence; the iLPCnet model is used as a vocoder, and conversion from acoustic features to voice waveforms is carried out. The Mongolian text is converted into phonemes by using the Encoder-Decoder model, then the phonemes are directly generated into the mel frequency spectrum by using the ghost-based acoustic model, and the mel frequency spectrum is directly converted into the voice waveform by the iLPCnet vocoder, so that the method can be seamlessly integrated to an end-to-end TTS system, the requirement on parameters is reduced, the speed of voice synthesis is improved, and the method is suitable for voice synthesis of small languages. 本发明公开一种基于ghost和iLPCnet的蒙古语语音合成方法,基于Bang预训练模型,对齐蒙古语音
AbstractList The invention discloses a ghost and iLPCnet-based Mongolian speech synthesis method, and the method comprises the steps: carrying out the alignment of a Mongolian phoneme information sequence based on a Bang pre-training model; on the basis of a ghost acoustic model, acoustic features are generated according to the phoneme sequence; the iLPCnet model is used as a vocoder, and conversion from acoustic features to voice waveforms is carried out. The Mongolian text is converted into phonemes by using the Encoder-Decoder model, then the phonemes are directly generated into the mel frequency spectrum by using the ghost-based acoustic model, and the mel frequency spectrum is directly converted into the voice waveform by the iLPCnet vocoder, so that the method can be seamlessly integrated to an end-to-end TTS system, the requirement on parameters is reduced, the speed of voice synthesis is improved, and the method is suitable for voice synthesis of small languages. 本发明公开一种基于ghost和iLPCnet的蒙古语语音合成方法,基于Bang预训练模型,对齐蒙古语音
Author REN-QING DAOERJI
SA HEYA
DAI QIN
ZHANG WENJING
SIRLING GER RILE
Author_xml – fullname: DAI QIN
– fullname: ZHANG WENJING
– fullname: SIRLING GER RILE
– fullname: REN-QING DAOERJI
– fullname: SA HEYA
BookMark eNrjYmDJy89L5WSwdM_ILy5RSMxLUcj0CXDOSy3RTUosTk1R8M3PS8_PyUzMUyguSE1NzlAorswryUgtzixWyE0tychP4WFgTUvMKU7lhdLcDIpuriHOHrqpBfnxqcUFicmpQNPinf0MDU0sjIxMLMwdjYlRAwATozAi
ContentType Patent
DBID EVB
DatabaseName esp@cenet
DatabaseTitleList
Database_xml – sequence: 1
  dbid: EVB
  name: esp@cenet
  url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Chemistry
Sciences
Physics
DocumentTitleAlternate 基于ghost和iLPCnet的蒙古语语音合成方法
ExternalDocumentID CN114822487A
GroupedDBID EVB
ID FETCH-epo_espacenet_CN114822487A3
IEDL.DBID EVB
IngestDate Fri Jul 19 14:51:08 EDT 2024
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language Chinese
English
LinkModel DirectLink
MergedId FETCHMERGED-epo_espacenet_CN114822487A3
Notes Application Number: CN202210252979
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220729&DB=EPODOC&CC=CN&NR=114822487A
ParticipantIDs epo_espacenet_CN114822487A
PublicationCentury 2000
PublicationDate 20220729
PublicationDateYYYYMMDD 2022-07-29
PublicationDate_xml – month: 07
  year: 2022
  text: 20220729
  day: 29
PublicationDecade 2020
PublicationYear 2022
RelatedCompanies INNER MONGOLIA UNIVERSITY OF TECHNOLOGY
RelatedCompanies_xml – name: INNER MONGOLIA UNIVERSITY OF TECHNOLOGY
Score 3.542039
Snippet The invention discloses a ghost and iLPCnet-based Mongolian speech synthesis method, and the method comprises the steps: carrying out the alignment of a...
SourceID epo
SourceType Open Access Repository
SubjectTerms ACOUSTICS
CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
HANDLING RECORD CARRIERS
MUSICAL INSTRUMENTS
PHYSICS
PRESENTATION OF DATA
RECOGNITION OF DATA
RECORD CARRIERS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
Title Ghost and iLPCnet-based Mongolian speech synthesis method
URI https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220729&DB=EPODOC&locale=&CC=CN&NR=114822487A
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3fS8MwED7m_PmmU9H5gwjSt-LWdW3zUMSlq0NcN2TK3kaXpm4-tMVURP96L3FzvuhrAuFy8OW73OW7AFy227GHtI34Fo4qMwrbpJTbZoOmrVYqmqnQKv5-5PQe7btxe1yBl6UWRvcJfdfNERFRHPFe6vO6WCWxAv22Ul5N5ziUX4cjPzAWt2PLUo2wjaDjd4eDYMAMxnwWGdGDr8J-ZCvPvVmDdQyjXYWG7lNHqVKK35QS7sLGEFfLyj2ofM5qsM2WP6_VYKu_KHjXYFO_0OQSBxcolPtAb5U0g8RZQub3Q4bGm4qMEoIAfc5V3oLIQgg-I_IjwwBPziX5_in6AC7C7oj1TLRm8rP1CYtWhrcOoZrlmTgConrYJ6k9FVYaKy1qnLi8kXjUEonTdLh7DPW_16n_N3kCO8qNKntp0VOolq9v4gxpt5yea399AeGNhfk
link.rule.ids 230,309,783,888,25576,76876
linkProvider European Patent Office
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT8JAEJ4gPvCmKFF8rYnprRHa0tJDY2TbigqFGDTcmj62godC3Bqjv97ZFcSLXneTzewk336zM_vNAly0WlEbaRvxzUxRZmSGatuJoTbsTNcz1syYVPH3A7P7aNyNW-MSvCy1MLJP6LtsjoiIShDvhTyv56sklivfVvLLeIpDsyt_5LjK4nasaaIRtuJ2HG84cAdUodShgRI8OCLsR7ZqW9drsI4htiXQ4D11hCpl_ptS_B3YGOJqebELpc9JFSp0-fNaFbb6i4J3FTblC82E4-AChXwP7BshzSBRnpJpb0jReFWQUUoQoM8zkbcgfM5YMiH8I8cAj085-f4peh_OfW9EuypaE_5sPaTBynC9BuV8lrMDIKKHfZoZMdOySGhRo9RKGmnb1lhqNs3EOoT63-vU_5s8g0p31O-Fvdvg_gi2hUtFJlOzj6FcvL6xE6TgIj6VvvsCVYCI7A
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Ghost+and+iLPCnet-based+Mongolian+speech+synthesis+method&rft.inventor=DAI+QIN&rft.inventor=ZHANG+WENJING&rft.inventor=SIRLING+GER+RILE&rft.inventor=REN-QING+DAOERJI&rft.inventor=SA+HEYA&rft.date=2022-07-29&rft.externalDBID=A&rft.externalDocID=CN114822487A