Context-dependent modeling of phonemes

Bibliographic Details
Main Authors Senior Andrew W, Shafran Izhak, Sak Hasim
Format Patent
Language English
Published 14.11.2017
Abstract Methods, systems, and apparatus, including computer programs encoded on computer storage media for modeling phonemes. One method includes receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; for each of the plurality of time steps: processing the acoustic feature representation through each of one or more recurrent neural network layers to generate a recurrent output; processing the recurrent output using a softmax output layer to generate a set of scores, the set of scores comprising a respective score for each of a plurality of context dependent vocabulary phonemes, the score for each context dependent vocabulary phoneme representing a likelihood that the context dependent vocabulary phoneme represents the utterance at the time step; and determining, from the scores for the plurality of time steps, a context dependent phoneme representation of the sequence.
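The method in the abstract can be illustrated with a minimal sketch in plain Python. It is not the patented implementation. Several details are assumptions made only for illustration: the plain tanh recurrent layer (the abstract says only "recurrent neural network layers"), the random untrained weights, the layer sizes, the hypothetical label names in left-center+right notation, and the greedy decoding that merges consecutive repeats (the abstract does not say how the representation is determined from the scores).

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Small random weights; a trained model would load learned values instead."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(logits):
    """Turn raw outputs into scores that are positive and sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


class RecurrentLayer:
    """Plain tanh recurrent layer (an assumption, not taken from the patent)."""

    def __init__(self, n_in, n_hidden, rng):
        self.w_in = make_matrix(n_hidden, n_in, rng)
        self.w_rec = make_matrix(n_hidden, n_hidden, rng)
        self.state = [0.0] * n_hidden

    def reset(self):
        self.state = [0.0] * len(self.state)

    def step(self, x):
        """Combine the current input with the previous state."""
        from_input = matvec(self.w_in, x)
        from_state = matvec(self.w_rec, self.state)
        self.state = [math.tanh(a + b) for a, b in zip(from_input, from_state)]
        return self.state


def score_sequence(acoustic_sequence, layers, w_out):
    """Return one softmax score vector per time step."""
    for layer in layers:
        layer.reset()
    all_scores = []
    for features in acoustic_sequence:
        output = features
        for layer in layers:  # pass through each recurrent layer in turn
            output = layer.step(output)
        all_scores.append(softmax(matvec(w_out, output)))
    return all_scores


def decode(all_scores, labels):
    """Greedy decoding: best label per step, consecutive repeats merged."""
    result = []
    for scores in all_scores:
        best = labels[scores.index(max(scores))]
        if not result or result[-1] != best:
            result.append(best)
    return result


# Hypothetical context-dependent phoneme labels, for illustration only.
LABELS = ["sil-k+ae", "k-ae+t", "ae-t+sil"]

rng = random.Random(0)
layers = [RecurrentLayer(4, 8, rng), RecurrentLayer(8, 8, rng)]
w_out = make_matrix(len(LABELS), 8, rng)
# Six time steps with four made-up acoustic features each.
utterance = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(6)]

scores = score_sequence(utterance, layers, w_out)
print(decode(scores, LABELS))
```

With untrained weights the printed labels carry no meaning. The sketch only shows the data flow: features at each time step, a recurrent output, a softmax score per context-dependent phoneme, and a label sequence derived from the scores across all time steps.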
ExternalDocumentID US9818409B2
Notes Application Number: US201514877673
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20171114&DB=EPODOC&CC=US&NR=9818409B2
RelatedCompanies Google Inc
SubjectTerms ACOUSTICS
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION