Context-dependent modeling of phonemes
Main Authors | Senior, Andrew W; Shafran, Izhak; Sak, Hasim |
---|---|
Format | Patent |
Language | English |
Published | 14.11.2017 |
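Subjects | ACOUSTICS; MUSICAL INSTRUMENTS; PHYSICS; SPEECH ANALYSIS OR SYNTHESIS; SPEECH OR AUDIO CODING OR DECODING; SPEECH OR VOICE PROCESSING; SPEECH RECOGNITION |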
Abstract | Methods, systems, and apparatus, including computer programs encoded on computer storage media, for modeling phonemes. One method includes receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; for each of the plurality of time steps: processing the acoustic feature representation through each of one or more recurrent neural network layers to generate a recurrent output; processing the recurrent output using a softmax output layer to generate a set of scores, the set of scores comprising a respective score for each of a plurality of context dependent vocabulary phonemes, the score for each context dependent vocabulary phoneme representing a likelihood that the context dependent vocabulary phoneme represents the utterance at the time step; and determining, from the scores for the plurality of time steps, a context dependent phoneme representation of the sequence. |
---|---|
Author | Shafran Izhak; Sak Hasim; Senior Andrew W |
Author_xml | – fullname: Senior Andrew W – fullname: Shafran Izhak – fullname: Sak Hasim |
ContentType | Patent |
DBID | EVB |
DatabaseName | esp@cenet |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine; Chemistry; Sciences; Physics |
ExternalDocumentID | US9818409B2 |
GroupedDBID | EVB |
ID | FETCH-epo_espacenet_US9818409B23 |
IEDL.DBID | EVB |
IngestDate | Fri Jul 19 14:48:57 EDT 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-epo_espacenet_US9818409B23 |
Notes | Application Number: US201514877673 |
OpenAccessLink | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20171114&DB=EPODOC&CC=US&NR=9818409B2 |
ParticipantIDs | epo_espacenet_US9818409B2 |
PublicationCentury | 2000 |
PublicationDate | 20171114 |
PublicationDateYYYYMMDD | 2017-11-14 |
PublicationDate_xml | – month: 11 year: 2017 text: 20171114 day: 14 |
PublicationDecade | 2010 |
PublicationYear | 2017 |
RelatedCompanies | Google Inc |
RelatedCompanies_xml | – name: Google Inc |
SourceID | epo |
SourceType | Open Access Repository |
SubjectTerms | ACOUSTICS; MUSICAL INSTRUMENTS; PHYSICS; SPEECH ANALYSIS OR SYNTHESIS; SPEECH OR AUDIO CODING OR DECODING; SPEECH OR VOICE PROCESSING; SPEECH RECOGNITION |
Title | Context-dependent modeling of phonemes |
URI | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20171114&DB=EPODOC&locale=&CC=US&NR=9818409B2 |
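
The processing steps named in the abstract (recurrent layers, a softmax output layer, and a per-time-step decision over context dependent phonemes) can be illustrated with a minimal sketch. This is not the patented implementation: the single tanh recurrent layer, the random untrained weights, the triphone-style labels, the greedy per-step argmax, and every function name below are assumptions chosen only to make the data flow concrete.

```python
import math
import random


def softmax(logits):
    """Turn raw output-layer values into scores that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def rnn_step(w_in, w_rec, hidden, features):
    """One step of a simple tanh recurrent layer (assumed architecture)."""
    from_input = matvec(w_in, features)
    from_state = matvec(w_rec, hidden)
    return [math.tanh(a + b) for a, b in zip(from_input, from_state)]


def score_sequence(acoustic_sequence, w_in, w_rec, w_out):
    """Return one softmax score vector per time step."""
    hidden = [0.0] * len(w_rec)
    all_scores = []
    for features in acoustic_sequence:
        hidden = rnn_step(w_in, w_rec, hidden, features)   # recurrent output
        all_scores.append(softmax(matvec(w_out, hidden)))  # softmax scores
    return all_scores


def greedy_decode(all_scores, labels):
    """Pick the highest-scoring label at each step (an assumed decoding rule)."""
    return [labels[max(range(len(s)), key=s.__getitem__)] for s in all_scores]


def random_matrix(rows, cols, rng):
    """Untrained placeholder weights; a real model would learn these."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
PHONEMES = ["k-ae+t", "ae-t+sil", "sil"]  # hypothetical context dependent labels
FEATURE_DIM, HIDDEN_DIM, NUM_STEPS = 4, 5, 6

w_in = random_matrix(HIDDEN_DIM, FEATURE_DIM, rng)
w_rec = random_matrix(HIDDEN_DIM, HIDDEN_DIM, rng)
w_out = random_matrix(len(PHONEMES), HIDDEN_DIM, rng)

# Stand-in for acoustic feature representations, one vector per time step.
utterance = [[rng.uniform(-1, 1) for _ in range(FEATURE_DIM)]
             for _ in range(NUM_STEPS)]

scores = score_sequence(utterance, w_in, w_rec, w_out)
decoded = greedy_decode(scores, PHONEMES)
print(decoded)
```

Because the weights are random, the printed labels carry no linguistic meaning; the sketch only shows that each time step yields one score per context dependent phoneme and that a sequence-level representation is then derived from those scores.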