Context-dependent modeling of phonemes

Methods, systems, and apparatus, including computer programs encoded on computer storage media for modeling phonemes. One method includes receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a respective acoustic feature representation...

Full description

Saved in:

Bibliographic Details
Main Authors	Senior Andrew W, Shafran Izhak, Sak Hasim
Format	Patent
Language	English
Published	14.11.2017
Subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online Access	Get full text

Cover

Loading…

Abstract	Methods, systems, and apparatus, including computer programs encoded on computer storage media for modeling phonemes. One method includes receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; for each of the plurality of time steps: processing the acoustic feature representation through each of one or more recurrent neural network layers to generate a recurrent output; processing the recurrent output using a softmax output layer to generate a set of scores, the set of scores comprising a respective score for each of a plurality of context dependent vocabulary phonemes, the score for each context dependent vocabulary phoneme representing a likelihood that the context dependent vocabulary phoneme represents the utterance at the time step; and determining, from the scores for the plurality of time steps, a context dependent phoneme representation of the sequence.
AbstractList	Methods, systems, and apparatus, including computer programs encoded on computer storage media for modeling phonemes. One method includes receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; for each of the plurality of time steps: processing the acoustic feature representation through each of one or more recurrent neural network layers to generate a recurrent output; processing the recurrent output using a softmax output layer to generate a set of scores, the set of scores comprising a respective score for each of a plurality of context dependent vocabulary phonemes, the score for each context dependent vocabulary phoneme representing a likelihood that the context dependent vocabulary phoneme represents the utterance at the time step; and determining, from the scores for the plurality of time steps, a context dependent phoneme representation of the sequence.
Author	Shafran Izhak Sak Hasim Senior Andrew W
Author_xml	– fullname: Senior Andrew W – fullname: Shafran Izhak – fullname: Sak Hasim
BookMark	eNrjYmDJy89L5WRQc87PK0mtKNFNSS1IzUtJzStRyM1PSc3JzEtXyE9TKMgAqspNLeZhYE1LzClO5YXS3AwKbq4hzh66qQX58anFBYnJqXmpJfGhwZYWhhYmBpZORsZEKAEAInUpVw
ContentType	Patent
DBID	EVB
DatabaseName	esp@cenet
DatabaseTitleList
Database_xml	– sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine Chemistry Sciences Physics
ExternalDocumentID	US9818409B2
GroupedDBID	EVB
ID	FETCH-epo_espacenet_US9818409B23
IEDL.DBID	EVB
IngestDate	Fri Jul 19 14:48:57 EDT 2024
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-epo_espacenet_US9818409B23
Notes	Application Number: US201514877673
OpenAccessLink	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20171114&DB=EPODOC&CC=US&NR=9818409B2
ParticipantIDs	epo_espacenet_US9818409B2
PublicationCentury	2000
PublicationDate	20171114
PublicationDateYYYYMMDD	2017-11-14
PublicationDate_xml	– month: 11 year: 2017 text: 20171114 day: 14
PublicationDecade	2010
PublicationYear	2017
RelatedCompanies	Google Inc
RelatedCompanies_xml	– name: Google Inc
Score	3.1210363
Snippet	Methods, systems, and apparatus, including computer programs encoded on computer storage media for modeling phonemes. One method includes receiving an acoustic...
SourceID	epo
SourceType	Open Access Repository
SubjectTerms	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Title	Context-dependent modeling of phonemes
URI	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20171114&DB=EPODOC&locale=&CC=US&NR=9818409B2
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3NS8MwFH-M-XnTqji_yEFyK65dujWHIjRtGcI-cKvsNpY2hR3chq347_sS2ulFry_wkjx4H7_kfQA8Mp4XqvC5nTNXN9WWhS09d2X3uSc5Ijn08brAeTTuD1P2svAWLVg3tTCmT-iXaY6IGpWhvlfGXu9-HrEik1tZPsk1krbPyTyIaI2OnQGqLqNRGMTTSTQRVIggndHxa8B9A2VCtNYHGEUPdPZX_BbqopTdb4-SnMHhFJltqnNoqY0FJ6IZvGbB8aj-77bgyCRoZiUSayUsL4CajlIIWJsBthUx82zQCZFtQXSyuXpX5SWQJJ6LoY1bL_fXXKaz_SF7V9BG9K-ugay49B0M06T0cua4md-TGRqGrsK4i8lu3oHOn2xu_lm7hVMtL11W57A7aFcfn-oe_WslH4xkvgHCcX8-
link.rule.ids	230,309,783,888,25578,76884
linkProvider	European Patent Office
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LT4NAEJ409VFviprWJwfDjVjoUuFATHgFtdDGgumNdGFJerBtBOPfd3YD1YteZ5N9JfP4dme-AbgjVlGy0rTUguicVJuWKjX0pTq2DGohkkMfzwuco3gcpuR5YSw6sGprYQRP6JcgR0SNylHfa2Gvtz-PWJ7Irazu6QpFm8cgsT2lQcfaA6ouUTzH9mdTb-oqrmuncyV-tS1TQBkHrfUeRtgmp9n33xxelLL97VGCY9if4WTr-gQ6bC1Bz20br0lwGDX_3RIciATNvEJho4TVKSiCUQoBa9vAtpZFPxt0QvKmlHmyOXtn1RnIgZ-4oYpLZ7tjZul8t8nROXQR_bM-yEuLmhqGaZQaBdH03BzRHA3DkGHcReiwGMDgz2ku_hm7hV6YRJNs8hS_XMIRvzteYqeRK-jWH5_sGn1tTW_ELX0D4ZmCLg
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Context-dependent+modeling+of+phonemes&rft.inventor=Senior+Andrew+W&rft.inventor=Shafran+Izhak&rft.inventor=Sak+Hasim&rft.date=2017-11-14&rft.externalDBID=B2&rft.externalDocID=US9818409B2