SYSTEMS AND METHODS FOR COMBINING STOCHASTIC AVERAGE GRADIENT AND HESSIAN-FREE OPTIMIZATION FOR SEQUENCE TRAINING OF DEEP NEURAL NETWORKS
A method for training a deep neural network (DNN), comprises receiving and formatting speech data for the training, performing Hessian-free sequence training (HFST) on a first subset of a plurality of subsets of the speech data, and iteratively performing the HFST on successive subsets of the plural...
Saved in:
Main Authors | , |
---|---|
Format | Patent |
Language | English |
Published |
29.10.2015
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Be the first to leave a comment!