METHOD AND APPARATUS FOR TRAINING DEEP LEARNING MODEL

Embodiments of the present disclosure disclose a method and apparatus for training a deep learning model. A specific embodiment of the method includes: acquiring model description information and configuration information of a deep learning model; segmenting the model description information into at...

Full description

Saved in:
Bibliographic Details
Main Authors LIU, Yi, YU, Dianhai, HE, Tianjian, MA, Yanjun, DONG, Daxiang
Format Patent
LanguageEnglish
French
German
Published 12.10.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Embodiments of the present disclosure disclose a method and apparatus for training a deep learning model. A specific embodiment of the method includes: acquiring model description information and configuration information of a deep learning model; segmenting the model description information into at least two sections based on segmentation point variable in the configuration information, and loading the model description information to a corresponding resource to run; inputting a batch of training samples into a resource corresponding to a first section of model description information, then starting training and using obtained context information as an input of a resource corresponding to a subsequent section of model description information; and so on until an operation result of a resource corresponding to a final section of model description information is obtained; if a training completion condition is met, outputting a trained deep learning model; and otherwise, keeping on acquiring a subsequent batch of training samples and performing the above training steps until the training completion condition is met. This embodiment realizes free collocation of heterogeneous devices, gives full play to the computing power of different computing devices, and improves training speed.
Bibliography:Application Number: EP20200866951