METHOD FOR DISTRIBUTED TRAINING MODEL, RELEVANT APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM

The present disclosure provides a method and apparatus for distributed training a model, an electronic device, and a computer readable storage medium. The method may include: performing, for each batch of training samples acquired by a distributed first trainer, model training through a distributed...

Full description

Saved in:
Bibliographic Details
Main Authors YU, Dianhai, WU, Zhihua, MA, Yanjun, WU, Xinxuan, WANG, Haifeng, YAO, Xuefeng, WU, Tian
Format Patent
LanguageEnglish
Published 18.11.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The present disclosure provides a method and apparatus for distributed training a model, an electronic device, and a computer readable storage medium. The method may include: performing, for each batch of training samples acquired by a distributed first trainer, model training through a distributed second trainer to obtain gradient information; updating a target parameter in a distributed built-in parameter server according to the gradient information; and performing, in response to determining that training for a preset number of training samples is completed, a parameter exchange between the distributed built-in parameter server and a distributed parameter server through the distributed first trainer to perform a parameter update on the initial model until training for the initial model is completed.
Bibliography:Application Number: US202117362674