Scheduling method and device for model training, electronic equipment and storage medium

The invention provides a scheduling method and device for model training, electronic equipment and a storage medium, and relates to the field of artificial intelligence, in particular to the field of deep learning and cloud computing. According to the specific implementation scheme of the scheduling...

Full description

Saved in:
Bibliographic Details
Main Authors BAI YANGFAN, SHEN LIANG, GONG WEIBAO, WU ZHIHUA, YU DIANHAI
Format Patent
LanguageChinese
English
Published 16.08.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention provides a scheduling method and device for model training, electronic equipment and a storage medium, and relates to the field of artificial intelligence, in particular to the field of deep learning and cloud computing. According to the specific implementation scheme of the scheduling method for model training, a to-be-trained model is segmented into three model partitions which are connected in sequence; caching the three model partitions connected in sequence to three spaces forming a three-level cache space respectively; wherein the three spaces comprise a display memory space for the graphics processor, a memory space for the processor and a hard disk storage space; and in the training process of the to-be-trained model, dynamically adjusting the model partitions cached in the three spaces respectively, and scheduling to enable training for the three model partitions to be executed in an overlapping mode. Wherein the training for each model partition in the three model partitions relates to
Bibliography:Application Number: CN202210532762