Multi-objective auto tuning for layer fusion and tensor tiling on multi-level cache hierarchy
A method of performing automatic tuning on a deep learning model includes: utilizing an instruction-based learned cost model to estimate a first type of operational performance metrics based on a tuned configuration of layer fusion and tensor tiling; utilizing statistical data gathered during a comp...
Saved in:
Main Authors | , , , , , , , , , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
16.04.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | A method of performing automatic tuning on a deep learning model includes: utilizing an instruction-based learned cost model to estimate a first type of operational performance metrics based on a tuned configuration of layer fusion and tensor tiling; utilizing statistical data gathered during a compilation process of the deep learning model to determine a second type of operational performance metrics based on the tuned configuration of layer fusion and tensor tiling; performing an auto-tuning process to obtain a plurality of optimal configurations based on the first type of platform metrics and the second type of platform metrics; and configure the deep learning model according to one of the plurality of optimal configurations. |
---|---|
Bibliography: | Application Number: TW202312138646 |