CONVOLUTIONAL NEURAL NETWORK OPTIMIZATION MECHANISM
Embodiments provide systems and methods which facilitate optimization of a convolutional neural network (CNN). One embodiment provides for a non-transitory machine-readable medium storing instructions that cause one or more processors to perform operations comprising processing a trained convolution...
Saved in:
Main Authors | , , , , , , , , , , , , , , , , |
---|---|
Format | Patent |
Language | English |
Published |
20.06.2019
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Embodiments provide systems and methods which facilitate optimization of a convolutional neural network (CNN). One embodiment provides for a non-transitory machine-readable medium storing instructions that cause one or more processors to perform operations comprising processing a trained convolutional neural network (CNN) to generate a processed CNN, the trained CNN having weights in a floating-point format. Processing the trained CNN includes quantizing the weights in the floating-point format to generate weights in an integer format. Quantizing the weights includes generating a quantization table to enable non-uniform quantization of the weights and quantizing the weights from the floating-point format to the integer format using the quantization table. The operations additionally comprise performing an inference operation utilizing the processed CNN with the integer format weights. |
---|---|
Bibliography: | Application Number: US201916283021 |