CONVOLUTIONAL NEURAL NETWORK OPTIMIZATION MECHANISM

Embodiments provide systems and methods which facilitate optimization of a convolutional neural network (CNN). One embodiment provides for a non-transitory machine-readable medium storing instructions that cause one or more processors to perform operations comprising processing a trained convolution...

Full description

Saved in:
Bibliographic Details
Main Authors Chen, Xiaoming, Ould-Ahmed-Vall, Elmoustapha, Yao, Anbang, Jin, Jingyi, Lakshmanan, Barath, Ashbaugh, Ben J, Appu, Abhishek R, Tang, Ping T, Nealis, Kevin, Koker, Altug, Srivastava, Dhawal, Ray, Joydeep, Ma, Liwei, Macpherson, Mike B, Bottleson, Jeremy, Strickland, Michael S, Shpeisman, Tatiana
Format Patent
LanguageEnglish
Published 20.06.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Embodiments provide systems and methods which facilitate optimization of a convolutional neural network (CNN). One embodiment provides for a non-transitory machine-readable medium storing instructions that cause one or more processors to perform operations comprising processing a trained convolutional neural network (CNN) to generate a processed CNN, the trained CNN having weights in a floating-point format. Processing the trained CNN includes quantizing the weights in the floating-point format to generate weights in an integer format. Quantizing the weights includes generating a quantization table to enable non-uniform quantization of the weights and quantizing the weights from the floating-point format to the integer format using the quantization table. The operations additionally comprise performing an inference operation utilizing the processed CNN with the integer format weights.
Bibliography:Application Number: US201916283021