DeepRecSys: A System for Optimizing End-To-End At-Scale Neural Recommendation Inference

Neural personalized recommendation is the cornerstone of a wide collection of cloud services and products, constituting significant compute demand of cloud infrastructure. Thus, improving the execution efficiency of recommendation directly translates into infrastructure capacity saving. In this pape...

Full description

Saved in:
Bibliographic Details
Published in2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA) pp. 982 - 995
Main Authors Gupta, Udit, Hsia, Samuel, Saraph, Vikram, Wang, Xiaodong, Reagen, Brandon, Wei, Gu-Yeon, Lee, Hsien-Hsin S., Brooks, David, Wu, Carole-Jean
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.05.2020
Subjects
Online AccessGet full text
DOI10.1109/ISCA45697.2020.00084

Cover

Loading…