Machine Learning at Facebook: Understanding Inference at the Edge
Wu, Carole-Jean, Brooks, David, Chen, Kevin, Chen, Douglas, Choudhury, Sy, Dukhan, Marat, Hazelwood, Kim, Isaac, Eldad, Jia, Yangqing, Jia, Bill, Leyvand, Tommer, Lu, Hao, Lu, Yang, Qiao, Lin, Reagen, Brandon, Spisak, Joe, Sun, Fei, Tulloch, Andrew, Vajda, Peter, Wang, Xiaodong, Wang, Yanghan, Wasti, Bram, Wu, Yiming, Xian, Ran, Yoo, Sungjoo, Zhang, Peizhao
Published in 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA) (01.02.2019)
Conference Proceeding
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Cummins, Chris, Wasti, Bram, Guo, Jiadong, Cui, Brandon, Ansel, Jason, Gomez, Sahir, Jain, Somya, Liu, Jia, Teytaud, Olivier, Steiner, Benoit, Tian, Yuandong, Leather, Hugh
Published in 2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) (02.04.2022)
Conference Proceeding
LoopTune: Optimizing Tensor Computations with Reinforcement Learning
Grubisic, Dejan, Wasti, Bram, Cummins, Chris, Mellor-Crummey, John, Zlateski, Aleksandar
Published in arXiv.org (08.11.2023)
Paper
Journal Article
LoopStack: a Lightweight Tensor Algebra Compiler Stack
Wasti, Bram, Cambronero, José Pablo, Steiner, Benoit, Leather, Hugh, Zlateski, Aleksandar
Published in arXiv.org (02.05.2022)
Paper
Journal Article
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
Elhoushi, Mostafa, Shrivastava, Akshat, Liskovich, Diana, Hosmer, Basil, Wasti, Bram, Lai, Liangzhen, Mahmoud, Anas, Acun, Bilge, Agarwal, Saurabh, Ahmed, Roman, Aly, Ahmed A, Chen, Beidi, Wu, Carole-Jean
Published in arXiv.org (29.04.2024)
Paper
Journal Article
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Cummins, Chris, Wasti, Bram, Guo, Jiadong, Cui, Brandon, Ansel, Jason, Gomez, Sahir, Jain, Somya, Liu, Jia, Teytaud, Olivier, Steiner, Benoit, Tian, Yuandong, Leather, Hugh
Published in arXiv.org (22.12.2021)
Paper
Journal Article