DeepShift: Towards Multiplication-Less Neural Networks
Elhoushi, Mostafa, Chen, Zihao, Shafiq, Farhan, Tian, Ye Henry, Li, Joey Yiwei
Published in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01.06.2021)
Conference Proceeding
One-Shot Layer-Wise Accuracy Approximation For Layer Pruning
Elkerdawy, Sara, Elhoushi, Mostafa, Singh, Abhineet, Zhang, Hong, Ray, Nilanjan
Published in 2020 IEEE International Conference on Image Processing (ICIP) (01.10.2020)
Conference Proceeding
Model of a hybrid processor executing C++ with additional quantum functions
Elhoushi, Mostafa, El-Kharashi, M. Watheq, Elrefaei, Hatem
Published in Microprocessors and microsystems (01.11.2014)
Journal Article
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding
Paper
Journal Article
Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks
Paper
Journal Article
Brevity is the soul of wit: Pruning long files for code generation
Singh, Aaditya K, Yang, Yu, Tirumala, Kushal, Elhoushi, Mostafa, Morcos, Ari S
Published 29.06.2024
Journal Article
Sieve: Multimodal Dataset Pruning Using Image Captioning Models
Mahmoud, Anas, Elhoushi, Mostafa, Abbas, Amro, Yang, Yu, Ardalani, Newsha, Leather, Hugh, Morcos, Ari
Published in arXiv.org (10.03.2024)
Paper
Journal Article
Minuet: Accelerating 3D Sparse Convolutions on GPUs
Yang, Jiacheng, Giannoula, Christina, Wu, Jun, Elhoushi, Mostafa, Gleeson, James, Pekhimenko, Gennady
Published in arXiv.org (01.12.2023)
Paper
Journal Article
CHAI: Clustered Head Attention for Efficient LLM Inference
Agarwal, Saurabh, Acun, Bilge, Hosmer, Basil, Elhoushi, Mostafa, Lee, Yejin, Venkataraman, Shivaram, Papailiopoulos, Dimitris, Wu, Carole-Jean
Published in arXiv.org (27.04.2024)
Paper
Journal Article
Work-in-Progress: MLGOPerf: An ML Guided Inliner to Optimize Performance
Ashouri, Amir H., Elhoushi, Mostafa, Hua, Yuzhe, Wang, Xiang, Manzoor, Muhammad Asif, Chan, Bryan, Gao, Yaoqing
Published in 2022 International Conference on Compilers, Architecture, and Synthesis for Embedded Systems (CASES) (01.10.2022)
Conference Proceeding
Modeling a quantum processor using the QRAM model
Elhoushi, M., El-Kharashi, M. W., Elrefaei, H.
Published in Proceedings of 2011 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (01.08.2011)
Conference Proceeding
To Filter Prune, or to Layer Prune, That Is The Question
Elkerdawy, Sara, Elhoushi, Mostafa, Singh, Abhineet, Zhang, Hong, Ray, Nilanjan
Published in arXiv.org (08.11.2020)
Paper
Journal Article