Q8BERT: Quantized 8Bit BERT
Zafrir, Ofir, Boudoukh, Guy, Izsak, Peter, Wasserblat, Moshe
Published in 2019 Fifth Workshop on Energy Efficient Machine Learning and Cognitive Computing - NeurIPS Edition (EMC2-NIPS) (01.12.2019)
Published in 2019 Fifth Workshop on Energy Efficient Machine Learning and Cognitive Computing - NeurIPS Edition (EMC2-NIPS) (01.12.2019)
Get full text
Conference Proceeding
Prune Once for All: Sparse Pre-Trained Language Models
Zafrir, Ofir, Larey, Ariel, Boudoukh, Guy, Shen, Haihao, Wasserblat, Moshe
Year of Publication 10.11.2021
Year of Publication 10.11.2021
Get full text
Journal Article
Fast DistilBERT on CPUs
Shen, Haihao, Zafrir, Ofir, Dong, Bo, Meng, Hengyu, Ye, Xinyu, Wang, Zhe, Ding, Yi, Chang, Hanwen, Boudoukh, Guy, Wasserblat, Moshe
Year of Publication 27.10.2022
Year of Publication 27.10.2022
Get full text
Journal Article
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs
Shen, Haihao, Meng, Hengyu, Dong, Bo, Wang, Zhe, Zafrir, Ofir, Ding, Yi, Luo, Yu, Chang, Hanwen, Gao, Qun, Wang, Ziheng, Boudoukh, Guy, Wasserblat, Moshe
Year of Publication 28.06.2023
Year of Publication 28.06.2023
Get full text
Journal Article
Visual tracking of object silhouettes
Boudoukh, G., Leichter, I., Rivlin, E.
Published in 2009 16th IEEE International Conference on Image Processing (ICIP) (01.11.2009)
Published in 2009 16th IEEE International Conference on Image Processing (ICIP) (01.11.2009)
Get full text
Conference Proceeding
Generating Pretrained Sparse Student Model for Transfer Learning
Shen, Haihao, Wasserblat, Moshe, Zafrir, Ofir, Boudoukh, Guy, Lahrey, Ariel
Year of Publication 12.01.2023
Get full text
Year of Publication 12.01.2023
Patent
Prune Once for All: Sparse Pre-Trained Language Models
Zafrir, Ofir, Larey, Ariel, Boudoukh, Guy, Shen, Haihao, Wasserblat, Moshe
Published in arXiv.org (10.11.2021)
Get full text
Published in arXiv.org (10.11.2021)
Paper
APPARATUSES, METHODS, AND SYSTEMS FOR INSTRUCTIONS FOR STRUCTURED-SPARSE TILE MATRIX FMA
Ziv, Barukh, Gradstein, Amit, Heinecke, Alexander, Rip, Dana, Rubanovich, Simon, Jain, Nilesh, Sherman, Uri, Mizrahi, Shahar, Adelman, Menachem, Hughes, Christopher, Georganas, Evangelos, Boudoukh, Guy, Mellempudi, Naveen
Year of Publication 08.02.2024
Get full text
Year of Publication 08.02.2024
Patent
Fast DistilBERT on CPUs
Shen, Haihao, Zafrir, Ofir, Dong, Bo, Meng, Hengyu, Ye, Xinyu, Wang, Zhe, Ding, Yi, Chang, Hanwen, Boudoukh, Guy, Wasserblat, Moshe
Published in arXiv.org (06.12.2022)
Get full text
Published in arXiv.org (06.12.2022)
Paper
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs
Shen, Haihao, Meng, Hengyu, Dong, Bo, Wang, Zhe, Zafrir, Ofir, Ding, Yi, Luo, Yu, Chang, Hanwen, Gao, Qun, Wang, Ziheng, Boudoukh, Guy, Wasserblat, Moshe
Published in arXiv.org (28.06.2023)
Get full text
Published in arXiv.org (28.06.2023)
Paper
APPARATUSES, METHODS, AND SYSTEMS FOR INSTRUCTIONS FOR STRUCTURED-SPARSE TILE MATRIX FMA
CHARNEY, Mark, BOUDOUKH, Guy, VALENTINE, Robert, SPERBER, Zeev, POORNACHANDRAN, Rajesh, SHEMY, Regev, ADELMAN, Menachem, HEINECKE, Alexander, ZIV, Barukh, HUGHES, Christopher, GRADSTEIN, Amit, RAPPOPORT, Rinat, POLLAK, Yaroslav, GANESH, Brinda, GEORGANAS, Evangelos, NARKIS, Arik, BAUM, Dan, RUBANOVICH, Simon, AKHAURI, Yash, JAIN, Nilesh
Year of Publication 30.03.2023
Get full text
Year of Publication 30.03.2023
Patent
APPARATUSES, METHODS, AND SYSTEMS FOR INSTRUCTIONS FOR STRUCTURED-SPARSE TILE MATRIX FMA
CHARNEY, Mark, BOUDOUKH, Guy, VALENTINE, Robert, SPERBER, Zeev, POORNACHANDRAN, Rajesh, SHEMY, Regev, ADELMAN, Menachem, HEINECKE, Alexander, ZIV, Barukh, HUGHES, Christopher, GRADSTEIN, Amit, RAPPOPORT, Rinat, POLLAK, Yaroslav, GANESH, Brinda, GEORGANAS, Evangelos, NARKIS, Arik, BAUM, Dan, RUBANOVICH, Simon, AKHAURI, Yash, JAIN, Nilesh
Year of Publication 29.03.2023
Get full text
Year of Publication 29.03.2023
Patent
Matrix multiplication acceleration of sparse matrices using column folding and squeezing
Yang, Andrew, Koren, Chen, Rotzin, Michael, Azizi, Omid, Nurvitadhi, Eriko, Boudoukh, Guy, Werner, Tony
Year of Publication 14.04.2020
Get full text
Year of Publication 14.04.2020
Patent
MATRIX MULTIPLICATION ACCELERATION OF SPARSE MATRICES USING COLUMN FOLDING AND SQUEEZING
AZIZI, Omid, BOUDOUKH, Guy, ROTZIN, Michael, KOREN, Chen, WERNER, Tony, NURVITADHI, Eriko, YANG, Andrew
Year of Publication 07.02.2019
Get full text
Year of Publication 07.02.2019
Patent
System and method for suspect search
Yehezkel, Raanan Yonatan, Goldner, Vladimir, Blumstein-Koren, Guy, Gurwicz, Yaniv, Girmonsky, Doron, Boudoukh, Guy
Year of Publication 26.06.2018
Get full text
Year of Publication 26.06.2018
Patent
SYSTEM AND METHOD FOR SUSPECT SEARCH
GURWICZ Yaniv, YEHEZKEL Raanan Yonatan, BLUMSTEIN-KOREN Guy, BOUDOUKH Guy, GIRMONSKY Doron, GOLDNER Vladimir
Year of Publication 20.04.2017
Get full text
Year of Publication 20.04.2017
Patent
Matrix multiplication acceleration of sparse matrices using column folding and extrusion
AZIZI OMID, YANG ALICE, NURVITADHI ERIKO, WARNER TYLER, ROTZIN MICHAEL, BOUDOUKH GUY, KOREN CHEN
Year of Publication 07.12.2021
Get full text
Year of Publication 07.12.2021
Patent