Data-Free Quantization Through Weight Equalization and Bias Correction
Nagel, Markus, Baalen, Mart Van, Blankevoort, Tijmen, Welling, Max
Published in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (01.10.2019)
Published in 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (01.10.2019)
Get full text
Conference Proceeding
QBitOpt: Fast and Accurate Bitwidth Reallocation during Training
Peters, Jorn, Fournarakis, Marios, Nagel, Markus, Van Baalen, Mart, Blankevoort, Tijmen
Published in 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) (02.10.2023)
Published in 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) (02.10.2023)
Get full text
Conference Proceeding
Cyclical Pruning for Sparse Neural Networks
Srinivas, Suraj, Kuzmin, Andrey, Nagel, Markus, van Baalen, Mart, Skliar, Andrii, Blankevoort, Tijmen
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01.06.2022)
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01.06.2022)
Get full text
Conference Proceeding
The LLM Surgeon
van der Ouderaa, Tycho F. A, Nagel, Markus, van Baalen, Mart, Asano, Yuki M, Blankevoort, Tijmen
Year of Publication 28.12.2023
Year of Publication 28.12.2023
Get full text
Journal Article
QBitOpt: Fast and Accurate Bitwidth Reallocation during Training
Peters, Jorn, Fournarakis, Marios, Nagel, Markus, van Baalen, Mart, Blankevoort, Tijmen
Year of Publication 10.07.2023
Year of Publication 10.07.2023
Get full text
Journal Article
Pruning vs Quantization: Which is Better?
Kuzmin, Andrey, Nagel, Markus, van Baalen, Mart, Behboodi, Arash, Blankevoort, Tijmen
Year of Publication 06.07.2023
Year of Publication 06.07.2023
Get full text
Journal Article
GPTVQ: The Blessing of Dimensionality for LLM Quantization
van Baalen, Mart, Kuzmin, Andrey, Nagel, Markus, Couperus, Peter, Bastoul, Cedric, Mahurin, Eric, Blankevoort, Tijmen, Whatmough, Paul
Year of Publication 23.02.2024
Year of Publication 23.02.2024
Get full text
Journal Article
A Practical Mixed Precision Algorithm for Post-Training Quantization
Pandey, Nilesh Prasad, Nagel, Markus, van Baalen, Mart, Huang, Yin, Patel, Chirag, Blankevoort, Tijmen
Year of Publication 10.02.2023
Year of Publication 10.02.2023
Get full text
Journal Article
Simulated Quantization, Real Power Savings
van Baalen, Mart, Kahne, Brian, Mahurin, Eric, Kuzmin, Andrey, Skliar, Andrii, Nagel, Markus, Blankevoort, Tijmen
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01.06.2022)
Published in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01.06.2022)
Get full text
Conference Proceeding
Rapid Switching and Multi-Adapter Fusion via Sparse High Rank Adapters
Bhardwaj, Kartikeya, Pandey, Nilesh Prasad, Priyadarshi, Sweta, Ganapathy, Viswanath, Esteves, Rafael, Kadambi, Shreya, Borse, Shubhankar, Whatmough, Paul, Garrepalli, Risheek, Van Baalen, Mart, Teague, Harris, Nagel, Markus
Year of Publication 22.07.2024
Year of Publication 22.07.2024
Get full text
Journal Article
Sparse High Rank Adapters
Bhardwaj, Kartikeya, Pandey, Nilesh Prasad, Priyadarshi, Sweta, Ganapathy, Viswanath, Esteves, Rafael, Kadambi, Shreya, Borse, Shubhankar, Whatmough, Paul, Garrepalli, Risheek, Van Baalen, Mart, Teague, Harris, Nagel, Markus
Year of Publication 18.06.2024
Year of Publication 18.06.2024
Get full text
Journal Article
FP8 Quantization: The Power of the Exponent
Kuzmin, Andrey, Van Baalen, Mart, Ren, Yuwei, Nagel, Markus, Peters, Jorn, Blankevoort, Tijmen
Year of Publication 19.08.2022
Year of Publication 19.08.2022
Get full text
Journal Article
Cyclical Pruning for Sparse Neural Networks
Srinivas, Suraj, Kuzmin, Andrey, Nagel, Markus, van Baalen, Mart, Skliar, Andrii, Blankevoort, Tijmen
Year of Publication 02.02.2022
Year of Publication 02.02.2022
Get full text
Journal Article
FP8 versus INT8 for efficient deep learning inference
van Baalen, Mart, Kuzmin, Andrey, Nair, Suparna S, Ren, Yuwei, Mahurin, Eric, Patel, Chirag, Subramanian, Sundar, Lee, Sanghyuk, Nagel, Markus, Soriaga, Joseph, Blankevoort, Tijmen
Year of Publication 31.03.2023
Year of Publication 31.03.2023
Get full text
Journal Article
A White Paper on Neural Network Quantization
Nagel, Markus, Fournarakis, Marios, Amjad, Rana Ali, Bondarenko, Yelysei, van Baalen, Mart, Blankevoort, Tijmen
Year of Publication 15.06.2021
Year of Publication 15.06.2021
Get full text
Journal Article
Bayesian Bits: Unifying Quantization and Pruning
van Baalen, Mart, Louizos, Christos, Nagel, Markus, Amjad, Rana Ali, Wang, Ying, Blankevoort, Tijmen, Welling, Max
Year of Publication 14.05.2020
Year of Publication 14.05.2020
Get full text
Journal Article
Up or Down? Adaptive Rounding for Post-Training Quantization
Nagel, Markus, Amjad, Rana Ali, van Baalen, Mart, Louizos, Christos, Blankevoort, Tijmen
Year of Publication 22.04.2020
Year of Publication 22.04.2020
Get full text
Journal Article