A Survey of Deep Learning on CPUs: Opportunities and Co-Optimizations
Mittal, Sparsh, Rajput, Poonam, Subramoney, Sreenivas
Published in IEEE transaction on neural networks and learning systems (01.10.2022)
Published in IEEE transaction on neural networks and learning systems (01.10.2022)
Get full text
Journal Article
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Jeong, Geonhwa, Damani, Sana, Bambhaniya, Abhimanyu Rajeshkumar, Qin, Eric, Hughes, Christopher J., Subramoney, Sreenivas, Kim, Hyesoon, Krishna, Tushar
Published in 2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA) (01.02.2023)
Published in 2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA) (01.02.2023)
Get full text
Conference Proceeding
PrxCa1−xMnO3 based stochastic neuron for Boltzmann machine to solve “maximum cut” problem
Khilwani, Devesh, Moghe, Vineet, Lashkare, Sandip, Saraswat, Vivek, Kumbhare, Pankaj, Shojaei Baghini, Maryam, Jandhyala, Srivatsava, Subramoney, Sreenivas, Ganguly, Udayan
Published in APL materials (01.09.2019)
Published in APL materials (01.09.2019)
Get full text
Journal Article
Early Prediction of DNN Activation Using Hierarchical Computations
Suresh, Bharathwaj, Pillai, Kamlesh, Kalsi, Gurpreet Singh, Abuhatzera, Avishaii, Subramoney, Sreenivas
Published in Mathematics (Basel) (01.12.2021)
Published in Mathematics (Basel) (01.12.2021)
Get full text
Journal Article
Taming Server Memory TCO with Multiple Software-Defined Compressed Tiers
Get full text
Paper
Journal Article
Motivating Next-Generation OS Physical Memory Management for Terabyte-Scale NVMMs
Garg, Shivank, Prasad, Aravinda, Mishra, Debadatta, Sreenivas Subramoney
Published in arXiv.org (05.10.2023)
Published in arXiv.org (05.10.2023)
Get full text
Paper
Journal Article
QCQA: Quality and Capacity-aware grouped Query Attention
Joshi, Vinay, Laddha, Prashant, Sinha, Shambhavi, Om Ji Omer, Sreenivas Subramoney
Published in arXiv.org (08.06.2024)
Published in arXiv.org (08.06.2024)
Get full text
Paper
Journal Article
CiMNet: Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware
Kundu, Souvik, Anthony, Sarah, Joshi, Vinay, Omer, Om J, Sreenivas Subramoney
Published in arXiv.org (18.03.2024)
Published in arXiv.org (18.03.2024)
Get full text
Paper
Journal Article
Reclaimer: A Reinforcement Learning Approach to Dynamic Resource Allocation for Cloud Microservices
Fettes, Quintin, Karanth, Avinash, Bunescu, Razvan, Beckwith, Brandon, Sreenivas Subramoney
Published in arXiv.org (17.04.2023)
Published in arXiv.org (17.04.2023)
Get full text
Paper
Journal Article
Constable: Improving Performance and Power Efficiency by Safely Eliminating Load Instruction Execution
Bera, Rahul, Ranganathan, Adithya, Rakshit, Joydeep, Mahto, Sujit, Nori, Anant V, Gaur, Jayesh, Olgun, Ataberk, Kanellopoulos, Konstantinos, Sadrosadati, Mohammad, Sreenivas Subramoney, Mutlu, Onur
Published in arXiv.org (26.06.2024)
Published in arXiv.org (26.06.2024)
Get full text
Paper
Journal Article
Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion
Thyagharajan, Anirud, Ummenhofer, Benjamin, Laddha, Prashant, Omer, Om J, Sreenivas Subramoney
Published in arXiv.org (16.11.2021)
Published in arXiv.org (16.11.2021)
Get full text
Paper
Journal Article
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Jeong, Geonhwa, Damani, Sana, Abhimanyu Rajeshkumar Bambhaniya, Qin, Eric, Hughes, Christopher J, Sreenivas Subramoney, Kim, Hyesoon, Krishna, Tushar
Published in arXiv.org (23.02.2023)
Published in arXiv.org (23.02.2023)
Get full text
Paper
Journal Article
Page Table Management for Heterogeneous Memory Systems
Kumar, Sandeep, Prasad, Aravinda, Sarangi, Smruti R, Sreenivas Subramoney
Published in arXiv.org (16.03.2021)
Published in arXiv.org (16.03.2021)
Get full text
Paper
Journal Article
ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis
Firtina, Can, Pillai, Kamlesh, Kalsi, Gurpreet S, Bharathwaj Suresh, Cali, Damla Senol, Kim, Jeremie, Shahroodi, Taha, Cavlak, Meryem Banu, Lindegger, Joel, Alser, Mohammed, Juan Gómez Luna, Sreenivas Subramoney, Mutlu, Onur
Published in arXiv.org (21.10.2023)
Published in arXiv.org (21.10.2023)
Get full text
Paper
Journal Article
RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU
Jeong, Geonhwa, Qin, Eric, Samajdar, Ananda, Hughes, Christopher J, Sreenivas Subramoney, Kim, Hyesoon, Krishna, Tushar
Published in arXiv.org (05.10.2021)
Published in arXiv.org (05.10.2021)
Get full text
Paper
Journal Article
Proximu$: Efficiently Scaling DNN Inference in Multi-core CPUs through Near-Cache Compute
Nori, Anant V, Bera, Rahul, Balachandran, Shankar, Rakshit, Joydeep, Omer, Om J, Abuhatzera, Avishaii, Belliappa Kuttanna, Sreenivas Subramoney
Published in arXiv.org (03.12.2020)
Published in arXiv.org (03.12.2020)
Get full text
Paper
Journal Article
AccSS3D: Accelerator for Spatially Sparse 3D DNNs
Om Ji Omer, Laddha, Prashant, Kalsi, Gurpreet S, Thyagharajan, Anirud, Pillai, Kamlesh R, Kulkarni, Abhimanyu, Yao, Anbang, Chen, Yurong, Sreenivas Subramoney
Published in arXiv.org (25.11.2020)
Published in arXiv.org (25.11.2020)
Get full text
Paper
Journal Article
SeGraM: A Universal Hardware Accelerator for Genomic Sequence-to-Graph and Sequence-to-Sequence Mapping
Cali, Damla Senol, Kanellopoulos, Konstantinos, Lindegger, Joel, Bingöl, Zülal, Kalsi, Gurpreet S, Zuo, Ziyi, Firtina, Can, Cavlak, Meryem Banu, Kim, Jeremie, Nika Mansouri Ghiasi, Singh, Gagandeep, Gómez-Luna, Juan, Nour Almadhoun Alserr, Alser, Mohammed, Sreenivas Subramoney, Alkan, Can, Ghose, Saugata, Mutlu, Onur
Published in arXiv.org (31.05.2022)
Published in arXiv.org (31.05.2022)
Get full text
Paper
Journal Article