MCM-GPU: Multi-chip-module GPUs for continued performance scalability
Arunkumar, Akhil, Bolotin, Evgeny, Cho, Benjamin, Milic, Ugljesa, Ebrahimi, Eiman, Villa, Oreste, Jaleel, Aamer, Wu, Carole-Jean, Nellans, David
Published in 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) (01.06.2017)
Published in 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) (01.06.2017)
Get full text
Conference Proceeding
Beyond the socket: NUMA-aware GPUs
Milic, Ugljesa, Villa, Oreste, Bolotin, Evgeny, Arunkumar, Akhil, Ebrahimi, Eiman, Jaleel, Aamer, Ramirez, Alex, Nellans, David
Published in 2017 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (14.10.2017)
Published in 2017 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (14.10.2017)
Get full text
Conference Proceeding
Publication
Understanding the Future of Energy Efficiency in Multi-Module GPUs
Arunkumar, Akhil, Bolotin, Evgeny, Nellans, David, Wu, Carole-Jean
Published in 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA) (01.02.2019)
Published in 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA) (01.02.2019)
Get full text
Conference Proceeding
CAWA: Coordinated warp scheduling and Cache Prioritization for critical warp acceleration of GPGPU workloads
Shin-Ying Lee, Arunkumar, Akhil, Wu, Carole-Jean
Published in 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA) (13.06.2015)
Published in 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA) (13.06.2015)
Get full text
Conference Proceeding
LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs
Arunkumar, Akhil, Lee, Shin-Ying, Soundararajan, Vignesh, Wu, Carole-Jean
Published in 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA) (01.02.2018)
Published in 2018 IEEE International Symposium on High Performance Computer Architecture (HPCA) (01.02.2018)
Get full text
Conference Proceeding
Using Low Cost Erasure and Error Correction Schemes to Improve Reliability of Commodity DRAM Systems
Hsing-Min Chen, Jeloka, Supreet, Arunkumar, Akhil, Blaauw, David, Wu, Carole-Jean, Mudge, Trevor, Chakrabarti, Chaitali
Published in IEEE transactions on computers (01.12.2016)
Published in IEEE transactions on computers (01.12.2016)
Get full text
Journal Article
ReMAP: Reuse and memory access cost aware eviction policy for last level cache management
Arunkumar, Akhil, Wu, Carole-Jean
Published in 2014 IEEE 32nd International Conference on Computer Design (ICCD) (01.10.2014)
Published in 2014 IEEE 32nd International Conference on Computer Design (ICCD) (01.10.2014)
Get full text
Conference Proceeding
Keyformer: KV Cache Reduction through Key Tokens Selection for Efficient Generative Inference
Adnan, Muhammad, Arunkumar, Akhil, Jain, Gaurav, Nair, Prashant J, Soloveychik, Ilya, Kamath, Purushotham
Year of Publication 13.03.2024
Year of Publication 13.03.2024
Get full text
Journal Article
DORA: Optimizing Smartphone Energy Efficiency and Web Browser Performance under Interference
Shingari, Davesh, Arunkumar, Akhil, Gaudette, Benjamin, Vrudhula, Sarma, Wu, Carole-Jean
Published in 2018 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (01.04.2018)
Published in 2018 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (01.04.2018)
Get full text
Conference Proceeding