Vyzkoušejte nový nástroj s podporou AI
Summon Research Assistant
BETA
Parallel GEMM-based convolution for deep learning on multicore RISC-V processors
Ramírez, Cristian, Castelló, Adrián, Martínez, Héctor, Quintana-Ortí, Enrique S.
Published in The Journal of supercomputing (01.06.2024)
Published in The Journal of supercomputing (01.06.2024)
Get full text
Journal Article
OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA
Castro, Roberto L., Andrade, Diego, Fraguela, Basilio B.
Published in Mathematics (Basel) (01.09.2021)
Published in Mathematics (Basel) (01.09.2021)
Get full text
Journal Article
Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor Cores
Kim, Hyeonjin, Ahn, Sungwoo, Oh, Yunho, Kim, Bogil, Ro, Won Woo, Song, William J.
Published in 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (01.10.2020)
Published in 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (01.10.2020)
Get full text
Conference Proceeding
High Performance and Portable Convolution Operators for Multicore Processors
San Juan, Pablo, Castello, Adrian, Dolz, Manuel F., Alonso-Jorda, Pedro, Quintana-Orti, Enrique S.
Published in Proceedings (Symposium on Computer Architecture and High Performance Computing) (01.09.2020)
Published in Proceedings (Symposium on Computer Architecture and High Performance Computing) (01.09.2020)
Get full text
Conference Proceeding
Complete photonic tensor convolution driven by single dataflow
Tang, Kaifei, Wang, Jiantao, Ji, Xiang, Liu, Jiahui, Xin, Yu, Cao, Haijiang, Zeng, Zhaobang, Xiao, Rulei, Jiang, Wei
Published in 2023 Asia Communications and Photonics Conference/2023 International Photonics and Optoelectronics Meetings (ACP/POEM) (04.11.2023)
Published in 2023 Asia Communications and Photonics Conference/2023 International Photonics and Optoelectronics Meetings (ACP/POEM) (04.11.2023)
Get full text
Conference Proceeding
UniWiG: Unified Winograd-GEMM Architecture for Accelerating CNN on FPGAs
Get full text
Conference Proceeding
Optimization of Direct Convolution Algorithms on ARM Processors for Deep Learning Inference
Li, Shang, Yu, Fei, Zhang, Shankou, Yin, Huige, Lin, Hairong
Published in Mathematics (Basel) (01.03.2025)
Published in Mathematics (Basel) (01.03.2025)
Get full text
Journal Article
Photonic Tensor Processing Unit With Single Dataflow and Programmable High-Precision Weighting Control
Tang, Kaifei, Wang, Jiantao, Xu, Wenqu, Ji, Xiang, Liu, Jiahui, Huang, Xiaobin, Xin, Yu, Dai, Pan, Sun, Guozhu, Zeng, Zhaobang, Xiao, Rulei, Chen, Xiangfei, Jiang, Wei
Published in Journal of lightwave technology (15.01.2024)
Published in Journal of lightwave technology (15.01.2024)
Get full text
Journal Article
High Performance and Portable Convolution Operators for ARM-based Multicore Processors
Juan, Pablo San, Castelló, Adrián, Dolz, Manuel F, Alonso-Jordá, Pedro, Quintana-Ortí, Enrique S
Year of Publication 13.05.2020
Year of Publication 13.05.2020
Get full text
Journal Article
Computing large 2D convolutions on GPU efficiently with the im2tensor algorithm
Seznec, Mickaël, Gac, Nicolas, Orieux, François, Sashala Naik, Alvin
Published in Journal of real-time image processing (01.12.2022)
Published in Journal of real-time image processing (01.12.2022)
Get full text
Journal Article
Efficient Realization of Householder Transform Through Algorithm-Architecture Co-Design for Acceleration of QR Factorization
Merchant, Farhad, Vatwani, Tarun, Chattopadhyay, Anupam, Raha, Soumyendu, Nandy, S. K., Narayan, Ranjani
Published in IEEE transactions on parallel and distributed systems (01.08.2018)
Published in IEEE transactions on parallel and distributed systems (01.08.2018)
Get full text
Journal Article
Efficient Realization of Householder Transform through Algorithm-Architecture Co-design for Acceleration of QR Factorization
Merchant, Farhad, Vatwani, Tarun, Chattopadhyay, Anupam, Raha, Soumyendu, Nandy, S K, Narayan, Ranjani
Published in arXiv.org (14.12.2016)
Published in arXiv.org (14.12.2016)
Get full text
Paper
Journal Article
NeuralMatrix: Compute the Entire Neural Networks with Linear Matrix Operations for Efficient Inference
Sun, Ruiqi, Ye, Siwei, Zhao, Jie, He, Xin, Lin, Jianzhe, Li, Yiran, Zou, An
Year of Publication 23.05.2023
Year of Publication 23.05.2023
Get full text
Journal Article