Parallel Efficient Sparse Matrix-Matrix Multiplication on Multicore Platforms
Md. Mostofa Ali Patwary(Intel (United Kingdom)), Pradeep Dubey(Intel (United Kingdom)), Vadim Pirogov, Dipankar Das(Malaviya National Institute of Technology Jaipur), Jongsoo Park(Intel (United States)), Michael J. Anderson(Intel (United Kingdom)), Satya Gautam Vadlamudi(Intel (United Kingdom)), Narayanan Sundaram(Intel (United Kingdom)), Nadathur Satish(Intel (United Kingdom)), Sergey Pudov(Intel (United States))
Cited by 59
Related Papers
SIGMA: A Sparse and Irregular GEMM Accelerator with Flexible Interconnects for DNN Training
|Unknown|2020|462
GraphMat
|Proceedings of the VLDB Endowment|2015|292
Mixed Precision Training of Convolutional Neural Networks using Integer Operations
|ArXiv.org|2018|115
Ternary Neural Networks with Fine-Grained Quantization
|arXiv (Cornell University)|2017|61
CA-Net: A Novel Cascaded Attention-Based Network for Multistage Glaucoma Classification Using Fundus Images
|IEEE Transactions on Instrumentation and Measurement|2023|48