Parallel Efficient Sparse Matrix-Matrix Multiplication on Multicore Platforms.
Md. Mostofa Ali Patwary(Intel (United Kingdom)), Pradeep Dubey(Intel (United Kingdom)), Vadim Pirogov, Dipankar Das(Malaviya National Institute of Technology Jaipur), Jongsoo Park(Intel (United States)), Michael J. Anderson(Intel (United Kingdom)), Satya Gautam Vadlamudi(Intel (United Kingdom)), Narayanan Sundaram(Intel (United Kingdom)), Nadathur Satish(Intel (United Kingdom)), Sergey Pudov(Intel (United States))
ISC
January 1, 2015
Cited by 3
Related Papers
SIGMA: A Sparse and Irregular GEMM Accelerator with Flexible Interconnects for DNN Training
|Unknown|2020|462
GraphMat
|Proceedings of the VLDB Endowment|2015|292
Mixed Precision Training of Convolutional Neural Networks using Integer Operations
|ArXiv.org|2018|115
Ternary Neural Networks with Fine-Grained Quantization
|arXiv (Cornell University)|2017|61
Parallel Efficient Sparse Matrix-Matrix Multiplication on Multicore Platforms
|Lecture notes in computer science|2015|59