PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices | Xiaolong Ma, Yanzhi Wang, Kaisheng Ma et al. | Proceedings of the AAAI Conference on Artificial Intelligence | 2020 | Cited by 188
DNNFusion: accelerating deep neural networks execution with advanced operator fusion | Wei Niu, Bin Ren, Yanzhi Wang et al. | Unknown | 2021 | Cited by 152
SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning | Zhenglun Kong, Yanzhi Wang, Geng Yuan et al. | Lecture Notes in Computer Science | 2022 | Cited by 143
Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges | Fei Dou, Wen‐Zhan Song, Chenjiao Tan et al. | arXiv (Cornell University) | 2023 | Cited by 31