Matrix Multiplication Beyond Auto-Tuning: Rewrite-based GPU Code Generation

Publication
Proceedings of the 2016 International Conference on Compilers, Architecture and Synthesis for Embedded Systems
