Search for persons
X
?
2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) ,
1
Block-based GPU Programming with Triton:
, In:
?
Proceedings of the 3rd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages ,
2
Triton: an intermediate language and compiler for tiled neu..:
, In:
?
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis ,
3
Input-aware auto-tuning of compute-bound HPC kernels:
, In:
?
Proceedings of the International Workshop on OpenCL 2013 & 2014 ,
5
Performance portability study of linear algebra kernels in ..:
, In:
?
Proceedings of the 5th USENIX Conference on Hot Topics in Parallelism ,
7
Towards performance-portable, scalable, and convenient line..:
, In:
?
Proceedings of the 2012 Symposium on High Performance Computing ,
8