Search for persons
X
?
2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS) ,
1
Fast multiplication of random dense matrices with sparse ma..:
, In:
?
Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures ,
2
Distributed-Memory Randomized Algorithms for Sparse Tensor ..:
, In:
?
Proceedings of the Platform for Advanced Scientific Computing Conference ,
5
Communication bounds for convolutional neural networks:
, In:
?
2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS) ,
6
Distributed-Memory Sparse Kernels for Machine Learning:
, In:
?
2022 IEEE/ACM Sixth International Workshop on Software Correctness for HPC Applications (Correctness) ,
7
Proposed Consistent Exception Handling for the BLAS and LAP..:
, In:
?
Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming ,
8
Dynamic scaling for low-precision learning:
, In:
?
Proceedings of the 48th Annual International Symposium on Computer Architecture ,
10
CoSA : scheduling by constrained optimization for spatia..:
, In:
?
Lecture Notes in Computer Science; High Performance Computing ,
11
Auto-Precision Scaling for Distributed Deep Learning:
, In:
?
2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA) ,
14
CoSA: Scheduling by Constrained Optimization for Spatial Ac..:
, In:
?
Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures ,
15