Khorassani, Kawthar Shafie
28  results:
Search for persons X
?
2

Designing and Optimizing GPU-aware Nonblocking MPI Neighbor..:

, In: 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
 
?
3

Implementing and Optimizing a GPU-aware MPI Library for Int..:

, In: 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid),
 
?
4

Network Assisted Non-Contiguous Transfers for GPU-Aware MPI..:

, In: 2022 IEEE Symposium on High-Performance Interconnects (HOTI),
 
?
5

Highly Efficient Alltoall and Alltoallv Communication Algor..:

, In: 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),
 
?
6

Adaptive and Hierarchical Large Message All-to-all Communic..:

, In: 2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet Computing (CCGrid),
 
?
7

NV-group : link-efficient reduction for distributed deep..:

, In: Proceedings of the 34th ACM International Conference on Supercomputing,
 
?
8

Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer..:

, In: 2020 IEEE International Conference on Cluster Computing (CLUSTER),
 
?
9

High-Performance Adaptive MPI Derived Datatype Communicatio..:

, In: 2019 IEEE 26th International Conference on High Performance Computing, Data, and Analytics (HiPC),
 
?
10

Performance Evaluation of MPI Libraries on GPU-Enabled Open..:

, In: Lecture Notes in Computer Science; High Performance Computing,
 
?
11

MPI-xCCL: A Portable MPI Library over Collective Communicat..:

, In: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis,
 
?
12

Accelerating MPI All-to-All Communication with Online Compr..:

, In: Lecture Notes in Computer Science; High Performance Computing,
Zhou, Qinghua ; Kousha, Pouya ; Anthony, Quentin... - p. 3-25 , 2022
 
?
13

High Performance MPI over the Slingshot Interconnect: Early..:

, In: Practice and Experience in Advanced Research Computing,
 
?
14

Designing a ROCm-Aware MPI Library for AMD GPUs: Early Expe..:

, In: Lecture Notes in Computer Science; High Performance Computing,
 
?
15

OMB-UM: Design, Implementation, and Evaluation of CUDA Unif..:

, In: 2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS),
 
1-15