AbdulJabbar, Mustafa
270  results:
Search for persons X
?
1

OMB-FPGA: A Microbenchmark Suite for FPGA-aware MPIs using ..:

, In: Practice and Experience in Advanced Research Computing 2024: Human Powered Computing,
 
?
2

PML-MPI: A Pre-Trained ML Framework for Efficient Collectiv..:

, In: 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),
 
?
3

Towards Accelerating k-NN with MPI and Near-Memory Processi..:

, In: 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),
Ahn, Hooyoung ; Kim, Seonyoung ; Park, Yoomi... - p. 608-615 , 2024
 
?
4

OMB-CXL: A Micro-Benchmark Suite for Evaluating MPI Communi..:

, In: Practice and Experience in Advanced Research Computing 2024: Human Powered Computing,
 
?
5

HINT: Designing Cache-Efficient MPI_Alltoall using Hybrid M..:

, In: 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Ramesh, Bharath ; Contini, Nick ; Alnaasan, Nawras... - p. 802-813 , 2024
 
?
6

Enabling Reconfigurable HPC through MPI-based Inter-FPGA Co..:

, In: Proceedings of the 37th ACM International Conference on Supercomputing,
 
?
7

Implementing and Optimizing a GPU-aware MPI Library for Int..:

, In: 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid),
 
?
8

Accelerating Distributed Deep Learning Training with Compre..:

, In: 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Zhou, Qinghua ; Anthony, Quentin ; Xu, Lang... - p. 134-144 , 2023
 
?
 
?
11

Performance Characterization of Using Quantization for DNN ..:

, In: 2023 IEEE 7th International Conference on Fog and Edge Computing (ICFEC),
Ahn, Hyunho ; Chen, Tian ; Alnaasan, Nawras... - p. 1-6 , 2023
 
?
12

Shisha: Online Scheduling of CNN Pipelines on Heterogeneous..:

, In: Parallel Processing and Applied Mathematics; Lecture Notes in Computer Science,
 
?
13

Accelerating communication with multi‐HCA aware collectives..:

Tran, Tu ; Ramesh, Bharath ; Michalowicz, Benjamin...
Concurrency and Computation: Practice and Experience.  36 (2023)  1 - p. , 2023
 
?
14

MCR-DL: Mix-and-Match Communication Runtime for Deep Learni..:

, In: 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Anthony, Quentin ; Awan, Ammar Ahmad ; Rasley, Jeff... - p. 996-1006 , 2023
 
?
15

In-Depth Evaluation of a Lower-Level Direct-Verbs API on In..:

, In: 2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),
 
1-15