Subramoni, Hari
86  results:
Search for persons X
?
1

Benchmarking Modern Databases for Storing and Profiling Ver..:

, In: Benchmarking, Measuring, and Optimizing; Lecture Notes in Computer Science,
Kousha, Pouya ; Zhou, Qinghua ; Subramoni, Hari. - p. 104-119 , 2024
 
?
2

HINT: Designing Cache-Efficient MPI_Alltoall using Hybrid M..:

, In: 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Ramesh, Bharath ; Contini, Nick ; Alnaasan, Nawras... - p. 802-813 , 2024
 
?
3

Infer-HiRes: Accelerating Inference for High-Resolution Ima..:

, In: Practice and Experience in Advanced Research Computing 2024: Human Powered Computing,
 
?
4

Exploiting Inter-Layer Expert Affinity for Accelerating Mix..:

, In: 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Yao, Jinghan ; Anthony, Quentin ; Shafi, Aamir.. - p. 915-925 , 2024
 
?
5

OMB-FPGA: A Microbenchmark Suite for FPGA-aware MPIs using ..:

, In: Practice and Experience in Advanced Research Computing 2024: Human Powered Computing,
 
?
6

Design and Implementation of an IPC-based Collective MPI Li..:

, In: Practice and Experience in Advanced Research Computing 2024: Human Powered Computing,
 
?
7

OMB-CXL: A Micro-Benchmark Suite for Evaluating MPI Communi..:

, In: Practice and Experience in Advanced Research Computing 2024: Human Powered Computing,
 
?
8

A Novel Framework for Efficient Offloading of Communication..:

, In: 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
 
?
9

In-Depth Evaluation of a Lower-Level Direct-Verbs API on In..:

, In: 2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),
 
?
 
?
11

DPU-Bench: A Micro-Benchmark Suite to Measure Offload Effic..:

, In: Practice and Experience in Advanced Research Computing 2023: Computing for the Common Good,
 
?
12

Performance Characterization of Using Quantization for DNN ..:

, In: 2023 IEEE 7th International Conference on Fog and Edge Computing (ICFEC),
Ahn, Hyunho ; Chen, Tian ; Alnaasan, Nawras... - p. 1-6 , 2023
 
?
13

Accelerating Distributed Deep Learning Training with Compre..:

, In: 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Zhou, Qinghua ; Anthony, Quentin ; Xu, Lang... - p. 134-144 , 2023
 
?
14

Optimizing Amber for Device-to-Device GPU Communication:

, In: Practice and Experience in Advanced Research Computing 2023: Computing for the Common Good,
Khuvis, Samuel ; Tomko, Karen ; Brozell, Scott R.... - p. 200-205 , 2023
 
?
15

Democratizing HPC Access and Use with Knowledge Graphs:

, In: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis,
 
1-15