Subramoni, Hari
108 results:
1

Benchmarking Modern Databases for Storing and Profiling Ver..:

In: Benchmarking, Measuring, and Optimizing; Lecture Notes in Computer Science,
Kousha, Pouya ; Zhou, Qinghua ; Subramoni, Hari. - p. 104-119 , 2024
 
2

Performance Characterization of Using Quantization for DNN ..:

In: 2023 IEEE 7th International Conference on Fog and Edge Computing (ICFEC),
Ahn, Hyunho ; Chen, Tian ; Alnaasan, Nawras... - p. 1-6 , 2023
 
3

SAI: AI-Enabled Speech Assistant Interface for Science Gate..:

In: Lecture Notes in Computer Science; High Performance Computing,
Kousha, Pouya ; Jain, Arpan ; Kolli, Ayyappa... - p. 402-424 , 2023
 
4

MPI4Spark Meets YARN: Enhancing MPI4Spark through YARN supp..:

In: 2023 IEEE International Conference on Big Data (BigData),
Al-Attar, Kinan ; Shafi, Aamir ; Subramoni, Hari. - p. 2265-2274 , 2023
 
6

Accelerating communication with multi‐HCA aware collectives..:

Tran, Tu ; Ramesh, Bharath ; Michalowicz, Benjamin...
Concurrency and Computation: Practice and Experience.  36 (2023)  1 - p. , 2023
 
7

Battle of the BlueFields: An In-Depth Comparison of the Blu..:

In: 2023 IEEE Symposium on High-Performance Interconnects (HOTI),
 
8

Flover: A Temporal Fusion Framework for Efficient Autoregre..:

In: 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics (HiPC),
Yao, Jinghan ; Alnaasan, Nawras ; Chen, Tian... - p. 107-116 , 2023
 
9

Accelerating Distributed Deep Learning Training with Compre..:

In: 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Zhou, Qinghua ; Anthony, Quentin ; Xu, Lang... - p. 134-144 , 2023
 
11

MCR-DL: Mix-and-Match Communication Runtime for Deep Learni..:

In: 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
Anthony, Quentin ; Awan, Ammar Ahmad ; Rasley, Jeff... - p. 996-1006 , 2023
 
12

ScaMP: Scalable Meta-Parallelism for Deep Learning Search:

In: 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing Workshops (CCGridW),
Anthony, Quentin ; Xu, Lang ; Shafi, Aamir.. - p. 346-348 , 2023
 
13

A Novel Framework for Efficient Offloading of Communication..:

In: 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS),
 
14

Optimized All-to-All Connection Establishment for High-Perf..:

In: 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics (HiPC),
 
15

ScaMP: Scalable Meta-Parallelism for Deep Learning Search:

In: 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet Computing (CCGrid),
Anthony, Quentin ; Xu, Lang ; Shafi, Aamir.. - p. 391-402 , 2023
 