Rajbhandari, Samyam
51  results:
Search for persons X
?
1

System Optimizations for Enabling Training of Extreme Long ..:

, In: Proceedings of the 43rd ACM Symposium on Principles of Distributed Computing,
 
?
2

System Optimizations for Enabling Training of Extreme Long ..:

, In: 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),
Jacobs, Sam Ade ; Tanaka, Masahiro ; Zhang, Chengming... - p. 1206-1208 , 2024
 
?
4

A Hybrid Tensor-Expert-Data Parallelism Approach to Optimiz..:

, In: Proceedings of the 37th ACM International Conference on Supercomputing,
 
?
 
?
 
?
 
?
 
?
14

DeepSpeed- Inference: Enabling Efficient Inference of Trans..:

, In: SC22: International Conference for High Performance Computing, Networking, Storage and Analysis,
 
?
15

1-bit LAMB: Communication Efficient Large-Scale Large-Batch..:

, In: 2022 IEEE 29th International Conference on High Performance Computing, Data, and Analytics (HiPC),
Li, Conglong ; Awan, Ammar Ahmad ; Tang, Hanlin.. - p. 272-281 , 2022
 
1-15