Ruwase, Olatunji
45  results:
1

A Hybrid Tensor-Expert-Data Parallelism Approach to Optimiz..:

In: Proceedings of the 37th ACM International Conference on Supercomputing
 
2

SHARP: An Adaptable, Energy-Efficient Accelerator for Recur..:

Aminabadi, Reza Yazdani ; Ruwase, Olatunji ; Zhang, Minjia...
ACM Transactions on Embedded Computing Systems.  22 (2023)  2 - p. 1-23 , 2023
 
3

DeepSpeed-Inference: Enabling Efficient Inference of Trans..:

In: SC22: International Conference for High Performance Computing, Networking, Storage and Analysis
 
4

ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Sca..:

In: SC21: International Conference for High Performance Computing, Networking, Storage and Analysis
 
5

ZeRO-infinity : breaking the GPU memory wall for extreme..:

In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
 
6

DeepSpeed : System Optimizations Enable Training Deep Le..:

In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
 
7

ZeRO : memory optimizations toward training trillion par..:

In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
 
8

HyperDrive : exploring hyperparameters with POP scheduli..:

In: Proceedings of the 18th ACM/IFIP/USENIX Middleware Conference,
Rasley, Jeff ; He, Yuxiong ; Yan, Feng.. - p. 1-13 , 2017
 
10

Optimizing CNNs on Multicores for Scalability, Performance ..:

Rajbhandari, Samyam ; He, Yuxiong ; Ruwase, Olatunji..
ACM SIGOPS Operating Systems Review.  51 (2017)  2 - p. 267-280 , 2017
 
11

Optimizing CNNs on Multicores for Scalability, Performance ..:

In: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems
 
12

Optimizing CNNs on Multicores for Scalability, Performance ..:

Rajbhandari, Samyam ; He, Yuxiong ; Ruwase, Olatunji..
ACM SIGARCH Computer Architecture News.  45 (2017)  1 - p. 267-280 , 2017
 
13

SERF : efficient scheduling for fast deep neural network..:

In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis,
Yan, Feng ; He, Yuxiong ; Ruwase, Olatunji. - p. 1-12 , 2016
 
14

Performance Modeling and Scalability Optimization of Distri..:

In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,
Yan, Feng ; Ruwase, Olatunji ; He, Yuxiong. - p. 1355-1364 , 2015
 
15

Page overlays: an enhanced virtual memory framework to enab..:

Seshadri, Vivek ; Pekhimenko, Gennady ; Ruwase, Olatunji...
ACM SIGARCH Computer Architecture News.  43 (2015)  3S - p. 79-91 , 2015
 