Li, Shuaiwen
19 results:
1

Bring orders into uncertainty : enabling efficient uncer..
In: Proceedings of the 36th ACM International Conference on Supercomputing,
Zhang, Heng ; Li, Lingda ; Liu, Hang... - p. 1-14, 2022
 
2

A novel memory-efficient deep learning training framework v..
In: Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming,
Jin, Sian ; Li, Guanpeng ; Song, Shuaiwen Leon... - p. 485-487, 2021
 
3

Dr. Top-k : delegate-centric Top-k on GPUs
In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis,
Gaihre, Anil ; Zheng, Da ; Weitze, Scott... - p. 1-14, 2021
 
4

Q-VR: system-level design for future mobile collaborative v..
In: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems,
Xie, Chenhao ; Li, Xie ; Hu, Yang... - p. 587-599, 2021
 
5

An efficient uncertain graph processing framework for heter..
In: Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming,
Zhang, Heng ; Li, Lingda ; Zhuang, Donglin... - p. 477-479, 2021
 
6

Fast and Scalable Sparse Triangular Solver for Multi-GPU Ba..
In: 50th International Conference on Parallel Processing,
Xie, Chenhao ; Chen, Jieyang ; Firoz, Jesun... - p. 1-11, 2021
 
7

Dr. Top-k: Delegate-Centric Top-k on GPUs
In: SC21: International Conference for High Performance Computing, Networking, Storage and Analysis,
Gaihre, Anil ; Zheng, Da ; Weitze, Scott... - p. 1-14, 2021
 
8

BSTC : a novel binarized-soft-tensor-core design for acc..
In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis,
Li, Ang ; Geng, Tong ; Wang, Tianqi... - p. 1-30, 2019
 
9

Warp-Consolidation : A Novel Execution Model for GPUs
In: Proceedings of the 2018 International Conference on Supercomputing,
Li, Ang ; Liu, Weifeng ; Wang, Linnan... - p. 53-64, 2018
 
10

Superneurons : dynamic GPU memory management for trainin..
In: Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming,
Wang, Linnan ; Ye, Jinmian ; Zhao, Yiyang... - p. 41-53, 2018
 
11

Introduction to HPPAC 2018
In: 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),
Song, Shuaiwen Leon ; Bates, Natalie ; Li, Ang - p. 674-674, 2018
 
12

CUDAAdvisor: LLVM-based runtime profiling for modern GPUs
In: Proceedings of the 2018 International Symposium on Code Generation and Optimization,
Shen, Du ; Song, Shuaiwen Leon ; Li, Ang... - p. 214-227, 2018
 
13

Locality-Aware CTA Clustering for Modern GPUs
In: Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems,
Li, Ang ; Song, Shuaiwen Leon ; Liu, Weifeng... - p. 297-311, 2017
 
14

Exploring and analyzing the real impact of modern on-packag..
In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis,
Li, Ang ; Liu, Weifeng ; Kristensen, Mads R. B.... - p. 1-14, 2017
 
15

BVF : enabling significant on-chip power savings via bit..
In: Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture,
Li, Ang ; Zhao, Wenfeng ; Song, Shuaiwen Leon - p. 532-545, 2017
 
Results 1-15 of 19