Pekhimenko, Gennady
112  results:
1

Sylva: Sparse Embedded Adapters via Hierarchical Approximat..:

In: Proceedings of the 38th ACM International Conference on Supercomputing,
Mu, Baorun ; Giannoula, Christina ; Wang, Shang. - p. 485-497, 2024
 
2

Minuet: Accelerating 3D Sparse Convolutions on GPUs:

In: Proceedings of the Nineteenth European Conference on Computer Systems,
Yang, Jiacheng ; Giannoula, Christina ; Wu, Jun... - p. 786-802, 2024
 
3

TiLT: A Time-Centric Approach for Stream Query Optimization..:

In: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2,
Jayarajan, Anand ; Zhao, Wei ; Sun, Yudi. - p. 818-832, 2023
 
4

TorchProbe: Fuzzing Dynamic Deep Learning Compilers:

In: Programming Languages and Systems; Lecture Notes in Computer Science,
Su, Qidong ; Geng, Chuqin ; Pekhimenko, Gennady. - p. 310-331, 2023
 
5

Grape: Practical and Efficient Graphed Execution for Dynami..:

In: Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture,
Zheng, Bojian ; Yu, Cody Hao ; Wang, Jie... - p. 1364-1380, 2023
 
6

Hidet: Task-Mapping Programming Paradigm for Deep Learning ..:

In: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2,
Ding, Yaoyao ; Yu, Cody Hao ; Zheng, Bojian... - p. 370-384, 2023
 
7

Pavise: Integrating Fault Tolerance Support for Persist..:

In: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques,
Qiu, Han Jie ; Liu, Sihang ; Song, Xinyang.. - p. 109-123, 2022
 
8

Keynote Talk 1: Efficient DNN Training at Scale: from Algor..:

In: 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),
Pekhimenko, Gennady - p. 1244-1244, 2022
 
9

Automatic horizontal fusion for GPU kernels:

In: Proceedings of the 20th IEEE/ACM International Symposium on Code Generation and Optimization,
Li, Ao ; Zheng, Bojian ; Pekhimenko, Gennady. - p. 14-27, 2022
 
10

GPUPool: A Holistic Approach to Fine-Grained GPU Sharin..:

In: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques
 
11

Automatic Horizontal Fusion for GPU Kernels:

In: 2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO),
Li, Ao ; Zheng, Bojian ; Pekhimenko, Gennady. - p. 14-27, 2022
 
12

LifeStream: a high-performance stream processing engine for..:

In: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems
 
13

Gretch: A Hardware Prefetcher for Graph Analytics:

Kaushik, Anirudh Mohan ; Pekhimenko, Gennady ; Patel, Hiren
ACM Transactions on Architecture and Code Optimization (TACO). 18 (2021) 2 - p. 1-25, 2021
 
14

FPRaker: A Processing Element For Accelerating Neural Netwo..:

In: MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture,
Awad, Omar Mohamed ; Mahmoud, Mostafa ; Edo, Isak... - p. 857-869, 2021
 
15

NVOverlay: enabling efficient and scalable high-frequen..:

In: Proceedings of the 48th Annual International Symposium on Computer Architecture,
Wang, Ziqi ; Choo, Chul-Hwan ; Kozuch, Michael A.... - p. 498-511, 2021
 