E-LIB Suche - Ergebnisse für: Pekhimenko, Gennady - Sprache EN

Search for persons X

Sorted by: Relevance

Sorted by: Year

1

Sylva: Sparse Embedded Adapters via Hierarchical Approximat..:

, In: Proceedings of the 38th ACM International Conference on Supercomputing,

Mu, Baorun ; Giannoula, Christina ; Wang, Shang. - p. 485-497 , 2024

Link: https://dl.acm.org/doi/1..

2

Minuet: Accelerating 3D Sparse Convolutions on GPUs:

, In: Proceedings of the Nineteenth European Conference on Computer Systems,

Yang, Jiacheng ; Giannoula, Christina ; Wu, Jun... - p. 786-802 , 2024

Link: https://dl.acm.org/doi/1..

3

TiLT: A Time-Centric Approach for Stream Query Optimization..:

, In: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2,

Jayarajan, Anand ; Zhao, Wei ; Sun, Yudi. - p. 818-832 , 2023

Link: https://dl.acm.org/doi/1..

4

TorchProbe: Fuzzing Dynamic Deep Learning Compilers:

, In: Programming Languages and Systems; Lecture Notes in Computer Science,

Su, Qidong ; Geng, Chuqin ; Pekhimenko, Gennady. - p. 310-331 , 2023

Link: https://doi.org/10.1007/..

5

Grape: Practical and Efficient Graphed Execution for Dynami..:

, In: Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture,

Zheng, Bojian ; Yu, Cody Hao ; Wang, Jie... - p. 1364-1380 , 2023

Link: https://dl.acm.org/doi/1..

6

Hidet: Task-Mapping Programming Paradigm for Deep Learning ..:

, In: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2,

Ding, Yaoyao ; Yu, Cody Hao ; Zheng, Bojian... - p. 370-384 , 2023

Link: https://dl.acm.org/doi/1..

7

Pavise : Integrating Fault Tolerance Support for Persist..:

, In: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques,

Qiu, Han Jie ; Liu, Sihang ; Song, Xinyang.. - p. 109-123 , 2022

Link: https://dl.acm.org/doi/1..

8

Keynote Talk 1: Efficient DNN Training at Scale: from Algor..:

, In: 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW),

Pekhimenko, Gennady - p. 1244-1244 , 2022

Link: https://doi.org/10.1109/..

9

Automatic horizontal fusion for GPU kernels:

, In: Proceedings of the 20th IEEE/ACM International Symposium on Code Generation and Optimization,

Li, Ao ; Zheng, Bojian ; Pekhimenko, Gennady. - p. 14-27 , 2022

Link: https://dl.acm.org/doi/1..

10

GPUPool : A Holistic Approach to Fine-Grained GPU Sharin..:

, In: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques,

Tan, Xiaodan Serina ; Golikov, Pavel ; Vijaykumar, Nandita. - p. 317-332 , 2022

Link: https://dl.acm.org/doi/1..

11

Automatic Horizontal Fusion for GPU Kernels:

, In: 2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO),

Li, Ao ; Zheng, Bojian ; Pekhimenko, Gennady. - p. 14-27 , 2022

Link: https://doi.org/10.1109/..

12

LifeStream: a high-performance stream processing engine for..:

, In: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems,

Jayarajan, Anand ; Hau, Kimberly ; Goodwin, Andrew. - p. 107-122 , 2021

Link: https://dl.acm.org/doi/1..

13

Gretch : A Hardware Prefetcher for Graph Analytics:

Kaushik, Anirudh Mohan ; Pekhimenko, Gennady ; Patel, Hiren
ACM Transactions on Architecture and Code Optimization (TACO). 18 (2021) 2 - p. 1-25 , 2021

Link: https://dl.acm.org/doi/1..

14

FPRaker: A Processing Element For Accelerating Neural Netwo..:

, In: MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture,

Awad, Omar Mohamed ; Mahmoud, Mostafa ; Edo, Isak... - p. 857-869 , 2021

Link: https://dl.acm.org/doi/1..

15

NVOverlay : enabling efficient and scalable high-frequen..:

, In: Proceedings of the 48th Annual International Symposium on Computer Architecture,

Wang, Ziqi ; Choo, Chul-Hwan ; Kozuch, Michael A.... - p. 498-511 , 2021

Link: https://dl.acm.org/doi/1..

1-15