Doerfert, Johannes
71  Ergebnisse:
Personensuche X
?
1

The Kokkos OpenMPTarget Backend: Implementation and Lessons..:

, In: OpenMP: Advanced Task-Based, Device and Compiler Programming; Lecture Notes in Computer Science,
 
?
2

Maximizing Parallelism and GPU Utilization For Direct GPU C..:

, In: Proceedings of the 52nd International Conference on Parallel Processing Workshops,
 
?
3

Exploring the Limits of Generic Code Execution on GPUs via ..:

, In: OpenMP: Advanced Task-Based, Device and Compiler Programming; Lecture Notes in Computer Science,
 
?
4

OpenMP Kernel Language Extensions for Performance Portable ..:

, In: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis,
Tian, Shilei ; Scogland, Tom ; Chapman, Barbara. - p. 876-883 , 2023
 
?
5

Scalable Tuning of (OpenMP) GPU Applications via Kernel Rec..:

, In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis,
 
?
6

ORAQL — Optimistic Responses to Alias Queries in LLVM:

, In: Proceedings of the 52nd International Conference on Parallel Processing,
Hueckelheim, Jan ; Doerfert, Johannes - p. 655-664 , 2023
 
?
7

Implementing OpenMP's SIMD Directive in LLVM's GPU Runtime:

, In: Proceedings of the 52nd International Conference on Parallel Processing,
Wright, Eric ; Doerfert, Johannes ; Tian, Shilei.. - p. 173-182 , 2023
 
?
8

SPLENDID: Supporting Parallel LLVM-IR Enhanced Natural Deco..:

, In: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3,
Tan, Zujun ; Chon, Yebin ; Kruse, Michael... - p. 679-693 , 2023
 
?
9

Precision and Performance Analysis of C Standard Math Libra..:

, In: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis,
 
?
10

Scalable Tuning of (OpenMP) GPU Applications via Kernel Rec..:

, In: SC23: International Conference for High Performance Computing, Networking, Storage and Analysis,
 
?
11

High-Performance GPU-to-CPU Transpilation and Optimization ..:

, In: Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming,
Moses, William S. ; Ivanov, Ivan R. ; Domke, Jens... - p. 119-134 , 2023
 
?
12

Memory Transfer Decomposition: Exploring Smart Data Movemen..:

, In: Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis,
Roa Perdomo, Diego A. ; Ceccato, Rodrigo ; Neveu, Rémy... - p. 1958-1967 , 2023
 
?
13

Remote OpenMP offloading:

, In: Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming,
Patel, Atmn ; Doerfert, Johannes - p. 441-442 , 2022
 
?
14

Just-in-Time Compilation and Link-Time Optimization for Ope..:

, In: OpenMP in a Modern World: From Multi-device Support to Meta Programming; Lecture Notes in Computer Science,
Tian, Shilei ; Huber, Joseph ; Tramm, John.. - p. 145-158 , 2022
 
?
15

Efficient Execution of OpenMP on GPUs:

, In: 2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO),
 
1-15