Endo, Toshio
660  results:
1

AshPipe: Asynchronous Hybrid Pipeline Parallel for DNN Trai..:

, In: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region,
 
2

Retargeting and Respecializing GPU Workloads for Performanc..:

, In: 2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO),
Ivanov, Ivan R. ; Zinenko, Oleksandr ; Domke, Jens.. - p. 119-132 , 2024
 
3

Real-time High-resolution X-Ray Computed Tomography:

, In: Proceedings of the 38th ACM International Conference on Supercomputing,
Wu, Du ; Chen, Peng ; Wang, Xiao... - p. 110-123 , 2024
 
4

Accelerating Stencil Computations on a GPU by Combining Usi..:

, In: Proceedings of the 16th Workshop on General Purpose Processing Using GPU,
Kambe, Futa ; Endo, Toshio - p. 1-6 , 2024
 
5

Revisiting Temporal Blocking Stencil Optimizations:

, In: Proceedings of the 37th ACM International Conference on Supercomputing,
Zhang, Lingqi ; Wahib, Mohamed ; Chen, Peng... - p. 251-263 , 2023
 
6

High-Performance GPU-to-CPU Transpilation and Optimization ..:

, In: Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming,
Moses, William S. ; Ivanov, Ivan R. ; Domke, Jens... - p. 119-134 , 2023
 
7

The Aggressive Oversubscribing Scheduling for Interactive J..:

, In: 2023 IEEE High Performance Extreme Computing Conference (HPEC),
 
8

PERKS: a Locality-Optimized Execution Model for Iterative M..:

, In: Proceedings of the 37th ACM International Conference on Supercomputing,
Zhang, Lingqi ; Wahib, Mohamed ; Chen, Peng... - p. 167-179 , 2023
 
9

Enhancing the Performance of AlphaFold Through Modified Sto..:

, In: 2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE),
Fujita, Hayato ; Nomura, Akihiro ; Endo, Toshio. - p. 2140-2146 , 2023
 
10

Exploiting Scratchpad Memory for Deep Temporal Blocking ..:

, In: Proceedings of the 15th Workshop on General Purpose Processing Using GPU,
Zhang, Lingqi ; Wahib, Mohamed ; Chen, Peng... - p. 34-35 , 2023
 
11

Effectiveness of the Oversubscribing Scheduling on Supercom..:

, In: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region,
 
12

Pyramid Swin Transformer for Multi-task: Expanding to More ..:

, In: Advanced Concepts for Intelligent Vision Systems; Lecture Notes in Computer Science,
 
13

Efficient Stencil Computation with Temporal Blocking by Hal..:

, In: 2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom),
Aikawa, Hiroki ; Endo, Toshio ; Yuki, Tomoya.. - p. 870-877 , 2022
 
14

mdx: A Cloud Platform for Supporting Data Science and Cross..:

, In: 2022 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech),
 
15

Speed-Up Single Shot Detector on GPU with CUDA:

, In: Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing; Studies in Computational Intelligence,
 