E-LIB Search - Results for: Dong, Dezun

1

Optimizing Multi-Grid Preconditioned Conjugate Gradient Met..:

Yuan, Fan ; Yang, Xiaojian ; Li, Shengguo...
IEEE Transactions on Parallel and Distributed Systems. 35 (2024) 5 - p. 768-779 , 2024

Link: https://doi.org/10.1109/..

2

Optimizing General Matrix Multiplications on Modern Multi-c..:

Yu, Kainan ; Qi, Xinxin ; Zhang, Peng...
In: 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS) - p. 964-975 , 2024

Link: https://doi.org/10.1109/..

3

Optimizing Full-Spectrum Matrix Multiplications on ARMv8 Mu..:

Yang, Weiling ; Fang, Jianbin ; Dong, Dezun..
IEEE Transactions on Parallel and Distributed Systems. 35 (2024) 3 - p. 439-454 , 2024

Link: https://doi.org/10.1109/..

4

A survey of machine learning for Network-on-Chips:

Zhang, Xiaoyun ; Dong, Dezun ; Li, Cunlu..
Journal of Parallel and Distributed Computing. 186 (2024) - p. 104778 , 2024

Link: https://doi.org/10.1016/..

5

GraphCube: Interconnection Hierarchy-aware Graph Processing:

Gan, Xinbiao ; Wu, Guang ; Qiu, Shenghao...
In: Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming - p. 160-174 , 2024

Link: https://dl.acm.org/doi/1..

6

COER: A Network Interface Offloading Architecture for RDMA ..:

Wu, Ke ; Dong, Dezun ; Xu, Weixia
ACM Transactions on Architecture and Code Optimization. 2024

Link: https://doi.org/10.1145/..

7

Large Language Models are Few-Shot Summarizers: Multi-Inten..:

Geng, Mingyang ; Wang, Shangwen ; Dong, Dezun...
In: 2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE) - p. 453-465 , 2024

Link: https://doi.org/10.1145/..

8

ACU: Aggregator-based Congestion control and link Utilizati..:

Yuan, Zhu ; Yuan, Guoyuan ; Dong, Dezun
In: Proceedings of the 8th Asia-Pacific Workshop on Networking - p. 194-195 , 2024

Link: https://dl.acm.org/doi/1..

9

Enhancing Gradient Compression for Distributed Deep Learnin..:

Bai, Zhe ; Yu, Enda ; Dong, Dezun.
In: Proceedings of the 8th Asia-Pacific Workshop on Networking - p. 171-172 , 2024

Link: https://dl.acm.org/doi/1..

10

Efficiently Running SpMV on Multi-core DSPs for Banded Matr..:

Bi, Deshun ; Li, Shengguo ; Zhang, Yichen..
In: Algorithms and Architectures for Parallel Processing; Lecture Notes in Computer Science - p. 201-220 , 2024

Link: https://doi.org/10.1007/..

11

Optimizing Attention by Exploiting Data Reuse on ARM Multi-..:

Fu, Xiao ; Yang, Weiling ; Dong, Dezun.
In: Proceedings of the 38th ACM International Conference on Supercomputing - p. 137-149 , 2024

Link: https://dl.acm.org/doi/1..

12

Large Language Models are Few-Shot Summarizers: Multi-Inten..:

Geng, Mingyang ; Wang, Shangwen ; Dong, Dezun...
In: Proceedings of the IEEE/ACM 46th International Conference on Software Engineering - p. 1-13 , 2024

Link: https://dl.acm.org/doi/1..

13

DRLAR: A deep reinforcement learning-based adaptive routing..:

Wang, Shaocong ; Zhang, Xiaoyun ; Wang, Changhong...
Computer Networks. 246 (2024) - p. 110419 , 2024

Link: https://doi.org/10.1016/..

14

LTNoT: Realizing the Trade-Offs Between Latency and Through..:

Gu, Wenhao ; Xie, Xuchao ; Dong, Dezun
In: Algorithms and Architectures for Parallel Processing; Lecture Notes in Computer Science - p. 412-432 , 2023

Link: https://doi.org/10.1007/..

15

EagerCC: An ultra-low latency congestion control mechanism ..:

Lu, Yuan ; Yuan, Guoyuan ; Bai, Yang..
Computer Networks. 236 (2023) - p. 110009 , 2023

Link: https://doi.org/10.1016/..