Dong, Dezun
125  results:
1

Optimizing Multi-Grid Preconditioned Conjugate Gradient Met..:

Yuan, Fan ; Yang, Xiaojian ; Li, Shengguo...
IEEE Transactions on Parallel and Distributed Systems.  35 (2024)  5 - p. 768-779 , 2024
 
2

Optimizing General Matrix Multiplications on Modern Multi-c..:

Yu, Kainan ; Qi, Xinxin ; Zhang, Peng...
In: 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS), p. 964-975, 2024
 
3

Optimizing Full-Spectrum Matrix Multiplications on ARMv8 Mu..:

Yang, Weiling ; Fang, Jianbin ; Dong, Dezun..
IEEE Transactions on Parallel and Distributed Systems.  35 (2024)  3 - p. 439-454 , 2024
 
4

A survey of machine learning for Network-on-Chips:

Zhang, Xiaoyun ; Dong, Dezun ; Li, Cunlu..
Journal of Parallel and Distributed Computing.  186 (2024)  - p. 104778 , 2024
 
5

GraphCube: Interconnection Hierarchy-aware Graph Processing:

Gan, Xinbiao ; Wu, Guang ; Qiu, Shenghao...
In: Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, p. 160-174, 2024
 
6

COER: A Network Interface Offloading Architecture for RDMA ..:

Wu, Ke ; Dong, Dezun ; Xu, Weixia
ACM Transactions on Architecture and Code Optimization. 2024
 
7

Large Language Models are Few-Shot Summarizers: Multi-Inten..:

Geng, Mingyang ; Wang, Shangwen ; Dong, Dezun...
In: 2024 IEEE/ACM 46th International Conference on Software Engineering (ICSE), p. 453-465, 2024
 
8

ACU: Aggregator-based Congestion control and link Utilizati..:

Yuan, Zhu ; Yuan, Guoyuan ; Dong, Dezun
In: Proceedings of the 8th Asia-Pacific Workshop on Networking, p. 194-195, 2024
 
9

Enhancing Gradient Compression for Distributed Deep Learnin:

Bai, Zhe ; Yu, Enda ; Dong, Dezun.
In: Proceedings of the 8th Asia-Pacific Workshop on Networking, p. 171-172, 2024
 
10

Efficiently Running SpMV on Multi-core DSPs for Banded Matr..:

Bi, Deshun ; Li, Shengguo ; Zhang, Yichen..
In: Algorithms and Architectures for Parallel Processing; Lecture Notes in Computer Science, p. 201-220, 2024
 
11

Optimizing Attention by Exploiting Data Reuse on ARM Multi-..:

Fu, Xiao ; Yang, Weiling ; Dong, Dezun.
In: Proceedings of the 38th ACM International Conference on Supercomputing, p. 137-149, 2024
 
12

Large Language Models are Few-Shot Summarizers: Multi-Inten..:

Geng, Mingyang ; Wang, Shangwen ; Dong, Dezun...
In: Proceedings of the IEEE/ACM 46th International Conference on Software Engineering, p. 1-13, 2024
 
14

LTNoT: Realizing the Trade-Offs Between Latency and Through..:

Gu, Wenhao ; Xie, Xuchao ; Dong, Dezun
In: Algorithms and Architectures for Parallel Processing; Lecture Notes in Computer Science, p. 412-432, 2023
 