Personensuche
X
?
Proceedings of the 38th ACM International Conference on Supercomputing ,
3
Exploiting Vector Code Semantics for Efficient Data Cache P..:
, In:
?
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming ,
5
Efficient Direct Convolution Using Long SIMD Instructions:
, In:
?
Machine Learning and Knowledge Discovery in Databases; Lecture Notes in Computer Science ,
6
FASE: A Fast, Accurate and Seamless Emulator for Custom Num..:
, In:
?
Lecture Notes in Computer Science; Embedded Computer Systems: Architectures, Modeling, and Simulation ,
7
Characterization of a Coherent Hardware Accelerator Framewo..:
, In:
?
Proceedings of the 50th Annual International Symposium on Computer Architecture ,
8
DynAMO: Improving Parallelism Through Dynamic Placement of ..:
, In:
?
2023 Design, Automation & Test in Europe Conference & Exhibition (DATE) ,
9
Fast Behavioural RTL Simulation of 10B Transistor SoC Desig..:
, In:
?
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture ,
10
A Tensor Marshaling Unit for Sparse Tensor Algebra on Gener..:
, In:
?
2022 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) ,
12
FASE: A Fast, Accurate and Seamless Emulator for Custom Num..:
, In:
?
2022 IEEE 29th Symposium on Computer Arithmetic (ARITH) ,
14
A BF16 FMA is All You Need for DNN Training:
, In:
?
Proceedings of the 35th ACM International Conference on Supercomputing ,
15