Baghsorkhi, Sara S.
17  results:
Search for persons X
?
1

C3-Flow : Compute Compression Co-Design Flow for Deep Ne..:

, In: Proceedings of the 56th Annual Design Automation Conference 2019,
 
?
2

Automating efficient variable-grained resiliency for low-po..:

, In: Proceedings of the 2018 International Symposium on Code Generation and Optimization,
 
?
4

FlexVec: auto-vectorization for irregular loops:

, In: Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation,
 
?
6

Efficient performance evaluation of memory hierarchy for hi..:

, In: Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming,
 
?
8

Auto-tuning of fast fourier transform on graphics processor:

, In: Proceedings of the 16th ACM symposium on Principles and practice of parallel programming,
 
?
9

An adaptive performance modeling tool for GPU architectures:

, In: Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming,
 
?
11

Program optimization space pruning for a multithreaded gpu:

, In: Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization,
 
?
12

Optimization principles and application performance evaluat..:

, In: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming,
 
?
13

Program optimization carving for GPU computing:

Ryoo, Shane ; Rodrigues, Christopher I. ; Stone, Sam S....
Journal of Parallel and Distributed Computing.  68 (2008)  10 - p. 1389-1401 , 2008
 
?
14

Implicitly parallel programming models for thousand-core mi..:

, In: Proceedings of the 44th annual Design Automation Conference,
Hwu, Wen-mei ; Ryoo, Shane ; Ueng, Sain-Zee... - p. 754-759 , 2007
 
1-15