Sara Baghsorkhi
Title
Cited by
Year
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
S Ryoo, CI Rodrigues, SS Baghsorkhi, SS Stone, DB Kirk, WW Hwu
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008
Cited by: 1274 · Year: 2008
An adaptive performance modeling tool for GPU architectures
SS Baghsorkhi, M Delahaye, SJ Patel, WD Gropp, WW Hwu
Proceedings of the 15th ACM SIGPLAN symposium on Principles and practice of …, 2010
Cited by: 403 · Year: 2010
Program optimization space pruning for a multithreaded GPU
S Ryoo, CI Rodrigues, SS Stone, SS Baghsorkhi, SZ Ueng, JA Stratton, ...
Proceedings of the 6th annual IEEE/ACM international symposium on Code …, 2008
Cited by: 370 · Year: 2008
CUDA-lite: Reducing GPU programming complexity
SZ Ueng, M Lathara, SS Baghsorkhi, WMW Hwu
International Workshop on Languages and Compilers for Parallel Computing, 1-15, 2008
Cited by: 318 · Year: 2008
Program optimization carving for GPU computing
S Ryoo, CI Rodrigues, SS Stone, JA Stratton, SZ Ueng, SS Baghsorkhi, ...
Journal of Parallel and Distributed Computing 68 (10), 1389-1401, 2008
Cited by: 162 · Year: 2008
Auto-tuning of fast fourier transform on graphics processors
Y Dotsenko, SS Baghsorkhi, B Lloyd, NK Govindaraju
ACM SIGPLAN Notices 46 (8), 257-266, 2011
Cited by: 90 · Year: 2011
Implicitly parallel programming models for thousand-core microprocessors
W Hwu, S Ryoo, SZ Ueng, JH Kelm, I Gelado, SS Stone, RE Kidd, ...
Proceedings of the 44th annual Design Automation Conference, 754-759, 2007
Cited by: 88 · Year: 2007
Efficient performance evaluation of memory hierarchy for highly multithreaded graphics processors
SS Baghsorkhi, I Gelado, M Delahaye, WW Hwu
ACM SIGPLAN Notices 47 (8), 23-34, 2012
Cited by: 66 · Year: 2012
Performance analysis and tuning for general purpose graphics processing units (GPGPU)
H Kim, R Vuduc, S Baghsorkhi, J Choi, W Hwu
Synthesis Lectures on Computer Architecture 7 (2), 1-96, 2012
Cited by: 59 · Year: 2012
Programmable coarse grained and sparse matrix compute hardware with advanced scheduling
E Nurvitadhi, B Vembu, NCG Von Borries, R Barik, TH Lin, K Sinha, ...
US Patent 10,186,011, 2019
Cited by: 54 · Year: 2019
Program optimization study on a 128-core GPU
S Ryoo, C Rodrigues, S Stone, S Baghsorkhi, SZ Ueng, WW Hwu
The First Workshop on General Purpose Processing on Graphics Processing Units 23, 2007
Cited by: 48 · Year: 2007
Compute optimizations for neural networks
K Nealis, A Yao, X Chen, E Ould-Ahmed-Vall, SS Baghsorkhi, ...
US Patent 10,410,098, 2019
Cited by: 30 · Year: 2019
FlexVec: Auto-vectorization for irregular loops
SS Baghsorkhi, N Vasudevan, Y Wu
Proceedings of the 37th ACM SIGPLAN Conference on Programming Language …, 2016
Cited by: 27 · Year: 2016
Lightweight restricted transactional memory for speculative compiler optimization
C Wang, Y Wu, SS Baghsorkhi, A Hartono, R Valentine
US Patent 10,324,768, 2019
Cited by: 24 · Year: 2019
Automating efficient variable-grained resiliency for low-power IoT systems
SS Baghsorkhi, C Margiolas
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
Cited by: 22 · Year: 2018
Methods and systems to vectorize scalar computer program loops having loop-carried dependences
J Bharadwaj, N Vasudevan, A Hartono, SS Baghsorkhi
US Patent 9,268,541, 2016
Cited by: 22 · Year: 2016
Specialized fixed function hardware for efficient convolution
R Barik, E Ould-Ahmed-Vall, X Chen, D Srivastava, A Yao, K Nealis, ...
US Patent 10,824,938, 2020
Cited by: 21 · Year: 2020
Coordination and increased utilization of graphics processors during inference
AR Appu, A Koker, JC Weast, MB MacPherson, LL Hurd, SS Baghsorkhi, ...
US Patent 10,304,154, 2019
Cited by: 21 · Year: 2019
Methods and apparatus to detect anomalies of a monitored system
M Agerstam, B Sadeghi, J Martin, J Ota, J Gottschlich, M Carranza, ...
US Patent 10,802,942, 2020
Cited by: 17 · Year: 2020
Loop vectorization methods and apparatus
N Vasudevan, J Bharadwaj, CJ Hughes, MB Girkar, MJ Charney, ...
US Patent 9,244,677, 2016
Cited by: 17 · Year: 2016