Gregory Diamos
Gregory Diamos
Landing AI
Verified email at landing.ai
Title
Cited by
Cited by
Year
Deep speech 2: End-to-end speech recognition in english and mandarin
D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ...
International conference on machine learning, 173-182, 2016
17502016
Deep speech: Scaling up end-to-end speech recognition
A Hannun, C Case, J Casper, B Catanzaro, G Diamos, E Elsen, ...
arXiv preprint arXiv:1412.5567, 2014
12302014
Mixed precision training
P Micikevicius, S Narang, J Alben, G Diamos, E Elsen, D Garcia, ...
arXiv preprint arXiv:1710.03740, 2017
3452017
Deep voice: Real-time neural text-to-speech
SO Arik, M Chrzanowski, A Coates, G Diamos, A Gibiansky, Y Kang, X Li, ...
arXiv preprint arXiv:1702.07825, 2017
3382017
Ocelot: a dynamic optimization framework for bulk-synchronous applications in heterogeneous systems
G Diamos, A Kerr, S Yalamanchili, N Clark
2010 19th International Conference on Parallel Architectures and Compilation …, 2010
2872010
Deep voice 2: Multi-speaker neural text-to-speech
A Gibiansky, S Arik, G Diamos, J Miller, K Peng, W Ping, J Raiman, ...
Advances in neural information processing systems, 2962-2970, 2017
2582017
Harmony: an execution model and runtime for heterogeneous many core systems
GF Diamos, S Yalamanchili
Proceedings of the 17th international symposium on High performance …, 2008
2272008
Exploring sparsity in recurrent neural networks
S Narang, E Elsen, G Diamos, S Sengupta
arXiv preprint arXiv:1704.05119, 2017
1552017
A characterization and analysis of ptx kernels
A Kerr, G Diamos, S Yalamanchili
2009 IEEE international symposium on workload characterization (IISWC), 3-12, 2009
1492009
Modeling GPU-CPU workloads and systems
A Kerr, G Diamos, S Yalamanchili
Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics …, 2010
1272010
Simultaneous branch and warp interweaving for sustained GPU performance
N Brunie, S Collange, G Diamos
2012 39th Annual International Symposium on Computer Architecture (ISCA), 49-60, 2012
1112012
Deep learning scaling is predictable, empirically
J Hestness, S Narang, N Ardalani, G Diamos, H Jun, H Kianinejad, ...
arXiv preprint arXiv:1712.00409, 2017
1102017
Kernel weaver: Automatically fusing database primitives for efficient gpu computation
H Wu, G Diamos, S Cadambi, S Yalamanchili
2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 107-118, 2012
1012012
SIMD re-convergence at thread frontiers
G Diamos, B Ashbaugh, S Maiyuran, A Kerr, H Wu, S Yalamanchili
2011 44th Annual IEEE/ACM International Symposium on Microarchitecture …, 2011
902011
Red fox: An execution environment for relational query processing on gpus
H Wu, G Diamos, T Sheard, M Aref, S Baxter, M Garland, S Yalamanchili
Proceedings of Annual IEEE/ACM International Symposium on Code Generation …, 2014
742014
A framework for dynamically instrumenting GPU compute applications within GPU Ocelot
N Farooqui, A Kerr, G Diamos, S Yalamanchili, K Schwan
Proceedings of the Fourth Workshop on General Purpose Processing on Graphics …, 2011
682011
Persistent rnns: Stashing recurrent weights on-chip
G Diamos, S Sengupta, B Catanzaro, M Chrzanowski, A Coates, E Elsen, ...
International Conference on Machine Learning, 2024-2033, 2016
632016
Optimizing data warehousing applications for gpus using kernel fusion/fission
H Wu, G Diamos, J Wang, S Cadambi, S Yalamanchili, S Chakradhar
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
612012
Block-sparse recurrent neural networks
S Narang, E Undersander, G Diamos
arXiv preprint arXiv:1711.02782, 2017
582017
Translating GPU binaries to tiered SIMD architectures with Ocelot
G Diamos, A Kerr, M Kesavan
Georgia Institute of Technology, 2009
542009
The system can't perform the operation now. Try again later.
Articles 1–20