Jiayuan Meng
Title
Cited by
Cited by
Year
Rodinia: A benchmark suite for heterogeneous computing
S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, SH Lee, K Skadron
2009 IEEE international symposium on workload characterization (IISWC), 44-54, 2009
24912009
A performance study of general-purpose applications on graphics processors using CUDA
S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, K Skadron
Journal of parallel and distributed computing 68 (10), 1370-1380, 2008
8032008
Dynamic warp subdivision for integrated branch and memory divergence tolerance
J Meng, D Tarjan, K Skadron
Proceedings of the 37th annual international symposium on Computer …, 2010
2732010
Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs
J Meng, K Skadron
Proceedings of the 23rd international conference on Supercomputing, 256-265, 2009
1682009
GROPHECY: GPU performance projection from CPU code skeletons
J Meng, VA Morozov, K Kumaran, V Vishwanath, TD Uram
SC'11: Proceedings of 2011 International Conference for High Performance …, 2011
1132011
Best-effort parallel execution framework for recognition and mining applications
J Meng, S Chakradhar, A Raghunathan
2009 IEEE International Symposium on Parallel & Distributed Processing, 1-12, 2009
952009
Increasing memory miss tolerance for SIMD cores
D Tarjan, J Meng, K Skadron
Proceedings of the Conference on High Performance Computing Networking …, 2009
722009
Improving GPU performance prediction with data transfer modeling
M Boyer, J Meng, K Kumaran
2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013
612013
Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling
J Meng, K Skadron
2009 IEEE International Conference on Computer Design, 282-288, 2009
602009
A performance study for iterative stencil loops on gpus with ghost zone optimizations
J Meng, K Skadron
International Journal of Parallel Programming 39 (1), 115-142, 2011
522011
Exploiting the forgiving nature of applications for scalable parallel execution
J Mengte, A Raghunathan, S Chakradhar, S Byna
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
402010
Partial molar volume of paracetamol in water, 0.1 M HCl and 0.154 M NaCl at T=(298.15, 303.15, 308.15 and 310.65) K and at 101.325 kPa
MJ Iqbal, QM Malik
The Journal of Chemical Thermodynamics 37 (12), 1347-1350, 2005
382005
A performance study of general purpose applications on graphics processors
S Che, J Meng, JW Sheaffer, K Skadron
First Workshop on General Purpose Processing on Graphics Processing Units, 10, 2007
332007
Exploiting inter-thread temporal locality for chip multithreading
J Meng, JW Sheaffer, K Skadron
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
292010
Best-effort semantic document search on GPUs
S Byna, J Meng, A Raghunathan, S Chakradhar, S Cadambi
Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics …, 2010
272010
Workflow performance improvement using model-based scheduling over multiple clusters and clouds
K Maheshwari, ES Jung, J Meng, V Morozov, V Vishwanath, R Kettimuthu
Future Generation Computer Systems 54, 206-218, 2016
252016
Dataflow-driven GPU performance projection for multi-kernel transformations
J Meng, VA Morozov, V Vishwanath, K Kumaran
SC'12: Proceedings of the International Conference on High Performance …, 2012
222012
SKOPE: A framework for modeling and exploring workload behavior
J Meng, X Wu, V Morozov, V Vishwanath, K Kumaran, V Taylor
Proceedings of the 11th ACM Conference on Computing Frontiers, 1-10, 2014
212014
Robust SIMD: Dynamically adapted SIMD width and multi-threading depth
J Meng, JW Sheaffer, K Skadron
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
212012
Dynamic warp subdivision for integrated branch and memory latency divergence tolerance
K Skadron, J Meng, D Tarjan
US Patent App. 13/040,045, 2011
202011
The system can't perform the operation now. Try again later.
Articles 1–20