Yunquan Zhang
Cited by
Cited by
AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs
Q Wang, X Zhang, Y Zhang, Q Yi
SC'13: Proceedings of the International Conference on High Performance …, 2013
Model-driven level 3 BLAS performance optimization on Loongson 3A processor
Z Xianyi, W Qian, Z Yunquan
2012 IEEE 18th international conference on parallel and distributed systems …, 2012
yaSpMV: Yet another SpMV framework on GPUs
S Yan, C Li, Y Zhang, H Zhou
Acm Sigplan Notices 49 (8), 107-118, 2014
StreamScan: Fast scan algorithms for GPUs without global barrier synchronization
S Yan, G Long, Y Zhang
Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of …, 2013
Parallel processing systems for big data: a survey
Y Zhang, T Cao, S Li, X Tian, L Yuan, H Jia, AV Vasilakos
Proceedings of the IEEE 104 (11), 2114-2136, 2016
Models of parallel computation: a survey and classification
Y Zhang, G Chen, G Sun, Q Miao
Frontiers of Computer Science in China 1 (2), 156-165, 2007
MPFFT: An auto-tuning FFT library for OpenCL GPUs
Y Li, YQ Zhang, YQ Liu, GP Long, HP Jia
Journal of Computer Science and Technology 28 (1), 90-105, 2013
GPURoofline: a model for guiding performance optimizations on GPUs
H Jia, Y Zhang, G Long, J Xu, S Yan, Y Li
European Conference on Parallel Processing, 920-932, 2012
Accelerating viola-jones facce detection algorithm on gpus
H Jia, Y Zhang, W Wang, J Xu
2012 IEEE 14th International Conference on High Performance Computing and …, 2012
A parallel shortest path algorithm based on graph-partitioning and iterative correcting
Y Tang, Y Zhang, H Chen
2008 10th IEEE International Conference on High Performance Computing and …, 2008
Study on parallel computing
GL Chen, GZ Sun, YQ Zhang, ZY Mo
Journal of Computer Science and Technology 21 (5), 665-673, 2006
A novel tripeptide, tyroserleutide, inhibits irradiation-induced invasiveness and metastasis of hepatocellular carcinoma in nude mice
JB Jia, WQ Wang, HC Sun, L Liu, XD Zhu, LQ Kong, ZT Chai, W Zhang, ...
Investigational new drugs 29 (5), 861-872, 2011
Performance evaluation of allgather algorithms on terascale linux cluster with fast ethernet
J Chen, L Zhang, Y Zhang, W Yuan
Eighth International Conference on High-Performance Computing in Asia …, 2005
Performance evaluation of multithreaded sparse matrix-vector multiplication using openmp
S Liu, Y Zhang, X Sun, RR Qiu
2009 11th IEEE International Conference on High Performance Computing and …, 2009
DRAM (h): a parallel computation model for high per-formance numerical computing
Z Yun-Quan
Chinese Journal of Computers 26 (12), 1660-1670, 2003
Optimizing SpMV for diagonal sparse matrices on GPU
X Sun, Y Zhang, T Wang, X Zhang, L Yuan, L Rao
2011 international conference on parallel processing, 492-501, 2011
Parallelization and performance optimization on face detection algorithm with OpenCL: A case study
W Wang, Y Zhang, S Yan, Y Zhang, H Jia
Tsinghua Science and Technology 17 (3), 287-295, 2012
pVOCL: Power-aware dynamic placement and migration in virtualized GPU environments
P Lama, Y Li, AM Aji, P Balaji, J Dinan, S Xiao, Y Zhang, W Feng, ...
2013 IEEE 33rd International Conference on Distributed Computing Systems …, 2013
Cache-oblivious MPI all-to-all communications based on Morton order
S Li, Y Zhang, T Hoefler
IEEE Transactions on Parallel and Distributed Systems 29 (3), 542-555, 2017
LogGPH: A parallel computational model with hierarchical communication awareness
L Yuan, Y Zhang, Y Tang, L Rao, X Sun
2010 13th IEEE International Conference on Computational Science and …, 2010
The system can't perform the operation now. Try again later.
Articles 1–20