Min Si
Title
Cited by
Cited by
Year
MT-MPI: multithreaded MPI for many-core environments
M Si, AJ Peña, P Balaji, M Takagi, Y Ishikawa
Proceedings of the 28th ACM international conference on Supercomputing, 125-134, 2014
502014
MPICH User’s Guide
A Amer, P Balaji, W Bland, W Gropp, R Latham, H Lu, L Oden, AJ Pena, ...
Version, 2015
422015
Casper: An asynchronous progress model for MPI RMA on many-core architectures
M Si, AJ Peña, J Hammond, P Balaji, M Takagi, Y Ishikawa
2015 IEEE International Parallel and Distributed Processing Symposium, 665-676, 2015
362015
Direct MPI library for Intel Xeon Phi co-processors
M Si, Y Ishikawa, M Tatagi
2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013
242013
Why is MPI so slow? analyzing the fundamental limits in implementing MPI-3.1
K Raffenetti, A Amer, L Oden, C Archer, W Bland, H Fujita, Y Guo, ...
Proceedings of the International Conference for High Performance Computing …, 2017
172017
Design of direct communication facility for many-core based accelerators
M Si, Y Ishikawa
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
172012
Parallel I/O optimizations for scalable deep learning
S Pumma, M Si, W Feng, P Balaji
2017 IEEE 23rd International Conference on Parallel and Distributed Systems …, 2017
112017
Towards scalable deep learning via I/O analysis and optimization
S Pumma, M Si, W Feng, P Balaji
2017 IEEE 19th International Conference on High Performance Computing and …, 2017
82017
Process-in-process: techniques for practical address-space sharing
A Hori, M Si, B Gerofi, M Takagi, J Dayal, P Balaji, Y Ishikawa
Proceedings of the 27th International Symposium on High-Performance Parallel …, 2018
52018
Scaling NWChem with efficient and portable asynchronous communication in MPI RMA
M Si, AJ Pena, J Hammond, P Balaji, Y Ishikawa
2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2015
52015
An MPI Library Implementing Direct Communication for Many-Core based Accelerators
M Si, Y Ishikawa
2012 SC Companion: High Performance Computing, Networking, Storage and …, 2012
52012
Dynamic adaptable asynchronous progress model for mpi rma multiphase applications
M Si, AJ Pena, J Hammond, P Balaji, M Takagi, Y Ishikawa
IEEE Transactions on Parallel and Distributed Systems 29 (9), 1975-1989, 2018
32018
Scalable Deep Learning via I/O Analysis and Optimization
S Pumma, M Si, WC Feng, P Balaji
ACM Transactions on Parallel Computing (TOPC) 6 (2), 1-34, 2019
22019
Software combining to mitigate multithreaded MPI contention
A Amer, C Archer, M Blocksome, C Cao, M Chuvelev, H Fujita, ...
Proceedings of the ACM International Conference on Supercomputing, 367-379, 2019
22019
Process-based asynchronous progress model for MPI point-to-point communication
M Si, P Balaji
2017 IEEE 19th International Conference on High Performance Computing and …, 2017
22017
Workshop 8: AsHES Accelerators and Hybrid Exascale Systems
M Si, L Oden, SG de Gonzalo
2020 IEEE International Parallel and Distributed Processing Symposium …, 2020
2020
Guest editorial: Special Issue on Applications and System Software for Hybrid Exascale Systems
AJ Pena, M Si
Parallel Computing 91, 102583, 2020
2020
Parallel programming models and systems software for high-end computing (P2S2 2018)
M Si, A Vishnu, Y Chen
Parallel Computing 89, 102549, 2019
2019
International workshop on programming models and applications for multicores and manycores (PMAM 2018)
M Si, Z Huang, P Balaji
Parallel Computing 88, 102541, 2019
2019
Breaking performance portability bottlenecks in NWChem
J Hammond, M Si, J Dinan, P Balaji, K Kowalski, E Apra, M Klemm
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY 257, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–20