Joshua Hursey
Titre
Citée par
Citée par
Année
The design and implementation of checkpoint/restart process fault tolerance for Open MPI
J Hursey, JM Squyres, TI Mattox, A Lumsdaine
2007 IEEE International Parallel and Distributed Processing Symposium, 1-8, 2007
2242007
Why it’s worth the hassle: The value of in-situ studies when designing ubicomp
Y Rogers, K Connelly, L Tedesco, W Hazlewood, A Kurtz, RE Hall, ...
International Conference on Ubiquitous Computing, 336-353, 2007
2172007
An evaluation of user-level failure mitigation support in MPI
W Bland, A Bouteiller, T Herault, J Hursey, G Bosilca, JJ Dongarra
European MPI Users' Group Meeting, 193-203, 2012
1042012
Interconnect agnostic checkpoint/restart in Open MPI
J Hursey, TI Mattox, A Lumsdaine
Proceedings of the 18th ACM international symposium on High performance …, 2009
782009
Run-through stabilization: An MPI proposal for process fault tolerance
J Hursey, RL Graham, G Bronevetsky, D Buntinas, H Pritchard, DG Solt
European MPI Users' Group Meeting, 329-332, 2011
582011
An evaluation of user-level failure mitigation support in MPI
W Bland, A Bouteiller, T Herault, J Hursey, G Bosilca, JJ Dongarra
Computing 95 (12), 1171-1184, 2013
452013
A log-scaling fault tolerant agreement algorithm for a fault tolerant MPI
J Hursey, T Naughton, G Vallee, RL Graham
European MPI Users' Group Meeting, 255-263, 2011
412011
Coordinated checkpoint/restart process fault tolerance for mpi applications on hpc systems
JJ Hursey
[Bloomington, Ind.]: Indiana University, 2010
352010
Locality-aware parallel process mapping for multi-core HPC systems
J Hursey, JM Squyres, T Dontje
2011 IEEE International Conference on Cluster Computing, 527-531, 2011
312011
PMIx: Process management for exascale environments
RH Castain, J Hursey, A Bouteiller, D Solt
Parallel Computing 79, 9-29, 2018
292018
A checkpoint and restart service specification for Open MPI
J Hursey, JM Squyres, A Lumsdaine
Indiana University, Computer Science Department, Technical Report, 2006
282006
Building a fault tolerant MPI application: A ring communication example
J Hursey, RL Graham
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
222011
Netloc: Towards a comprehensive view of the HPC system topology
B Goglin, J Hursey, JM Squyres
2014 43rd International Conference on Parallel Processing Workshops, 216-225, 2014
202014
A composable runtime recovery policy framework supporting resilient HPC applications
J Hursey, A Lumsdaine
Indiana University, Bloomington, Indiana, USA, Tech. Rep. TR686, 2010
172010
Preserving collective performance across process failure for a fault tolerant MPI
J Hursey, RL Graham
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
162011
Checkpoint/restart-enabled parallel debugging
J Hursey, C January, M O’Connor, PH Hargrove, D Lecomber, ...
European MPI Users' Group Meeting, 219-228, 2010
152010
Advancing application process affinity experimentation: open MPI's LAMA-based affinity interface
J Hursey, JM Squyres
Proceedings of the 20th European MPI Users' Group Meeting, 163-168, 2013
112013
A Paradigm for Parallel Matrix Algorithms
DS Wise, C Citro, J Hursey, F Liu, M Rainey
European Conference on Parallel Processing, 687-698, 2005
102005
Representing unit test data for large scale software development
JA Cottam, J Hursey, A Lumsdaine
Proceedings of the 4th ACM symposium on Software visualization, 57-66, 2008
92008
An extensible framework for distributed testing of MPI implementations
J Hursey, E Mallove, JM Squyres, A Lumsdaine
European Parallel Virtual Machine/Message Passing Interface Users’ Group …, 2007
82007
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20