Follow
Soila Kavulya
Title
Cited by
Cited by
Year
An analysis of traces from a production mapreduce cluster
S Kavulya, J Tan, R Gandhi, P Narasimhan
Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster …, 2010
4452010
Causes of failure in web applications
S Pertet, P Narasimhan
Parallel Data Laboratory, Carnegie Mellon University, CMU-PDL-05-109, 2005
2592005
SALSA: analyzing logs as state machines
J Tan, X Pan, S Kavulya, R Gandhi, P Narasimhan
Proceedings of the First USENIX conference on Analysis of system logs, 6-6, 2008
1652008
Ganesha: blackBox diagnosis of MapReduce systems
X Pan, J Tan, S Kavulya, R Gandhi, P Narasimhan
ACM SIGMETRICS Performance Evaluation Review 37 (3), 8-13, 2010
1262010
Mochi: visual log-analysis based tools for debugging hadoop
J Tan, X Pan, S Kavulya, R Gandhi, P Narasimhan
USENIX Workshop on Hot Topics in Cloud Computing (HotCloud), San Diego, CA 6 (3), 2009
1032009
Kahuna: Problem diagnosis for mapreduce-based cloud computing environments
J Tan, X Pan, E Marinelli, S Kavulya, R Gandhi, P Narasimhan
2010 IEEE Network Operations and Management Symposium-NOMS 2010, 112-119, 2010
942010
MEAD: support for Real‐Time Fault‐Tolerant CORBA
P Narasimhan, TA Dumitraş, AM Paulos, SM Pertet, CF Reverte, ...
Concurrency and Computation: Practice and Experience 17 (12), 1527-1545, 2005
942005
Tiresias: Black-box failure prediction in distributed systems
AW Williams, SM Pertet, P Narasimhan
2007 IEEE International Parallel and Distributed Processing Symposium, 1-8, 2007
892007
Draco: Statistical diagnosis of chronic problems in large distributed systems
SP Kavulya, S Daniels, K Joshi, M Hiltunen, R Gandhi, P Narasimhan
IEEE/IFIP International Conference on Dependable Systems and Networks (DSN …, 2012
802012
Visual, log-based causal tracing for performance debugging of mapreduce systems
J Tan, S Kavulya, R Gandhi, P Narasimhan
2010 IEEE 30th International Conference on Distributed Computing Systems …, 2010
792010
Optimizing skewed joins in big data
SP Kavulya, MR Alton, A Shahbazi, T Lisonbee
US Patent App. 14/757,748, 2017
702017
Diagnosis in automotive systems: A survey
PE Lanigan, S Kavulya, P Narasimhan, TE Fuhrman, MA Salman
Tech. Rep. CMU-PDL-11-110, Carnegie Mellon University Parallel Data Lab, 2011
532011
Performance Troubleshooting in Data Centers: An Annotated Bibliography
C Wang, SP Kavulya, J Tan, L Hu, M Kutare, M Kasick, K Schwan, ...
ACM SIGOPS Operating Systems Review 47 (3), 50-62, 2013
512013
Failure Diagnosis of Complex Systems
SP Kavulya, K Joshi, F Di Giandomenico, P Narasimhan
Resilience Assessment and Evaluation of Computing Systems, 239-261, 2012
482012
Proactive recovery in distributed corba applications
S Pertet, P Narasimhan
International Conference on Dependable Systems and Networks, 2004, 357-366, 2004
432004
Theia: visual signatures for problem diagnosis in large hadoop clusters
E Garduno, SP Kavulya, J Tan, R Gandhi, P Narasimhan
Proceedings of the 26th international conference on Large Installation …, 2012
352012
Fingerpointing correlated failures in replicated systems
S Pertet, R Gandhi, P Narasimhan
Proceedings of the 2nd USENIX workshop on Tackling computer systems problems …, 2007
282007
Lightweight Black-box Failure Detection for Distributed Systems
J Tan, S Kavulya, R Gandhi, P Narasimhan
Proceedings of the 2012 workshop on Management of big data systems, 13-18, 2012
262012
Experiences with fault-injection in a Byzantine fault-tolerant protocol
R Martins, R Gandhi, P Narasimhan, S Pertet, A Casimiro, D Kreutz, ...
ACM/IFIP/USENIX International Conference on Distributed Systems Platforms …, 2013
252013
Fault tolerant control system
SM Naik, PK Mishra, SM Pertet
US Patent 8,099,179, 2012
232012
The system can't perform the operation now. Try again later.
Articles 1–20