1205 W. Clark Street, Urbana, IL 61801
Office 4017 NCSA
Welcome to my website !!
I am currently a PhD student in the Computer Science Department of the University of Illinois at Urbana-Champaign.
On this website, you will find information about projects I am involved in, my publications and open source tools I have developed. Feel free to check out my work and if you have any questions, comments, or would like to work together, don't hesitate to contact me.
My research interests are parallel and distributed computing on HPC systems with a special focus on resilience/fault tolerance. My PhD research aims to discover to what extent online failure prediction is a possibility at petascale/exascale and what are the challenges in achieving an effective fault prevention mechanism for current and future HPC systems.
My quest gave me the opportunity to play with (and somethimes crash) many large systems during my RAs and internships. Here is a summary:
Improving the computing efficiency of HPC systems using a combination of proactive and preventive checkpointing - Mohamed Slim Bouguerra, Ana Gainaru, Franck Cappello, Leonardo Bautista Gomez, Naoya Maruyama, Satoshi Matsuoka - IPDPS 2013 (acceptance rate of 21%), Boston, USA
Fault prediction under the microscope: A closer look into HPC systems - Ana Gainaru, Franck Cappello, Marc Snir, William Kramer - Supercomputing 2012 (acceptance rate of 21%), Salt Lake City, USA
Taming of the Shrew: Modeling the Normal and Faulty Behavior of Large-scale HPC Systems - Ana Gainaru, Franck Cappello, William Kramer - IPDPS 2012 (acceptance rate of 21%), Shanghai, China
Challenges in predicting failures on the Blue Waters system - Talk given at the ninth workshop of the Joint Laboratory for Petascale Computing, June 12-14, 2013, Lyon
Coupling failure prediction, proactive and preventive checkpoint for current production HPC systems. - Talk given at the eighth workshop of the Joint Laboratory for Petascale Computing, November 19-21, 2012, Argonne National Laboratory
Fault prediction under the microscope: A closer look into HPC systems - A few online journals mention my paper from SC2012: