EReinit: Scalable and Efficient Fault Tolerance for Bulk-Synchronous MPI Applications S. Chakraborty, Ignacio Laguna, Murali Emani, Kathryn Mohror, D. Panda, Martin Schulz, H. Subramoni Concurrency and Computation: Practice and Experience, .