A Software Based Approach for Providing Network Fault Tolerance in Clusters Using the uDAPL Interface: MPI Level Design and Performance Evaluation
A. Vishnu, P. Gupta, A. Mamidala, D. Panda
SuperComputing 2006, Nov 2006.