Efficient Inter-node MPI Communication using GPUDirect RDMA for InfiniBand Clusters with NVIDIA GPUs
S. Potluri, K. Hamidouche, A. Venkatesh, D. Bureddy, D. Panda
International Conference on Parallel Processing (ICPP '13),
Oct 2013.