HAND: A Hybrid Approach to Accelerate Non-contiguous Data Movement using MPI Datatypes on GPU Clusters
R. Shi, X. Lu, S. Potluri, K. Hamidouche, J. Zhang, D. Panda
International Conference on Parallel Processing (ICPP’14),
Sep 2014.