Exploiting Maximal Overlap for Non-Contiguous Data Movement Processing on Modern GPU-enabled System. C. Chu, K. Hamidouche, A. Venkatesh, D. Banerjee, H. Subramoni, D. Panda. The 30th IEEE International Parallel & Distributed Processing Symposium (IPDPS '16), May 2016.