http://cse.osu.edu/~awan.10

This page lists the publications by Ammar Ahmad (Ammar) Awan

Journals (4)

1 Ammar Awan, K. Vadambacheri Manian, C. Chu, H. Subramoni, and DK Panda, Optimized Large-Message Broadcast for Deep Learning Workloads: MPI, MPI+NCCL, or NCCL2? , Volume 85, July 2019, Pages 141-152, https://doi.org/10.1016/j.parco.2019.03.005 , .
2 C. Chu, X. Lu, Ammar Awan, H. Subramoni, Bracy Elton, and DK Panda, Exploiting Hardware Multicast and GPUDirect RDMA for Efficient Broadcast , IEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 30, no. 3, pp. 575-588, 1 March 2019 , .
3 K. Hamidouche, A. Venkatesh, Ammar Awan, H. Subramoni, and DK Panda, CUDA-Aware OpenSHMEM: Extensions and Designs for High Performance OpenSHMEM on GPU Clusters , ParCo: Elsevier Parallel Computing Journal , .
4 Ammar Awan, A. Jain, C. Chu, H. Subramoni, and DK Panda, Communication Profiling and Characterization of Deep Learning Workloads on Clusters with High-Performance Interconnects , IEEE Micro Magazine (accepted for publication) , .

Conferences & Workshops (18)

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18