Design and Implementation of Multi-Rail-Aware Hierarchical MPI Reduce-Scatter and Allgather Operations C. Chen, J. Yao, D. Panda ISC HIGH PERFORMANCE 2026, Jun 2026.