Title | Authors | Venue | Cited by | Year |
Efficient Multi-Path NVLink/PCIe-Aware UCX based Collective Communication for Deep Learning | YH Temuçin, AH Sojoodi, P Alizadeh, A Afsahi | | 9 | 2021 |
Accelerating Deep Learning Using Interconnect-Aware UCX Communication for MPI Collectives | YH Temuçin, AH Sojoodi, P Alizadeh, B Kitor, A Afsahi | IEEE Micro 42 (2), 68-76, 2022 | 8 | 2022 |
Efficient Process Arrival Pattern Aware Collective Communication for Deep Learning | P Alizadeh, A Sojoodi, Y Hassan Temucin, A Afsahi | Proceedings of the 29th European MPI Users' Group Meeting, 68-78, 2022 | 5 | 2022 |
Micro-Benchmarking MPI Partitioned Point-to-Point Communication | Y Hassan Temucin, RE Grant, A Afsahi | Proceedings of the 51st International Conference on Parallel Processing, 1-12, 2022 | 5 | 2022 |
A Dynamic Network-Native MPI Partitioned Aggregation Over InfiniBand Verbs | YH Temuçin, S Levy, W Schonbein, RE Grant, A Afsahi | 2023 IEEE International Conference on Cluster Computing (CLUSTER), 259-270, 2023 | 1 | 2023 |
Enhancing Intra-Node GPU-to-GPU Performance in MPI+UCX through Multi-Path Communication | A Sojoodi, YH Temucin, A Afsahi | Proceedings of the 3rd International Workshop on Extreme Heterogeneity …, 2024 | | 2024 |
High-Performance Interconnect-Aware MPI communication for Deep Learning Workloads | YH Temucin | Queen's University (Canada), 2021 | | 2021 |
ROCm-Aware Leader-based Designs for MPI Neighbourhood Collectives | YH Temuçin, M Gazimirsaeed, RE Grant, A Afsahi | | | |