Stash: Have your scratchpad and cache it too R Komuravelli, MD Sinclair, J Alsop, M Kotsifakou, P Srivastava, SV Adve, ... Proceedings of the 42nd Annual International Symposium on Computer …, 2015 | 89 | 2015 |
Efficient GPU synchronization without scopes: Saying no to complex consistency models MD Sinclair, J Alsop, SV Adve Proceedings of the 48th International Symposium on Microarchitecture, 647-659, 2015 | 74 | 2015 |
Lazy release consistency for GPUs J Alsop, MS Orr, BM Beckmann, DA Wood 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016 | 44 | 2016 |
Spandex: A flexible interface for efficient heterogeneous coherence J Alsop, M Sinclair, S Adve 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018 | 43 | 2018 |
Chasing away RAts: Semantics and evaluation for relaxed atomics on heterogeneous systems MD Sinclair, J Alsop, SV Adve Proceedings of the 44th Annual International Symposium on Computer …, 2017 | 39 | 2017 |
HeteroSync: A benchmark suite for fine-grained synchronization on tightly coupled GPUs MD Sinclair, J Alsop, SV Adve 2017 IEEE International Symposium on Workload Characterization (IISWC), 239-249, 2017 | 24 | 2017 |
Inter-kernel reuse-aware thread block scheduling M Huzaifa, J Alsop, A Mahmoud, G Salvador, MD Sinclair, SV Adve ACM Transactions on Architecture and Code Optimization (TACO) 17 (3), 1-27, 2020 | 13 | 2020 |
Specializing coherence, consistency, and push/pull for gpu graph analytics G Salvador, WH Darvin, M Huzaifa, J Alsop, MD Sinclair, SV Adve 2020 IEEE International Symposium on Performance Analysis of Systems and …, 2020 | 9 | 2020 |
Optimizing GPU cache policies for MI workloads J Alsop, MD Sinclair, S Bharadwaj, A Dutu, A Gutierrez, O Kayiran, ... 2019 IEEE International Symposium on Workload Characterization (IISWC), 243-248, 2019 | 8 | 2019 |
GSI: A GPU stall inspector to characterize the sources of memory stalls for tightly coupled GPUs J Alsop, MD Sinclair, R Komuravelli, SV Adve 2016 IEEE International Symposium on Performance Analysis of Systems and …, 2016 | 8 | 2016 |
Limited propagation of unnecessary memory updates J Alsop, P Fotouhi, B Beckmann, S Blagodurov US Patent 11,526,449, 2022 | 1 | 2022 |
A case for fine-grain coherence specialization in heterogeneous systems J Alsop, WT Na, MD Sinclair, S Grayson, S Adve ACM Transactions on Architecture and Code Optimization (TACO) 19 (3), 1-26, 2022 | 1 | 2022 |
Dynamically coalescing atomic memory operations for memory-local computing J Alsop, A Dutu, AGA Shaizeen, N Jayasena US Patent App. 17/361,145, 2022 | | 2022 |
Memory request priority assignment techniques for parallel processors S Puthoor, K Punniyamurthy, O Kayiran, X Zhang, Y Eckert, J Alsop, ... US Patent 11,507,522, 2022 | | 2022 |
Detecting execution hazards in offloaded operations J Alsop, AGA Shaizeen US Patent App. 17/536,817, 2022 | | 2022 |
Adaptive memory consistency in disaggregated datacenters S Blagodurov, BK Potter, J Alsop US Patent App. 17/219,505, 2022 | | 2022 |
System and method for coalesced multicast data transfers over memory interfaces J Alsop, N Jayasena, AGA Shaizeen, A McCrabb US Patent App. 17/218,700, 2022 | | 2022 |
Enforcing data placement requirements via address bit swapping J Alsop, AGA Shaizeen US Patent App. 17/218,994, 2022 | | 2022 |
Approach for enforcing ordering between memory-centric and core-centric memory operations AGA Shaizeen, N Jayasena, J Alsop US Patent App. 17/219,446, 2022 | | 2022 |
Data placement with packet metadata S Blagodurov, J Alsop, S Seyedzadehdelcheh US Patent App. 17/124,872, 2022 | | 2022 |