Abstract
This paper presents an address translation scheme in GPUs named SnakeByte that can dynamically manage variable-sized pages and maximize TLB reach by recursively merging contiguous pages. Memory virtualization has become an integral part of GPUs to enhance programmability and memory management efficiency. However, conventional memory virtualization methods using multi-level page tables and caching them in TLBs are insufficient to provide GPUs with enough address translation coverage for the massive volume of data. SnakeByte implements a hardware-based address translation mechanism that recursively merges contiguous pages into larger page groups and effectively extends TLB coverage. SnakeByte allows multiple equal-sized pages coalescing into a page table entry (PTE). It records the validity of pages to be merged using a bit vector, and few bits are annexed to indicate the size of merged pages. If all pages covered by the PTE are allocated with contiguity, the PTE is promoted to be further coalesced into a larger page group. The recursive coalescence of contiguous pages enables SnakeByte to handle variable-sized page groups with the exponentially increasing TLB reach. Associated with a contiguity-aware memory allocator, SnakeByte can consolidate vastly contiguous address spaces into a few TLB entries. Consequently, it significantly reduces TLB misses for large working sets in GPUs and achieves substantial performance improvements. Experiment results show that SnakeByte decreases the number of page table walks by 6.5x and enhances the GPU performance by 2.0x on average over the conventional paging scheme.
Original language | English |
---|---|
Title of host publication | 2023 IEEE International Symposium on High-Performance Computer Architecture, HPCA 2023 - Proceedings |
Publisher | IEEE Computer Society |
Pages | 1195-1207 |
Number of pages | 13 |
ISBN (Electronic) | 9781665476522 |
DOIs | |
Publication status | Published - 2023 |
Event | 29th IEEE International Symposium on High-Performance Computer Architecture, HPCA 2023 - Montreal, Canada Duration: 2023 Feb 25 → 2023 Mar 1 |
Publication series
Name | Proceedings - International Symposium on High-Performance Computer Architecture |
---|---|
Volume | 2023-February |
ISSN (Print) | 1530-0897 |
Conference
Conference | 29th IEEE International Symposium on High-Performance Computer Architecture, HPCA 2023 |
---|---|
Country/Territory | Canada |
City | Montreal |
Period | 23/2/25 → 23/3/1 |
Bibliographical note
Funding Information:ACKNOWLEDGEMENT This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF-2021R1A2B5B01002932), by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2022R1C1C1011021), and by the Samsung Electronics Company, Ltd., Hwaseong, Korea. Won Woo Ro and William J. Song are the co-corresponding authors.
Publisher Copyright:
© 2023 IEEE.
All Science Journal Classification (ASJC) codes
- Hardware and Architecture