2019
Latency Hiding based Warp Scheduling Policy for High Performance GPUs
Korea Society of Computer and Information (한국컴퓨터정보학회)
Paper Information
- Publisher
- Journal of the Korea Society of Computer and Information (한국컴퓨터정보학회논문지)
- Issue Date
- 2019-04-01
- Volume
- 24
- Number
- 4
- Start Page
- 1
- End Page
- 9
- DOI
- ISSN
- 1598-849X
Abstract
The LRR (Loose Round Robin) warp scheduling policy for GPU architectures yields high warp-level parallelism and a balanced load across warps. However, the traditional LRR policy causes multiple warps to execute long-latency operations at the same time. When no more warps can be issued during these long-latency periods, GPU throughput may degrade significantly. In this paper, we propose a new warp scheduling policy that exploits latency hiding, leading to better utilization of memory resources in high-performance GPUs. The proposed warp scheduler prioritizes memory instructions based on the GTO (Greedy Then Oldest) policy in order to reduce memory stalls. When no warp can execute a memory instruction, the scheduler selects a warp for a computation instruction in a round-robin manner. Furthermore, the proposed technique achieves high performance by using additional information about recently committed warps. According to our experimental results, the proposed technique improves GPU performance by 12.7% and 5.6% on average over LRR and GTO, respectively.
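The two-level selection described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Warp` fields, the class name `MemFirstScheduler`, and the use of a lower warp id as a stand-in for "older" are all assumptions made for illustration. The abstract does not say how the information about recently committed warps is used, so the sketch omits that part.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Warp:
    wid: int            # warp id; a lower id is treated as "older" (assumption)
    next_is_mem: bool   # True if the warp's next instruction is a memory instruction
    ready: bool = True  # True if the warp can issue this cycle


class MemFirstScheduler:
    """Hypothetical sketch: memory instructions first (GTO), then compute (round robin)."""

    def __init__(self) -> None:
        self.last_mem: Optional[int] = None  # warp that last issued a memory instruction
        self.rr_ptr: int = 0                 # round-robin pointer for compute instructions

    def pick(self, warps: List[Warp]) -> Optional[Warp]:
        # Step 1: prefer warps whose next instruction is a memory instruction.
        mem = [w for w in warps if w.ready and w.next_is_mem]
        if mem:
            # Greedy: keep issuing from the same warp while it remains a candidate.
            for w in mem:
                if w.wid == self.last_mem:
                    return w
            # Then oldest: otherwise fall back to the oldest candidate.
            oldest = min(mem, key=lambda w: w.wid)
            self.last_mem = oldest.wid
            return oldest
        # Step 2: no memory instruction can issue, so rotate over ready warps.
        n = len(warps)
        for i in range(n):
            w = warps[(self.rr_ptr + i) % n]
            if w.ready:
                self.rr_ptr = (self.rr_ptr + i + 1) % n
                return w
        return None  # nothing can issue this cycle
```

In this sketch a caller would invoke `pick` once per issue cycle and update each warp's `next_is_mem` and `ready` flags between calls. Issuing memory instructions early is what gives later compute instructions the chance to overlap with outstanding memory latency.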
- Chonnam National University (전남대학교)
- KCI
- Journal of the Korea Society of Computer and Information (한국컴퓨터정보학회논문지)
Author Information
| Name | Affiliation | ||
|---|---|---|---|
| No data registered. | |||