Paper List

2019
Latency Hiding based Warp Scheduling Policy for High Performance GPUs
Korea Society of Computer and Information (한국컴퓨터정보학회)
Paper Information
Publisher
Journal of the Korea Society of Computer and Information (한국컴퓨터정보학회논문지)
Issue Date
2019-04-01
Keywords
-
Citation
-
Source
-
Journal Title
-
Volume
24
Number
4
Start Page
1
End Page
9
DOI
ISSN
1598-849X
Abstract
The LRR (Loose Round Robin) warp scheduling policy for GPU architectures yields high warp-level parallelism and balanced load across warps. However, the traditional LRR policy causes multiple warps to execute long-latency operations at the same time. When no further warps can be issued while those long-latency operations are outstanding, GPU throughput may degrade significantly. In this paper, we propose a new warp scheduling policy that exploits latency hiding, leading to better utilization of memory resources in high-performance GPUs. The proposed warp scheduler prioritizes memory instructions based on the GTO (Greedy Then Oldest) policy in order to reduce memory stalls. When no warp can execute a memory instruction any more, the warp scheduler selects a warp for a computation instruction in a round-robin manner. Furthermore, the proposed technique achieves high performance by using additional information about recently committed warps. According to our experimental results, the proposed technique improves GPU performance by 12.7% and 5.6% on average over LRR and GTO, respectively.
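
The two-level selection described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the `Warp` and `Scheduler` names, the `age` field, and the choice of Python are all hypothetical, and the abstract's use of information about recently committed warps is not modeled because the abstract does not specify it.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Warp:
    wid: int            # warp slot index, assumed to lie in range(len(warps))
    age: int            # smaller value = older warp (hypothetical field)
    next_is_mem: bool   # True if the warp's next instruction is a memory instruction
    ready: bool = True  # False while the warp is stalled


class Scheduler:
    """Memory instructions first (GTO order), then compute (round robin)."""

    def __init__(self) -> None:
        self.last_mem_wid: Optional[int] = None  # greedy target for the GTO step
        self.rr_next: int = 0                    # round-robin pointer for compute

    def select(self, warps: List[Warp]) -> Optional[Warp]:
        ready = [w for w in warps if w.ready]

        # Step 1: prioritize warps whose next instruction is a memory instruction.
        mem = [w for w in ready if w.next_is_mem]
        if mem:
            # Greedy: keep issuing from the warp chosen last time if it is still ready.
            for w in mem:
                if w.wid == self.last_mem_wid:
                    return w
            # Otherwise fall back to the oldest ready memory warp.
            chosen = min(mem, key=lambda w: w.age)
            self.last_mem_wid = chosen.wid
            return chosen

        # Step 2: no memory instruction can issue, so pick a compute warp
        # in round-robin order starting from the pointer.
        if not ready:
            return None
        n = len(warps)
        chosen = min(ready, key=lambda w: (w.wid - self.rr_next) % n)
        self.rr_next = (chosen.wid + 1) % n
        return chosen
```

Calling `select` once per issue cycle returns the warp to issue, or `None` when every warp is stalled. Keeping the greedy target and the round-robin pointer as separate state means the compute rotation is not disturbed while memory instructions are being drained.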

Author Information

Name Affiliation
No data registered.