TY - JOUR
T1 - A consistency-free memory architecture for sort-last parallel rendering processors
AU - Park, Woo Chan
AU - Kim, Cheong Ghil
AU - Yoon, Duk Ki
AU - Lee, Kil Whan
AU - Kim, Il San
AU - Han, Tack Don
PY - 2007/5
Y1 - 2007/5
N2 - Current rendering processors are aiming to process triangles as fast as possible and they have the tendency of equipping with multiple rasterizers to be capable of handling a number of triangles in parallel for increasing polygon rendering performance. However, those parallel architectures may have the consistency problem when more than one rasterizer try to access the data at the same address. This paper proposes a consistency-free memory architecture for sort-last parallel rendering processors, in which a consistency-free pixel cache architecture is devised and effectively associated with three different memory systems consisting of a single frame buffer, a memory interface unit, and consistency-test units. Furthermore, the proposed architecture can reduce the latency caused by pixel cache misses because the rasterizer does not wait until cache miss handling is completed when the pixel cache miss occurs. The experimental results show that the proposed architecture can achieve almost linear speedup upto four rasterizers with a single frame buffer.
AB - Current rendering processors are aiming to process triangles as fast as possible and they have the tendency of equipping with multiple rasterizers to be capable of handling a number of triangles in parallel for increasing polygon rendering performance. However, those parallel architectures may have the consistency problem when more than one rasterizer try to access the data at the same address. This paper proposes a consistency-free memory architecture for sort-last parallel rendering processors, in which a consistency-free pixel cache architecture is devised and effectively associated with three different memory systems consisting of a single frame buffer, a memory interface unit, and consistency-test units. Furthermore, the proposed architecture can reduce the latency caused by pixel cache misses because the rasterizer does not wait until cache miss handling is completed when the pixel cache miss occurs. The experimental results show that the proposed architecture can achieve almost linear speedup upto four rasterizers with a single frame buffer.
UR - http://www.scopus.com/inward/record.url?scp=33947316920&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33947316920&partnerID=8YFLogxK
U2 - 10.1016/j.sysarc.2006.10.010
DO - 10.1016/j.sysarc.2006.10.010
M3 - Article
AN - SCOPUS:33947316920
SN - 1383-7621
VL - 53
SP - 272
EP - 284
JO - Euromicro Newsletter
JF - Euromicro Newsletter
IS - 5-6
ER -