The performance of multiprogrammed shared-memorymultiprocessors suffers often from scheduler interventions that neglect data locality. On cache-coherent distributedshared-memory (DSM) multiprocessors, such scheduler...
详细信息
The performance of multiprogrammed shared-memorymultiprocessors suffers often from scheduler interventions that neglect data locality. On cache-coherent distributedshared-memory (DSM) multiprocessors, such scheduler interventions tend to increase the rate of remote memory accesses. This paper presents a novel dynamic page migration algorithm that remedies this problem in iterative parallel programs, The purpose of the algorithm is the early detection of pages that will most likely be accessed remotely by threads associated with them via a thread-to-memory affinity relation. The key mechanism that enables timely identification of these pages is a communication interface between the page migration engine and the operating system scheduler. The algorithm improves previously proposed competitive page migration algorithms in many aspects. including accuracy, timeliness and cost amortization. Most notably, the algorithm is not biased by obsolete memory access history that may be accumulated in the page access counters at runtime. Experiments on the SGI Origin-2000 show that the algorithm outperforms by far the best static page placement algorithm and a customized page migration engine implemented in IRIX. the Origin-2000 operating system. The algorithm is implemented at user-level and its functionality is orthogonal to the scheduling police of the operating system. (C) 2002 Elsevier Science (USA).
暂无评论