As the Internet and World Wide web grow at a fast pace, it is essential that the web's performance should keep up with increased demand and expectations. web caching technology has been widely accepted as one of t...
详细信息
As the Internet and World Wide web grow at a fast pace, it is essential that the web's performance should keep up with increased demand and expectations. web caching technology has been widely accepted as one of the effective approaches to alleviating web traffic and increase the web Quality of Service (QoS). This paper provides an up-to-date survey of the rapidly expanding web caching literature. It discusses the state-of-the-art web caching schemes and techniques, with emphasis on the recent developments in web caching technology such as the differentiated web services, heterogeneous caching network structures, and dynamic content caching.
Research in web caching has yielded analytical tools to model the behavior of large-scale web caches. Recently, Wolman et al. (Proceedings of the 17th ACM Symposium on Operating Systems Principles, December 1999) have...
详细信息
Research in web caching has yielded analytical tools to model the behavior of large-scale web caches. Recently, Wolman et al. (Proceedings of the 17th ACM Symposium on Operating Systems Principles, December 1999) have proposed an analytical model and used it to evaluate the potential of cooperative web proxy caching for large populations. This paper shows how to apply the Wolman model to study the behavior of interior cache servers in multi-level caching systems. Focusing on interior caches gives a different perspective on the model's implications, and it allows three new uses of the model. First, we apply the model to large-scale caching systems in which the interior nodes belong to third-party content distribution services. Second, we explore the effectiveness of content distribution services as conventional web proxy caching becomes more prevalent. Finally, we correlate the model's predictions of interior cache behavior with empirical observations from the root caches of the NLANR cache hierarchy. (C) 2001 Published by Elsevier Science B.V.
We introduce and analyze new randomized policies for the management of web caches. The proposed policies are fully scalable, i.e., handling a hit or an eviction requires only O(1) time. Our analysis is probabilistic, ...
详细信息
We introduce and analyze new randomized policies for the management of web caches. The proposed policies are fully scalable, i.e., handling a hit or an eviction requires only O(1) time. Our analysis is probabilistic, in nature, and based on an extended version of the independent reference model. The extension is needed in order to deal with the varying-cost and -size features of web documents. Under this assumption, we derive closed-form expressions for the stationary probabilities of finding each possible arrangement of the documents within the cache. Our analysis shows that the performance of the proposed algorithms are close to that of the optimal off-line algorithm. Using simulations and real traces, we also show that the new algorithms perform at least as well as existing algorithms of higher complexity. Variations on the algorithms, aimed at increasing their responsiveness to non-stationary trends, are also investigated. (C) 2001 Elsevier Science B.V. All rights reserved.
We introduce intelligent web caching algorithms that employ predictive models of web requests;the general idea is to extend the least recently used (LRU) policy of web and proxy, servers by making it sensitive to web ...
详细信息
We introduce intelligent web caching algorithms that employ predictive models of web requests;the general idea is to extend the least recently used (LRU) policy of web and proxy, servers by making it sensitive to web access models extracted from web log data using data mining techniques. Two approaches have been studied in particular, frequent patterns and decision trees. The experimental results of the new algorithms show substantial improvement over existing LRU-based caching techniques, in terms of hit rate. We designed and developed a prototypical system, which supports data warehousing of web log data, extraction of data mining models and simulation of the web caching algorithms. (C) 2001 Elsevier Science B.V. All rights reserved.
web caching is an important technique for reducing Internet access latency, network traffic, and server load. This paper investigates cache management strategies for the en-route web caching environment, where caches ...
详细信息
web caching is an important technique for reducing Internet access latency, network traffic, and server load. This paper investigates cache management strategies for the en-route web caching environment, where caches are associated with routing nodes in the network. We propose a novel caching scheme that integrates both object placement and replacement policies and which makes caching decisions on all candidate sites in a coordinated fashion. In our scheme, cache status information along the routing path of a request is used in dynamically determining where to cache the requested object and what to replace if there is not enough space, The object placement problem is formulated as an optimization problem and the optimal locations to cache the object are obtained using a low-cost dynamic programming algorithm. Extensive simulation experiments have been performed to evaluate the proposed scheme in terms of a wide range of performance metrics. The results show that the proposed scheme significantly outperforms existing algorithms which consider either object placement or replacement at individual caches only.
An adaptive, highly scalable, and robust web caching system is needed to effectively handle the exponential growth and extreme dynamic environment of the World Wide web. Our work presented last year sketched out the b...
详细信息
An adaptive, highly scalable, and robust web caching system is needed to effectively handle the exponential growth and extreme dynamic environment of the World Wide web. Our work presented last year sketched out the basic design of such a system. This sequel paper reports our progress over the past year. To assist caches making web query forwarding decisions, we sketch out the basic design of a URL routing framework. To assist fast searching within each cache group, we let neighbor caches share content information. Equipped with the URL routing table and neighbor cache contents, a cache in the revised design can now search the local group, and forward all missing queries quickly and efficiently, thus eliminating both the waiting delay and the overhead associated with multicast queries. The paper also presents a proposal for incremental deployment that provides a smooth transition from the currently deployed cache infrastructure to the new design. (C) 1998 Published by Elsevier Science B.V. All rights reserved.
web caching is a technology to improve network traffic on the Internet. It is a temporary storage of web objects for later retrieval. Three significant advantages of web caching include reduction in bandwidth consumpt...
详细信息
web caching is a technology to improve network traffic on the Internet. It is a temporary storage of web objects for later retrieval. Three significant advantages of web caching include reduction in bandwidth consumption, server load, and latency. These advantages make the web to be less expensive yet it provides better performance. This research aims to introduce an advanced machine learning method for a classification problem in web caching that requires a decision to cache or not to cache web objects in a proxy cache server. The challenges in this classification problem include the issues in identifying attributes ranking and improve the classification accuracy significantly. This research includes four methods that are Classification and Regression Trees (CART), Multivariate Adaptive Regression Splines (MARS), Random Forest (RF) and Tree Net (TN) for classification on web caching. The experimental results reveal that CART performed extremely well in classifying web objects from the existing log data with a size of web objects as a significant attribute for web cache performance enhancement.
Peer-to-peer web caching has attracted a great attention from the research community recently, and is one of the potential peer-to-peer applications. In this paper, we systematically examine the three orthogonal dimen...
详细信息
Peer-to-peer web caching has attracted a great attention from the research community recently, and is one of the potential peer-to-peer applications. In this paper, we systematically examine the three orthogonal dimensions to design a peer-to-peer web caching system, including the caching algorithm, the document lookup algorithm, and the peer granularity. In addition to the traditional URL-based caching algorithm, we also evaluate the content-based caching algorithm for both dynamic and static web content. Four different document lookup algorithms are compared and evaluated in the context of four different peer granularities, i.e., host level, organization level, building level, and centralized. A detailed simulation, using the traces collected at a medium size education institution, is conducted for the purpose of performance evaluation. Finally, several implications derived from this analysis are also discussed. (c) 2005 Elsevier Inc. All rights reserved.
We extend the classic paging model by allowing reordering of requests under the constraint that a request is delayed by no longer than a predetermined number of time steps. We first give a dynamic programming algorith...
详细信息
We extend the classic paging model by allowing reordering of requests under the constraint that a request is delayed by no longer than a predetermined number of time steps. We first give a dynamic programming algorithm to solve the offline case. Then we give tight bounds on competitive ratios for the online case. For caches of size k, we obtain bounds of k + O(1) for deterministic algorithms and Theta(log k) for randomized algorithms. We also give bounds for the case where either the online or the offline algorithm can reorder the requests, but not both. Finally, we extend our analysis to the case where pages have different sizes. (C) 2004 Published by Elsevier B.V.
In this paper, the effect of web caching on network planning, in the sense of bandwidth computation for the access link interconnecting the ISP's subnet with the Internet, is studied by means of simulations. The l...
详细信息
In this paper, the effect of web caching on network planning, in the sense of bandwidth computation for the access link interconnecting the ISP's subnet with the Internet, is studied by means of simulations. The latency of a browser retrieving files is studied for given traffic characteristics, number of users, bandwidth of access link, and cache hit rate. From our analysis, we find that using a well-designed web cache with a 50% hit rate is more effective than doubling the bandwidth for an ISP's access link to the Internet, with respect to decreasing retrieval latency. In other words, we can reduce half of the bandwidth of the access Link by using web caching without compromising retrieval latency. The results are very encouraging and useful to the ISPs, enterprises and universities that are planning to interconnect their Intranets with the Internet. (C) 1999 Elsevier Science B.V. All rights reserved.
暂无评论