Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Impact of traffic distribution on web cache performance

Impact of traffic distribution on web cache performance Caches are a critical element of web-based information systems. Understanding the expected behaviour of cache policies is especially important for achieving good quality of service. Existing works have suggested that the behaviour of the web demand can be modelled as a Zipf distribution with 1. New evidence, which is presented in this paper, shows that today websites are following Zipf distributions with > 1. This article analyses real logs obtained from the client layer of high traffic websites. The main result of this article is that under these conditions, the cache hit ratio can be extremely high with a very small cache size. This means that a very expensive and high resource demanding cache is not needed for effective implementation: a cache size equal to 0.6% of the working set is enough to reach more than 80% of hit ratio, once the right replacement policy has been chosen. Keywords: web cache; Zipf distribution; web logs; hit rate; web performance. Reference to this paper should be made as follows: Zotano, M.G., G´mez-Sanz, J. and Pav´n, J. (2015) `Impact of traffic distribution on web o o cache performance', Int. J. Web Engineering and Technology, Vol. 10, No. 3, pp.202­213. Biographical http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png International Journal of Web Engineering and Technology Inderscience Publishers

Loading next page...
 
/lp/inderscience-publishers/impact-of-traffic-distribution-on-web-cache-performance-IFNMvaG8jc
Publisher
Inderscience Publishers
Copyright
Copyright © 2015 Inderscience Enterprises Ltd.
ISSN
1476-1289
eISSN
1741-9212
DOI
10.1504/IJWET.2015.072349
Publisher site
See Article on Publisher Site

Abstract

Caches are a critical element of web-based information systems. Understanding the expected behaviour of cache policies is especially important for achieving good quality of service. Existing works have suggested that the behaviour of the web demand can be modelled as a Zipf distribution with 1. New evidence, which is presented in this paper, shows that today websites are following Zipf distributions with > 1. This article analyses real logs obtained from the client layer of high traffic websites. The main result of this article is that under these conditions, the cache hit ratio can be extremely high with a very small cache size. This means that a very expensive and high resource demanding cache is not needed for effective implementation: a cache size equal to 0.6% of the working set is enough to reach more than 80% of hit ratio, once the right replacement policy has been chosen. Keywords: web cache; Zipf distribution; web logs; hit rate; web performance. Reference to this paper should be made as follows: Zotano, M.G., G´mez-Sanz, J. and Pav´n, J. (2015) `Impact of traffic distribution on web o o cache performance', Int. J. Web Engineering and Technology, Vol. 10, No. 3, pp.202­213. Biographical

Journal

International Journal of Web Engineering and TechnologyInderscience Publishers

Published: Jan 1, 2015

There are no references for this article.