To understand website complexity in depth, a web page complexity measurement system was developed. The system measures the complexity of a webpage at two levels, the transport level and the content level, using a packet-trace-based approach rather than server or client logs, because packet traces contain more information than either kind of log. Quantitative analyses show that different categories of webpages have different complexity characteristics. Experimental results show that a news webpage usually loads many more elements, at more access levels, from many more web servers in diverse administrative domains, over many more concurrent Transmission Control Protocol (TCP) flows. More than half of the education pages each involve only a few logical servers, and most elements of such a page are fetched from only one or two logical servers. Web game traffic after login usually has the fewest content types. The system can help webpage designers build more efficient webpages and help researchers and Internet users understand the communication details.
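As a rough illustration of the kind of per-page metrics described above, the following minimal sketch summarizes a list of already-parsed fetch records. It is not the system from the abstract: the `FetchRecord` type, its fields, and `complexity_summary` are hypothetical names introduced here, the hostname is used as a stand-in for a "logical server", and the sketch counts distinct TCP flows rather than measuring how many were open concurrently.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class FetchRecord:
    """One fetched page element (hypothetical record layout)."""
    host: str            # stand-in for a logical server (assumption)
    content_type: str    # e.g. the HTTP Content-Type of the element
    flow: tuple          # (client_ip, client_port, server_ip, server_port)


def complexity_summary(records):
    """Return simple content-level and transport-level counts for one page."""
    per_server = Counter(r.host for r in records)
    # Share of elements served by the two busiest servers.
    top2 = sum(count for _, count in per_server.most_common(2))
    return {
        "objects": len(records),
        "servers": len(per_server),
        "content_types": len({r.content_type for r in records}),
        "tcp_flows": len({r.flow for r in records}),
        "top2_server_share": top2 / len(records) if records else 0.0,
    }


# Made-up example data, for illustration only.
example = [
    FetchRecord("a.example", "text/html", ("10.0.0.2", 50000, "192.0.2.1", 80)),
    FetchRecord("a.example", "image/png", ("10.0.0.2", 50000, "192.0.2.1", 80)),
    FetchRecord("a.example", "image/png", ("10.0.0.2", 50001, "192.0.2.1", 80)),
    FetchRecord("b.example", "application/javascript",
                ("10.0.0.2", 50002, "192.0.2.2", 80)),
]
summary = complexity_summary(example)
```

A page whose `top2_server_share` is close to 1.0 would match the pattern reported for education pages, where most elements come from one or two logical servers.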
A significant fraction of today's Internet traffic is associated with popular websites such as YouTube, Netflix, or Facebook. In recent years, major Internet websites have become more complex as they incorporate a larger number and more diverse types of objects (e.g., video, audio, code), fetched in more elaborate ways from multiple servers. These factors not only affect the loading time of pages but also determine the pattern of the resulting traffic on the Internet. In this thesis, we characterize the complexity of major Internet websites through large-scale measurement and analysis. We identify thousands of the most popular Internet websites from multiple locations and characterize their complexity. We examine the effect of relative popularity ranking and business type on the complexity of websites. Finally, we compare and contrast our results with a similar study conducted four years earlier and report the observed changes in different aspects.