Cloud computing infrastructure offers the computing resources as a homogeneous collection of virtual machine instances by different hardware on figurations, which is transparent to end users. In fact, the computationa...
详细信息
The aim of this paper is to propose a new model for supporting proactive decision making in large enterprises. These organizations usually encounter enormous electronic transactions upon distributed infrastructures. T...
详细信息
The necessity for capping carbon emission has significantly restricted the potential of modern data centers. For this matter, both industry and academia are proactively seeking opportunities on cross-layer power manag...
详细信息
ISBN:
(纸本)9781467355872
The necessity for capping carbon emission has significantly restricted the potential of modern data centers. For this matter, both industry and academia are proactively seeking opportunities on cross-layer power management schemes that could open a door for sustainable high-performance computing platform. In this paper we investigate an emerging trend in the IT industry: using promising onsite distributed generation (DG) techniques to provide premium clean energy to the computing load. We develop data center power demand shaping (PDS), a novel technique that allows data centers to utilize onsite green energy efficiently. In contrast to prior design, PDS takes advantage of a so-far unexplored power supply feature, i.e., the load following capabilities of DG systems to avoid the high performance penalty issue incurred during supply tracking. In addition, PDS features two adaptive power management schemes: DGR Boost and UPS Boost. These two workload-aware optimization methods leverage mature computer tuning knobs to achieve attractive data center performance improvement. Using real-world data center traces and industry data of distributed generation systems, we show that our technique can come within 1.2% performance of an ideal oracle, which is roughly a 37% improvement over existing supply tracking based design. Our design could save over 100 metric tons of carbon emissions annually for a 10MW data center.
parallel communicating grammar systems (PCGS) were introduced awhile ago purportedly to analyze concurrent systems on a language-theoretic level. To our knowledge however no actual relationship between PCGS and practi...
详细信息
The popularity of cloud-based interactive computing services (e. g., virtual desktops) brings new management challenges. Each interactive user leaves abundant but fluctuating residual resources while being intolerant ...
详细信息
ISBN:
(纸本)9780769551029
The popularity of cloud-based interactive computing services (e. g., virtual desktops) brings new management challenges. Each interactive user leaves abundant but fluctuating residual resources while being intolerant to latency, precluding the use of aggressive VM consolidation. In this paper, we present the Resource Harvester for Interactive Clouds (RHIC), an autonomous management framework that harnesses dynamic residual resources aggressively without slowing the harvested interactive services. RHIC builds ad-hoc clusters for running throughput-oriented "background" workloads using a hybrid of residual and dedicated resources. These hybrid clusters offer significant gains over normal dedicated clusters: 20-40% cost and 20-29% energy savings in our testbed. For a given background job, RHIC intelligently discovers and maintains the ideal cluster size and composition, to meet user-specified goals such as cost/energy minimization or deadlines. RHIC employs black-box workload performance modeling, requiring only systemlevel metrics and incorporating techniques to improve modeling accuracy with bursty and heterogeneous residual resources. We demonstrate the effectiveness and adaptivity of our RHIC prototype with two parallel data analytics frameworks, Hadoop and HBase. Our results show that RHIC finds near-ideal cluster sizes and compositions across a wide range of workload/goal combinations.
Temporal and multi-version databases are ideal candidates for a distributed store, which offers large storage space, and parallel and distributed processing power from a cluster of (commodity) machines. A key challeng...
详细信息
Most multi-core and some many-core processors implement cache coherency protocols that heavily complicate the design of optimal parallel algorithms. Communication is performed implicitly by cache line transfers betwee...
详细信息
One feature of cloud storage systems is data fragmentation (or sharding) so that data can be distributed over multiple servers and sub queries can be run in parallel on the fragments. On the other hand, flexible query...
详细信息
The proceedings contain 12 papers. The topics discussed include: exploiting in-network processing for big data management;deepsea: self-adaptive data partitioning and replication in scalable distributed data systems;d...
ISBN:
(纸本)9781450321556
The proceedings contain 12 papers. The topics discussed include: exploiting in-network processing for big data management;deepsea: self-adaptive data partitioning and replication in scalable distributed data systems;designing a database system for modern processing architectures;learning queries for relational, semi-structured, and graph databases;turning scientists into data explorers;information diffusion in online social networks;exploratory mining of collaborative social content;discovering and disambiguating named entities in text;the tantalizing new prospect of index-based diversified retrieval;effective hashing for large-scale multimedia search;efficient and scalable monitoring and summarization of large probabilistic data;and RDF-4G: algorithmic building blocks for large-scale graph analytics.
Polygon overlay is one of the complex operations in Geographic Information systems (GIS). In GIS, a typical polygon tends to be large in size often consisting of thousands of vertices. Sequential algorithms for this p...
详细信息
暂无评论