ISBN: 9780769534435 (print)
Numerical methods based on interval arithmetic are an efficient means of reliably solving nonlinear systems of equations. Algorithm bc3revise is an interval method that tightens variables' domains by enforcing a property called box consistency. It has been used successfully on difficult problems that eluded traditional numerical methods. We present a new algorithm to enforce box consistency that is simpler than bc3revise, faster, and easily data-parallelizable. A parallel implementation with Intel SSE2 SIMD instructions shows that a performance increase of an order of magnitude or more is achievable.
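To make the idea of tightening a domain concrete, the following is a minimal sketch of bound narrowing by interval evaluation and bisection for a single variable. It is neither bc3revise nor the new algorithm of the abstract; it only illustrates the goal of shrinking a domain until slices at both bounds may still contain a zero. The function f(x) = x**2 - 2, the names `f_interval`, `shrink_left`, `shrink_right` and `narrow`, and the tolerance `eps` are all assumptions made for illustration. Plain floating point is used, whereas a rigorous implementation would round interval bounds outward.

```python
def f_interval(lo, hi):
    """Natural interval extension of the example f(x) = x**2 - 2 over [lo, hi]."""
    if lo >= 0:
        sq = (lo * lo, hi * hi)
    elif hi <= 0:
        sq = (hi * hi, lo * lo)
    else:
        sq = (0.0, max(lo * lo, hi * hi))
    return (sq[0] - 2.0, sq[1] - 2.0)


def may_contain_zero(F, lo, hi):
    """True if the interval evaluation of F over [lo, hi] contains 0."""
    flo, fhi = F(lo, hi)
    return flo <= 0.0 <= fhi


def shrink_left(F, lo, hi, eps):
    """Lower end of the leftmost slice (width <= eps) that may contain a zero."""
    if not may_contain_zero(F, lo, hi):
        return None  # this part of the domain is proven zero-free
    if hi - lo <= eps:
        return lo
    mid = 0.5 * (lo + hi)
    left = shrink_left(F, lo, mid, eps)
    return left if left is not None else shrink_left(F, mid, hi, eps)


def shrink_right(F, lo, hi, eps):
    """Upper end of the rightmost slice (width <= eps) that may contain a zero."""
    if not may_contain_zero(F, lo, hi):
        return None
    if hi - lo <= eps:
        return hi
    mid = 0.5 * (lo + hi)
    right = shrink_right(F, mid, hi, eps)
    return right if right is not None else shrink_right(F, lo, mid, eps)


def narrow(F, lo, hi, eps=1e-6):
    """Tighten [lo, hi]; return None if no zero can exist in the domain."""
    new_lo = shrink_left(F, lo, hi, eps)
    if new_lo is None:
        return None
    return (new_lo, shrink_right(F, new_lo, hi, eps))


if __name__ == "__main__":
    # The domain [0, 4] shrinks to a tiny interval around sqrt(2).
    print(narrow(f_interval, 0.0, 4.0))
```

Each slice test is independent of the others, which hints at why this style of computation lends itself to data-parallel evaluation.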
ISBN: 9780769534435 (print)
The development of numerical simulation software tools for the solution of real-world problems usually calls for domain experts in modeling. The GraPA framework, as an abstraction layer on top of hardware characteristics, supports modelers in two respects: built-in support for co-processing of multiple models, and generically delivered high performance achieved by exploiting the concurrency features of multicore and distributed-memory architectures. Technically, GraPA is designed as a C++ template framework that the modeler's data structures and algorithms instantiate. Using this approach, we handle parallel processing of lock-free data structures and message passing transparently to the modelers. In this paper, we report on the status of the implementation of GraPA and on its performance characteristics.
ISBN: 9780769534435 (print)
In recent years, wireless sensor networks have received great interest, with rapid developments in both hardware and software. However, the limited energy supply of the tiny sensor nodes remains a persistent problem. In wireless imaging sensor networks, which use imaging devices, the energy bottleneck is even more serious. The main activities of such imaging networks relate to image data transmission over wireless channels. Therefore, it is necessary to seek a good balance between image quality and energy conservation. We propose an energy-efficient control scheme based on recent dynamic range coding techniques. Simulation results are presented with emphasis on prolonging the sensor network lifetime.
ISBN: 9780769533520 (print)
As the Internet grows exponentially, scalable backbone routers are required to provide more ports, higher line rates, and larger capacity at acceptable complexity. Until now, most routers have been implemented with a centralized single crossbar as the switched backplane fabric. In terms of crosspoint count, however, the complexity of a single crossbar grows as O(N^2) and becomes unacceptable for a large number of ports. Distributed multiple-stage Clos networks and parallel packet switching fabrics were proposed to provide a large number of ports and a high line-card rate, respectively. To achieve both goals simultaneously, we study a novel multiple-plane and multiple-stage (MPMS) switching fabric in this paper. We first present a graph model for the MPMS fabric based on its topological architecture. Then we study the internal connectivity of the MPMS fabric through the concepts of Vertex In-Degree, Vertex Out-Degree and Vertex Mux Degree. Lastly, we analyze the performance of the MPMS fabric, including its maximum number of ports, line rate, switching capacity and crosspoint complexity, in comparison with the single-stage crossbar fabric.
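The O(N^2) growth mentioned above can be made tangible with a small calculation. The sketch below compares the crosspoint count of a single N x N crossbar with that of a classic symmetric three-stage Clos network; it does not model the MPMS fabric itself, whose structure the abstract does not specify. The function names and the parameters `n` (ports per edge switch), `m` (middle switches) and `r` (edge switches per side) are assumptions that follow the usual textbook notation.

```python
def crossbar_crosspoints(num_ports: int) -> int:
    """Crosspoints in a single N x N crossbar: N^2."""
    return num_ports * num_ports


def clos_crosspoints(n: int, m: int, r: int) -> int:
    """Crosspoints in a symmetric three-stage Clos network.

    r ingress switches of size n x m, m middle switches of size r x r,
    and r egress switches of size m x n, serving N = n * r ports.
    """
    return 2 * r * n * m + m * r * r


if __name__ == "__main__":
    n, r = 32, 32          # 1024 ports in total
    m = 2 * n - 1          # Clos condition for strict-sense non-blocking
    print(crossbar_crosspoints(n * r))   # 1048576
    print(clos_crosspoints(n, m, r))     # 193536
```

For 1024 ports the three-stage arrangement needs fewer than a fifth of the crosspoints of the single crossbar, which is the kind of saving that motivates multi-stage designs.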
ISBN: 9780769533520 (print)
Clusters built from single-core systems are cost-effective in terms of performance improvement and availability. However, hardware constraints limit the performance of single-core systems. Hence, it is difficult to meet the increasingly high performance requirements of diversified applications at different levels of general-purpose computing. A promising and feasible solution is the novel multi-core systems, which extend parallelism to the CPU level by integrating multiple processing units on a single die. This paper uses the Finite-Difference Time-Domain (FDTD) algorithm as a case study, designing suitable parallel FDTD algorithms for three architectures: distributed-memory machines with single-core processors, shared-memory machines with dual-core processors, and the Cell Broadband Engine (Cell/B.E.) processor with nine heterogeneous cores. The experimental results show that the Cell/B.E. processor using 8 SPEs achieves significant speedups at the processor level: 7.05 over an AMD single-core Opteron processor and 3.37 over an AMD dual-core Opteron processor.
ISBN: 9780769534435 (print)
This paper presents a novel video service system, DCSVS (Distributed Collaborative Set-top-box Video Service), which encompasses several practical and effective solutions for both live and VoD (Video-on-Demand) services. DCSVS is built on an overlay DHT (Distributed Hash Table) network that improves the Kademlia protocol to suit real-time applications. We use several types of pre-fetching to enhance the continuity rate of the VoD service, and we employ an effective scheduling algorithm based on transfer priorities. We design the Inner-First and Proxy-Forward strategy to mitigate transfer failures, and we use window-based storage and hashing to achieve better system performance. Both theoretical analysis and experimental results show that the strategies in DCSVS perform well in terms of efficiency and robustness, and can maintain a good state and scalability in large-scale networks.
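For readers unfamiliar with Kademlia, the sketch below shows the XOR distance metric at the heart of the standard protocol, together with a lookup of the k closest known nodes. It does not reproduce the real-time improvements made in DCSVS, which the abstract does not detail. The function names and the 4-bit identifiers in the example are assumptions chosen for illustration; real deployments use much longer identifiers.

```python
def xor_distance(a: int, b: int) -> int:
    """Kademlia distance between two node or key identifiers."""
    return a ^ b


def bucket_index(own_id: int, other_id: int) -> int:
    """Index of the k-bucket that holds other_id, as seen from own_id.

    This is the position of the highest differing bit; the two
    identifiers must be different.
    """
    distance = xor_distance(own_id, other_id)
    if distance == 0:
        raise ValueError("a node does not store itself in a bucket")
    return distance.bit_length() - 1


def closest_nodes(target: int, node_ids: list[int], k: int) -> list[int]:
    """Return the k known identifiers closest to target under XOR distance."""
    return sorted(node_ids, key=lambda node: xor_distance(node, target))[:k]


if __name__ == "__main__":
    known = [0b1000, 0b1011, 0b0010, 0b1110]
    print(closest_nodes(0b1010, known, k=2))  # [11, 8]
```

Because the metric is symmetric and each lookup step moves closer to the target, a node can locate content holders in a number of hops that grows logarithmically with network size, a property that matters for latency-sensitive video delivery.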
ISBN: 9780769534435 (print)
Scheduling tasks onto the processors of a parallel system is a crucial part of program parallelisation. Due to the NP-hard nature of the task scheduling problem, scheduling algorithms are based on heuristics that try to produce good rather than optimal schedules. Nevertheless, in certain situations it is desirable to have optimal schedules, for example for time-critical systems or to evaluate scheduling heuristics. This paper investigates the task scheduling problem using the A* search algorithm. The implemented A* scheduling algorithm can produce optimal schedules in reasonable time for small to medium-sized task graphs. Compared with a previous approach, the A* scheduling algorithm presented here has a significantly reduced search space due to a much improved cost function f(s) and additional pruning techniques. Finally, the experimental results show that the proposed A* scheduling algorithm significantly outperforms the previous approach.
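As a rough illustration of how A* can be applied to task scheduling, the sketch below searches over partial schedules of a small task graph on identical processors and returns the optimal makespan. It is a simplified model, not the algorithm of the paper: communication costs are ignored, no pruning or duplicate detection is performed, and the cost function used here (the larger of the current makespan and an average-load bound) is an assumption standing in for the paper's f(s), which the abstract does not define. All names, including `astar_schedule`, are hypothetical.

```python
import heapq
from itertools import count


def astar_schedule(weights, preds, num_procs):
    """Return the optimal makespan for a task graph on identical processors.

    weights: dict mapping task -> computation time
    preds:   dict mapping task -> iterable of predecessor tasks
    """
    tasks = sorted(weights)
    total_work = sum(weights.values())
    tie_breaker = count()  # keeps heap entries comparable
    start_state = ((), (0,) * num_procs)  # (scheduled tasks, processor ready times)
    heap = [(total_work / num_procs, next(tie_breaker), start_state)]

    while heap:
        _, _, (done, ready) = heapq.heappop(heap)
        finish = dict(done)
        if len(finish) == len(tasks):
            return max(ready)  # first complete schedule popped is optimal

        for task in tasks:
            task_preds = preds.get(task, ())
            if task in finish or any(p not in finish for p in task_preds):
                continue  # already scheduled, or not ready yet
            data_ready = max((finish[p] for p in task_preds), default=0)
            for proc in range(num_procs):
                end = max(ready[proc], data_ready) + weights[task]
                new_ready = ready[:proc] + (end,) + ready[proc + 1:]
                new_done = done + ((task, end),)
                remaining = total_work - sum(weights[t] for t, _ in new_done)
                # Admissible lower bound: no schedule can finish before the
                # current makespan or before the average processor load.
                f = max(max(new_ready), (sum(new_ready) + remaining) / num_procs)
                heapq.heappush(heap, (f, next(tie_breaker), (new_done, new_ready)))
    return None


if __name__ == "__main__":
    # Fork-join graph: a -> b, a -> c, then b and c -> d.
    weights = {"a": 2, "b": 3, "c": 3, "d": 1}
    preds = {"b": ["a"], "c": ["a"], "d": ["b", "c"]}
    print(astar_schedule(weights, preds, num_procs=2))  # 6
```

Even in this toy form the number of states grows quickly with the number of tasks, which is why a tighter cost function and pruning, as the abstract emphasizes, matter so much in practice.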
ISBN: 9780769530994 (print)
Parallel I/O is one of the key technologies for high performance computers. First, this paper introduces the I/O systems of typical machines on the newest Top500 list. Second, a new distributed shared parallel I/O system for high performance computers (DSPIO) is put forward, and some key technologies implemented in the system are discussed. Finally, a prototype system is built. The experimental results show that this architecture offers high I/O bandwidth and good scalability, and is well suited to high performance computing.
ISBN: 9781605583495 (print)
Ubiquitous computing aims at providing computing functionalities embedded into everyday life. Technologies such as networked sensors, actuators, mobile devices, appliances, and stationary computing infrastructures are intended to support smart services by making use of context information. This information may be of interest only for the moment the service is provided, as in location-based services, or it may be collected for later use, e.g., to optimize logistic workflows. Particularly when components interact spontaneously, it becomes difficult to keep track of the provenance of sensor data or even of the services provided in a particular situation. If problems occur, such as privacy and security attacks or service failures, this lack of traceable information may decrease the trustworthiness of systems, and legal requirements might not be fulfilled properly. In this position paper, we introduce provenance data to increase reliability in ubiquitous computing systems. Here, provenance is a summary of the historical information about gathering, collecting, and aggregating data. We demonstrate and motivate the usefulness of the approach and discuss open issues and further research directions by presenting a health-care emergency use case. Copyright 2008 ACM.
ISBN: 9781424416936 (print)
This paper describes the RTGrid distributed simulation framework for conformal radiotherapy. We introduce novel approaches through which several distributed computing technologies are made accessible to Monte Carlo simulations for radiotherapy dose calculations. Currently, radiotherapy treatment planning is typically performed on PCs and workstations, which lack the computational power to run Monte Carlo simulations quickly enough to be useful to clinicians. Therefore, although Monte Carlo simulation techniques offer highly accurate dose calculations, they are seldom chosen for clinical deployment. The RTGrid project is investigating both the capability and capacity modes of exploiting grid computing for radiotherapy treatment planning using Monte Carlo simulations.