Adaptive Body Biasing (ABB) is a popularly used technique to mitigate the increasing impact of manufacturing process variations on leakage power dissipation. The efficacy of the ABB technique can be improved by partit...
详细信息
ISBN:
(纸本)9781605584706
Adaptive Body Biasing (ABB) is a popularly used technique to mitigate the increasing impact of manufacturing process variations on leakage power dissipation. The efficacy of the ABB technique can be improved by partitioning a design into a number of "body-bias islands," each with its individual body-bias voltage. In this paper, we propose a system-level leakage variability mitigation framework to partition a multiprocessor system into body-bias islands at the processing element (PE) granularity at design time, and to optimally assign body-bias voltages to each island post-fabrication. As opposed to prior gate- and circuit-level partitioning techniques that constrain the global clock frequency of the system, we allow each island to run at a different speed and constrain only the relevant system performance metrics - in our case the execution deadlines. Experimental results show the efficacy of the proposed framework in reducing the mean and standard deviation of leakage power dissipation compared to a baseline system without ABB. At the same time, the proposed techniques provide significant runtime improvements over a previously proposed Monte-Carlo based technique while providing similar reductions in leakage power dissipation. Copyright 2008 ACM.
Configurable multiprocessor system is a promising design alternative because of its high degree of flexibility, short development time, and potentially high performance under constraints and challenges driven by appli...
详细信息
ISBN:
(纸本)9781605584706
Configurable multiprocessor system is a promising design alternative because of its high degree of flexibility, short development time, and potentially high performance under constraints and challenges driven by applications. An important design challenge at 45nm for multi-core system is manufacturing process variation. Due to increasing concern of WID variation, designers will have to choose configurations of processing cores that maximize yield of the system while not affecting performance and throughput constraints. Due to interdependency between processor configuration selection and task allocation and its impact on yield and latency constraints, we tackle both problems simultaneously. In this paper, we propose the problem of task allocation and configuration selection for yield optimization. We prove the problem is NP-hard and propose an optimal pseudo-polynomial on Serial-Parallel graphs. We target streaming applications in pipelined reconfigurable multiprocessor systems. We provide a case study of configurable Leon processors as the cores implemented on FPGA. Results show that proposed problem could result in significant improvement of the timing yield of the system by exploiting extra slack on tasks. Copyright 2008 ACM.
The paper gives an extension to the existing approach to hardware/softwarecodesign of embedded systems by including the missed out network element commonly seen in the today's embedded systems. The approach tries...
详细信息
Formal performance analysis is now regularly applied in the design of distributed embedded systems such as automotive electronics, where it greatly contributes to an improved predictability and platform robustness of ...
详细信息
ISBN:
(纸本)9781605584706
Formal performance analysis is now regularly applied in the design of distributed embedded systems such as automotive electronics, where it greatly contributes to an improved predictability and platform robustness of complex networked systems. Even though it might be highly beneficial also in MpSoC design, formal performance analysis could not easily be applied so far, because the classical task communication model does not cover processor-memory traffic, which is an integral part of MpSoC timing. Introducing memory accesses as individual transactions under the classical model has shown to be inefficient, and previous approaches work well only under strict orthogonalization of different traffic streams. Recent research has presented extensions of the classical task model and a corresponding analysis that covers performance implications of shared memory traffic. In this paper we present a multithreaded multiprocessors platform and multimedia application. We conduct performance analysis using the new analysis options and specifically benchmark the quality of the available approach. Our experiments show that corner case coverage can now be supplied with a very high accuracy, allowing to quickly investigate architectural alternatives. Copyright 2008 ACM.
The heterogeneity of modern MPSoC architectures, coupled with the increasing complexity of the applications mapped onto them has recently led to a lot of interest in hybrid performance model-ing techniques. Here, the ...
详细信息
ISBN:
(纸本)9781605584706
The heterogeneity of modern MPSoC architectures, coupled with the increasing complexity of the applications mapped onto them has recently led to a lot of interest in hybrid performance model-ing techniques. Here, the idea is to apply different modeling and analysis techniques to different subsystems/components of an ar-chitecture/application. Such hybrid techniques often turn out to be more efficient and accurate compared to relying on a single analy-sis technique for the entire system. However, the challenge asso-ciated with this approach is to combine the different analysis re-sults effectively to obtain conservative performance estimates for the entire system. In this paper we study a hybrid scheme where certain system components are simulated (e.g. using instruction set simulators), whereas others are analyzed using a formal tech-nique called Real-Time Calculus (RTC). The main novelty of our approach stems from our use of this hybrid technique even for mul-tiple tasks mapped onto a single processing element. In contrast to this, previous approaches relied on either full simulation or RTC-based analysis for an entire architectural component (e.g. a proces-sor or a bus). The techniques we develop in this paper therefore al-low for both intra- and inter-processor hybrid performance model-ing and show how the different analysis results can be combined to efficiently obtain tight performance estimates for complex MPSoC architectures. We demonstrate the usefulness of this approach us-ing an MPEG-2 decoder application that is partitioned and mapped onto two processing elements connected by FIFO buffers. Copyright 2008 ACM.
This work presents a novel approach to hardware-software co-synthesis of distributed embedded systems, based on the developlmental genetic programming. Unlike the other genetic approaches where chromosomes represent s...
详细信息
ISBN:
(纸本)9783540858560
This work presents a novel approach to hardware-software co-synthesis of distributed embedded systems, based on the developlmental genetic programming. Unlike the other genetic approaches where chromosomes represent solutions, in our method chromosomes represent system construction procedures. Thus, not the system architecture but the co-synthesis process is evolved. Finally a tree describing a construction of a final solution is obtained. The optimization process will be illustrated with examples. According to Our best knowledge it is the first DGP approach that deals with the hardware-software co-synthesis.
Mechatronics is the practice of creating systems that synergize electrical, mechanical, and software technology. With few exceptions, testing software that is embedded in mechatronics systems has historically been don...
详细信息
ISBN:
(纸本)9781605580302
Mechatronics is the practice of creating systems that synergize electrical, mechanical, and software technology. With few exceptions, testing software that is embedded in mechatronics systems has historically been done only with the hardware in the loop (HIL). There are many disadvantages to HIL testing, including cost, schedule delays, and resource bottlenecks. Ironically, cost and schedule delays are also often seen by technical managers as impediments to simulating the mechatronics hardware. We present a vision for agile simulation, with the intent to lower the cost and time barriers of simulating mechatronics hardware to test software. Some key ideas include: focus on validating the software alone, rather than the entire system;doing the minimum amount of dynamic modeling needed to test the software;work even with legacy systems that might not have been designed with testability in mind;leverage existing tools wherever possible;support stubbing out the hardware using agile techniques such as refactoring and test-driven development. Copyright 2008 ACM.
The proceedings contain 19 papers. The topics discussed include: system level simulation of autonomic SoCs with TAPES;topology-aware replica placement in fault-tolerant embedded networks;design of gate array circuits ...
详细信息
ISBN:
(纸本)3540781528
The proceedings contain 19 papers. The topics discussed include: system level simulation of autonomic SoCs with TAPES;topology-aware replica placement in fault-tolerant embedded networks;design of gate array circuits using evolutionary algorithms;a hardware packet re-sequencer unit for network processors;dynamic reconfiguration of FlexRay schedules for response time reduction in asynchronous fault-tolerant networks;synthesis of multi-dimensional high-speed FIFOs for out-of-order communication;a novel routing architecture for field-programmable gate-arrays;a predictable simultaneous multithreading scheme for hard real-time;soft real-time scheduling on SMT processors with explicit resource allocation;and a hardware/softwarecodesign of a co-processor for real-time hyperelliptic curve cryptography on a Spartan3 FPGA.
Modern embedded system designers face challenges of unprecedented scales, creating systems that integrate functionality spanning disparate scientific domains, with increasing computation demands and ever-stricter powe...
ISBN:
(纸本)9781424424177
Modern embedded system designers face challenges of unprecedented scales, creating systems that integrate functionality spanning disparate scientific domains, with increasing computation demands and ever-stricter power requirements. Meeting the constraints of these systems requires practical design flows that reduce development time without sacrificing design efficiency. Novel design description methodologies coupled with automated and semi-automated synthesis paths greatly accelerate the design of modern hardwaresystems. In the software space, however, synthesis methods are far from producing co-designs with the necessary efficiency. This is particularly evident at the hardware/software boundary, where the tight coupling of low-level firmware routines and hardware protocols require designers to have deep design knowledge in both domains. To address this issue, we propose a latency-insensitive software execution model that allows direct connection to elastic hardware control topologies.
High-speed underwater acoustic communication with multi-carrier is used for real-time transmission with high data rate and low bit error rate, and it plays an important role in AUV. OFDM (orthogonal frequency division...
详细信息
ISBN:
(纸本)9781424421787
High-speed underwater acoustic communication with multi-carrier is used for real-time transmission with high data rate and low bit error rate, and it plays an important role in AUV. OFDM (orthogonal frequency division multiplexing) is an attractive multi-carrier communication method around the world. According to the requirements of high-speed underwater acoustic communication with multi-carrier, a system is designed for OFDM communication from hardware and software. The multiple processors with 4 ADSP-TS101s makes up of cluster/data flow associated multiprocessing parallel processing structure as the operation kernel. The receiver composed of multi-channel synchronous sample module and the transmitter with FIFO (first in first out) are designed and implemented And the FPGA is used to realize the logical control and data package. The methods of interrupts of DMA (direct memory access) and ha half-full are used to realize data transmission with high speed at background And the signal synthesis method, DDS (direct digital synthesis), is adopted to realize producing OFDM signal in real time. Through the experiments in a lake, the results show that the system had good stability and real-time processing capability, it can complete the OFDM signal transmission and receiving with high speed.
暂无评论