In this paper, we have presented an efficient single layer ANN structure called functional link ANN (FLANN) for the prediction of short time load demand. In contrast to a feed forward ANN structure, i.e. a multilayer ...
详细信息
In this paper, we have presented an efficient single layer ANN structure called functional link ANN (FLANN) for the prediction of short time load demand. In contrast to a feed forward ANN structure, i.e. a multilayer perceptron (MLP), the FLANN is basically a single layer structure in which non-linearity is introduced by enhancing the input pattern with nonlinear function expansion. In FLANN structure the need of a hidden layer is eliminated and it requires much less computation than that of MLP. In this paper three different functional expansions have been applied. From the simulation studies, it is clear that with the Chebyshev functional expansion, the FLANN network performs best among other competing networks for this challenging and interesting problem. Performance comparisons of the all the networks were carried out through extensive computersimulations.
Cyber-Physical systems (CPSs) as advanced embeddedsystems integrating computation with physical process are increasingly penetrating into our life. modeling and analysis for such systems closely involved with us are ...
详细信息
Cyber-Physical systems (CPSs) as advanced embeddedsystems integrating computation with physical process are increasingly penetrating into our life. modeling and analysis for such systems closely involved with us are actively researched. A current challenging problem is how to take advantages of existing technologies like SysML/MARTE, Modelica and Statistical Model Checking (SMC) through effective integration. Moreover, the lack of efficient methodologies or tools for modeling and analysis of CPSs makes the gap between design and analysis models hard to bridge. To solve these problems, we present a framework named Modana to achieve an integrated process from modeling with SysML/MARTE to analysis with SMC for CPSs in terms of Non Functional Properties (NFP) such as time, energy, etc. Functional Mock-up Interface (FMI), as a connecting link between modeling and analysis, plays a major role in coordinating various tools for co-simulation to generate traces as the input of statistical model checker. To demonstrate the capability of Modana framework, we model energy-aware buildings as a case study, and discuss the analysis on energy consumption in different scenarios.
Biologically accurate neuron simulations are increasingly important in research related to brain activity. They are computationally intensive and feature data and task parallelism. In this paper, we present a case stu...
详细信息
ISBN:
(纸本)9781479937707
Biologically accurate neuron simulations are increasingly important in research related to brain activity. They are computationally intensive and feature data and task parallelism. In this paper, we present a case study for the mapping of a biologically accurate inferior-olive (InfOli), neural cell simulator on an many-core research platform. The Single-Chip Cloud computer (SCC) is an experimental processor created by Intel Labs. The target neurons provide a major input to the cerebellum and are involved in motor skills and space perception. We exploit task-and data-partitioning, scaling the simulation over more than 40,000 neurons. The voltage-and frequency-scaling capabilities of the chip are explored, achieving more than 20% energy savings with negligible performance degradation. Four platform configurations are evaluated and a mapping with balanced workload and constant voltage and frequency is formally derived as optimal.
This paper describes efficient, scalable parallel implementations of the conventional cell list method and a modified cell list method to calculate the total system intermolecular Lennard-Jones force interactions in t...
详细信息
ISBN:
(纸本)9781479989386
This paper describes efficient, scalable parallel implementations of the conventional cell list method and a modified cell list method to calculate the total system intermolecular Lennard-Jones force interactions in the Monte Carlo Gibbs ensemble. We targeted this part of the Gibbs ensemble for optimization because it is the most computationally demanding part of the force interactions in the simulation, as it involves all the molecules in the system. The modified cell list implementation reduces the number of particles that are outside the interaction range by making the cells smaller, thus reducing the number of unnecessary distance evaluations. Evaluation of the two cell list methods is done using a hybrid MPI+OpenMP approach and a hybrid MPI+CUDA approach. The cell list methods are evaluated on a small cluster of multicore CPUs, Intel Phi coprocessors, and GPUs. The performance results are evaluated using different combinations of MPI processes, threads, and problem sizes.
In this paper we will present a computational model of area CA3b in Hippocampus and through simulations show that this system is capable of sustaining a rhythm without any external input using only a small portion of ...
详细信息
ISBN:
(纸本)9781479937707
In this paper we will present a computational model of area CA3b in Hippocampus and through simulations show that this system is capable of sustaining a rhythm without any external input using only a small portion of its population. The model proposed here was devised using the biological data available on CA3. Our simulations have shown that the rhythmic emergent behaviour is sustainable and that we can alter its characteristics (i.e. spectrum) by altering the overall physical characteristics of the network such as the overall size of the structure itself or the extent of the connectivity field.
Recent design methodologies and tools aim at enhancing the design productivity by providing a software development platform before defining the final MPSoC architecture details. However, the simulation can only be eff...
详细信息
ISBN:
(纸本)9781479937707
Recent design methodologies and tools aim at enhancing the design productivity by providing a software development platform before defining the final MPSoC architecture details. However, the simulation can only be efficiently performed when using a modeling and simulation engine that supports the system behavior description in a high abstraction level. The lack of MPSoC virtual platform prototyping integrating both scalable hardware and software in order to create and evaluate new methodologies and tools motivated us to develop MPSoCBench. This toolset is a scalable set of MPSoCs including four different ISAs (PowerPC, MIPS, SPARC, and ARM) organized in platforms with 1, 2, 4, 8, 16, 32, or 64 cores, cross-compilers, IPs, interconnections, and 17 parallel version of software from well-known benchmarks. This tool also provides power consumption estimation for MIPS and SPARC processors. The MPSoCBench sums 864 different configurations automated through scripts.
This paper describes a recursive algorithm for constructing a general Space-Filling Curve (SFC) for an arbitrary distribution of points in 2D. We use the SFC to partition 2D meshes, both structured and unstructured, a...
详细信息
This paper describes a recursive algorithm for constructing a general Space-Filling Curve (SFC) for an arbitrary distribution of points in 2D. We use the SFC to partition 2D meshes, both structured and unstructured, and compare the quality of partitions with traditional SFCs and the multilevel partitioning schemes of Metis and Scotch. The algorithm is independent of the geometry of the mesh and can be easily adapted to irregular meshes. We discuss the advantages of SFCs over multilevel partitioners for meshes in scientific simulations. We define three performance metrics for a reasonable comparison of partitions: volume or load per partition, degree or the number of distinct edges of a partition in the communication graph and communication volume or the sum of the weights of outgoing edges for each partition in the communication graph. We propose a performance model for modern architectures using these metrics. We find our partitions comparable to and in some cases better than the best multilevel partitions, while being computed much faster. Unlike Metis, our hierarchical approach yields good hierarchical partitions (e.g., for partitioning to node and core level), and is appropriate for adaptive mesh refinement kernels.
With the advent of diverse enabling technologies, brain-related research has, in recent years, been seriously amplified and has already started yielding impressive findings across various fronts. With respect to compu...
详细信息
Heterogeneous system architectures are becoming more and more of a commodity in the scientific community. While it remains challenging to fully exploit such architectures, the benefits in performance and hybrid speed-...
详细信息
Heterogeneous system architectures are becoming more and more of a commodity in the scientific community. While it remains challenging to fully exploit such architectures, the benefits in performance and hybrid speed-up, by using a host processor and accelerators in parallel in a non-monolithic matter, are significant. Hereby, the energy efficiency is becoming an increasingly critical challenge for future high-performance computing (HPC) systems, which do want to exceed the Exascale barrier with several competing architecture concepts ranging from high-performance CPUs, combined with GPUs acting as floating-point accelerators, to computationally weak CPUs, paired with dedicated and highly-perform ant FPGA-based accelerators. In this paper, we realize and evaluate a hybrid computing approach based on a two-dimensional seismic streaming algorithm with several heterogeneous system architectures, including conventional HPC approaches based on powerful CPUs and GPUs. Furthermore, we elaborate the effort on an embedded system platform claiming to be a "mini supercomputer" [1]. Several CPU and accelerator combinations are utilized in a manual work-sharing manner with the aim of achieving significant performance speed-ups and a detailed energy-efficiency study. Based on roofline models and experimental evaluations, the paper provides an insight into the fact that hybrid computing is mostly unconditionally beneficial for balanced systems regarding the performance as well as the energy efficiency, aiding the programmer in the decision whether or not costly, manually tuned, homogeneous implementations are worthwhile.
This paper studies the schedulability of real-time tasks in the Priority-based Functional Reactive Programming (P-FRP) model under fixed priority scheduling, one of the influential scheduling policies. Since the abort...
详细信息
This paper studies the schedulability of real-time tasks in the Priority-based Functional Reactive Programming (P-FRP) model under fixed priority scheduling, one of the influential scheduling policies. Since the abort-and-restart execution paradigm of the P-FRP model is different from that of the classic pre-emptive model, the schedulability analysis for P-FRP tasks under fixed priority scheduling differs widely. In P-FRP, for a synchronous n-task set under fixed priority scheduling, the least common multiple (LCM) of all n task periods is the typical length of a testing interval for an exact (necessary and sufficient) schedulability test. In this paper, we propose and prove an optimal simulation based exact schedulability test in the P-FRP model under fixed priority scheduling for a given priority order, covering scenarios from synchronous task release to asynchronous task release with the initial busy condition, and from implicit deadlines to constrained deadlines. The length of a testing interval for the exact test is the LCM of the first n-1 task periods and its optimality is proved.
暂无评论