Dataflow architecture has shown its advantages in many high-performance computing cases. In dataflow computing, a large amount of data are frequently transferred among processing elements through the network-on-chip ...
详细信息
Dataflow architecture has shown its advantages in many high-performance computing cases. In dataflow computing, a large amount of data are frequently transferred among processing elements through the network-on-chip (NoC). Thus the router design has a significant impact on the performance of dataflow architecture. Common routers are designed for control-flow multi-core architecture and we find they are not suitable for dataflow architecture. In this work, we analyze and extract the features of data transfers in NoCs of dataflow architecture: multiple destinations, high injection rate, and performance sensitive to delay. Based on the three features, we propose a novel and efficient NoC router for dataflow architecture. The proposed router supports multi-destination; thus it can transfer data with multiple destinations in a single transfer. Moreover, the router adopts output buffer to maximize throughput and adopts non-flit packets to minimize transfer delay. Experimental results show that the proposed router can improve the performance of dataflow architecture by 3.6x over a state-of-the-art router.
We are witnessing the consolidation of the GPUs streaming paradigm in parallel computing. This paper explores stencil operations in CUDA to optimize on GPUs the Jacobi method for solving Laplace's differential equ...
详细信息
Short and efficient memory tests is the goal of every test designer. To reduce the cost of production tests, often a simple test which covers most of the faults, e.g. all simple (unlinked) faults, is desirable to elim...
详细信息
Short and efficient memory tests is the goal of every test designer. To reduce the cost of production tests, often a simple test which covers most of the faults, e.g. all simple (unlinked) faults, is desirable to eliminate most defective parts;a more costly test can be used thereafter to eliminate the remainder of the bad parts. Such a test-cost efficient approach is used by most manufacturers. In addition, system power-on tests are not allowed a long test time while a high fault coverage is desirable. The authors propose a new realistic fault model (the disturb fault model), and a set of tests for unlinked faults. These tests have the property of covering all simple (unlinked) faults at a very reasonable test time compared with existing tests.
Detecting traffic signs effectively under low-light conditions remains a significant challenge. To address this issue, we propose YOLO-LLTS, an end-to-end real-time traffic sign detection algorithm specifically design...
详细信息
This article consists of a collection of slides from the author's conference resentation on TRIPS, a distributed explicit data graph execution (EDGE) microprocessor. Some of the specific topics discussed include: ...
This work presents a feasible solution to the problem of book losses prediction from financial and general data in companies. The specific problem tackled in this work corresponds to a real dataset of Spanish companie...
详细信息
Data-parallel programs are both growing in importance and increasing in diversity, resulting in specialized processors targeted at specific classes of these programs. This paper presents a classification scheme for da...
详细信息
Pervasive computing is a field where many different entities come into play. In particular, development of useful applications in the field of pervasive healthcare requires dealing with substantial levels of complexit...
详细信息
ISBN:
(纸本)9789639799158
Pervasive computing is a field where many different entities come into play. In particular, development of useful applications in the field of pervasive healthcare requires dealing with substantial levels of complexity regarding the management of the pervasive environment where users and applications coexist. Service oriented middleware architectures simplify the task of creating those applications and turn the management of pervasive environments into a productive activity. We present our ongoing approach to the design of such a middleware architecture, known as the Smart Environment Application architecture.
Polymethyl methacrylate (PMMA) reinforced with boron nitride nanotubes (BNNT) at weight fill factors fw of 0.2 and 0.5 wt % are characterized by terahertz time-domain spectroscopy (THz-TDS). It is known that the intro...
详细信息
By means of the availability of mechanisms such as Dynamic Voltage and Frequency Scaling (DVFS) and heterogeneous architectures including processors with different power consumption profiles, it is possible to devise ...
详细信息
暂无评论