A neural network based motion-estimation technique is developed, that is applicable to sub-pixel as well as large movements. Experiments on typical test frame sequences indicate marked improvement in accuracy of motio...
详细信息
A neural network based motion-estimation technique is developed, that is applicable to sub-pixel as well as large movements. Experiments on typical test frame sequences indicate marked improvement in accuracy of motion vector estimates over the MPEG logarithmic block matching algorithm. The method utilizes a modified Hopfield neural network. Due to the neural network's fault-tolerant nature and parallel computation capability, fast, accurate, and reliable results are obtained. Application to MPEG based video compression is also discussed.< >
We describe a programming environment which is being developed for the automatic generation of parallel image processing applications. Through the use of model-based software synthesis, we transparently create large g...
详细信息
We describe a programming environment which is being developed for the automatic generation of parallel image processing applications. Through the use of model-based software synthesis, we transparently create large grained data parallel applications which can be executed on arbitrary processor networks. The high-level abstractions provided by the modeling paradigm isolates the user from the complexity of the underlying implementation, allowing developers with little or no experience in parallel programming to rapidly create parallel applications. The data parallel modeling facilities perform the same tasks as the data alignment and distribution compiler directives of High Performance Fortran and the aggregate objects of pC++. However, we have found that by introducing the parallelism on the system level, instead of in the algorithm, we can use traditional compilers and leave the application specific code unchanged. This allows us to take advantage of existing well developed image processing code libraries. Here we describe a system which generates data parallel versions of applications created in Khoros, the popular image processing package developed by the University of New Mexico. This system retains the best qualities of Khoros: its interactive and experimental nature, and its visual interface, but adds the capability for automatically generating much higher performance parallel implementations when needed. This system demonstrate the suitability of the model-based approach for developing parallel imaging software.< >
This paper describes 2x2 sorting nodes for implementing the parallel bitonic sort with optical holographic switching. The nodes perform maximum-switch, minimum-switch, or straight-through transfers of two input stream...
详细信息
This paper describes 2x2 sorting nodes for implementing the parallel bitonic sort with optical holographic switching. The nodes perform maximum-switch, minimum-switch, or straight-through transfers of two input streams. A 4x4 CMOS array has been fabricated using normal electrical interconnect between the GaAs optical transmitters and receivers and the CMOS array. A pipelined design for higher speed is also discussed.
The airflow around a 2D square rib placed within a boundary layer is computed by means of a massive parallel computer. The Navier-stokes equations are solved by a finite difference method with the well-known simulatio...
详细信息
The airflow around a 2D square rib placed within a boundary layer is computed by means of a massive parallel computer. The Navier-stokes equations are solved by a finite difference method with the well-known simulation method of ABMAC. The present paper deals with the implementation and optimization of this numerical method in connection with the massive parallel computer system, which is a MIMD (Multi-Instructions and Multi-Data stream) computing system consisting of up to 1024 processors, each with 16 Mbytes of local memory. parallelization of the algorithm is achieved by regional partitioning, i.e. grid partitioning;a grid system of 242 x 242 cells is divided into subdomains, and data belonging to each subdomain is distributed to each processor. Computations are carried out for different numbers of processors and for different algorithms of data exchange between the processors;the speedup ratio (i.e. increasing ratio of computing speed) and the efficiency of parallelization were measured. The results show that the communication method between individual processors and the size of the divided subdomains are rather crucial in their effects on granularity (i.e. the ratio of calculating time to communicating time for each processor).
Multigrid (MG) methods for partial differential equations (and for other important mathematical models in scientific computing) have turned out to be optimal on sequential computers. Clearly, one wants to apply them a...
详细信息
The hybrid-hash algorithm and its parallel variant have been recently found to outperform all other algorithms in joining disk-based large relations on uniprocessor and ring-interconnected distributed-memory multiproc...
详细信息
The data-flow model of computation is an attractive methodology for multiprocessor programming for it offers the potential for unlimited parallelism detection at no programmer's expense. It is here applied to a di...
详细信息
Numerical simulation of physical phenomena using realistic 3D models requires powerful computers. Because of discretization, such problems have gridstructure and the numerical processes are mainly local. As these num...
详细信息
ISBN:
(纸本)0818606371
Numerical simulation of physical phenomena using realistic 3D models requires powerful computers. Because of discretization, such problems have gridstructure and the numerical processes are mainly local. As these numerical problems are suitable for parallelcomputing, multiprocessor systems can be used. An architectural concept is described which allows for appropriate mapping of the problem structure, in particular multigrid problems, onto a multiprocessor structure. The basis of results from experimental multiprocessor systems, theoretical studies, and simulation experiments a high-performance multiprocessor system is proposed. The overall performance of the system will be 1 GFLOPS, 1-GB main memory, 38-GB mass storage, and 640-Mb/s data transfer from mass storage. A possible structure is given for the fast arithmetic coprocessor of the processor-memory-modules.
The volume contains 95 papers presented at the meeting, 72 of which are indexed separately. The papers are grouped under general topics that include modeling for finite element analysis, networks and distributed proce...
详细信息
The volume contains 95 papers presented at the meeting, 72 of which are indexed separately. The papers are grouped under general topics that include modeling for finite element analysis, networks and distributed processing, structural engineering applications, remote sensing, real-time applications, software quality assurance, mapping, railroad engineering applications, geotechnical applications, computer aided design methods, civil engineering software exchange, interactive graphics, hydrology and hydraulics applications, transportation systems applications, and others.
The volume contains 95 papers presented at the meeting, 72 of which are indexed separately. The papers are grouped under general topics that include modeling for finite element analysis, networks and distributed proce...
详细信息
The volume contains 95 papers presented at the meeting, 72 of which are indexed separately. The papers are grouped under general topics that include modeling for finite element analysis, networks and distributed processing, structural engineering applications, remote sensing, real-time applications, software quality assurance, mapping, railroad engineering applications, geotechnical applications, computer aided design methods, civil engineering software exchange, interactive graphics, hydrology and hydraulics applications, transportation systems applications, and others.
暂无评论