The authors describe GGI, a visual system that allows the user to execute an automatically generated dataflow graph containing code modules that perform natural language processing tasks. These code modules operate o...
详细信息
The authors describe GGI, a visual system that allows the user to execute an automatically generated dataflow graph containing code modules that perform natural language processing tasks. These code modules operate on text documents. GGI has a suite of text visualisation tools that allows the user useful views of the annotation data that is produced by the modules in the executable graph. GGI forms part of the GATE natural language engineering system.
Many signal processing systems use event driven mechanisms-typically based on finite state machines (FSMs)-to control the operation of computationally intensive (dataflow) parts. The state machines in turn are often ...
详细信息
Many signal processing systems use event driven mechanisms-typically based on finite state machines (FSMs)-to control the operation of computationally intensive (dataflow) parts. The state machines in turn are often fueled by external inputs as well as by feedback from the signal processing portions of the system. Packet-based transmission systems are a good example for such a close interaction between data and control flow. For an efficient design flow it is of crucial importance to be able to model and analyze the complete functionality of the system within one single design environment. Therefore, we developed a computational model that integrates the specification of control and dataflow by combining the notion of dataflow graphs with event driven process activation.
The need to partition computation across multiple programmable devices in array architecture CCMs leads to performance bottlenecks in dataflow through the computer and wiring delays between adjacent devices. However,...
详细信息
The need to partition computation across multiple programmable devices in array architecture CCMs leads to performance bottlenecks in dataflow through the computer and wiring delays between adjacent devices. However, significant improvements in FPGA capacities have brought one to a threshold where direct inter-chip connections are not required because an entire algorithm can be implemented on a single device for important problems in areas such as image processing. One can now implement architectures that are similar to today's parallel computers in which interprocessor communication is done through shared memory or dedicated communication hardware. The benefits of this approach are system-wide scalability and flexibility. The authors illustrate this new style of CCM with examples from image processing, in particular a novel FPGA implementation of block motion estimation (as for MPEG encoding). Based on the lessons learned from these specific examples, they generalize and speculate on implications for new CCM architectures.
We present an approach allowing communication optimization during the hardware-software partitioning task. Our methodology focuses on systems represented by a dataflow graph whose nodes are elements of libraries. To a...
详细信息
ISBN:
(纸本)9780818678950
We present an approach allowing communication optimization during the hardware-software partitioning task. Our methodology focuses on systems represented by a dataflow graph whose nodes are elements of libraries. To abstract the communication constraints, we include communication nodes in this graph. Consequently assignment and scheduling of communications and operations can be determined together by the same partitioning algorithm. During partitioning, protocol optimization and bus scheduling are realized. We illustrate with a telecommunication system example the feasibility and the usefulness of our methodology.
The wormhole run-time reconfiguration (RTR) computing paradigm is a method for creating high performance computational pipelines. The scalability, distributed control and dataflow features of the paradigm allow it to...
详细信息
The wormhole run-time reconfiguration (RTR) computing paradigm is a method for creating high performance computational pipelines. The scalability, distributed control and dataflow features of the paradigm allow it to fit neatly into the configurable computing machine (CCM) domain. To date, the field has been dominated by large bit-oriented devices whose flexibility can lead to lowered silicon utilization efficiencies. In an effort to raise this efficiency, the Colt CCM has been created based on the wormhole RTR paradigm. This paper outlines methods of implementation and performance for several common operations using these concepts. They serve as indicators of the diversity of algorithms that can be instantiated through the high-speed run-time reconfiguration that these devices make possible. Particular attention is paid to floating point multiplication. Also discussed is the topic of data dependent computation which would seem to be counter intuitive to the wormhole RTR paradigm. The paper concludes with a summary of performance of the three computations.
Movement estimation of the temporomandibular joint from sequential images of X-ray television comprises two procedures: (1) computation of the velocity field (optical flow) from two successive images; and (2) estimati...
详细信息
Movement estimation of the temporomandibular joint from sequential images of X-ray television comprises two procedures: (1) computation of the velocity field (optical flow) from two successive images; and (2) estimation of the velocity and the angular velocity (motion parameters) from data of the optical flow. The mandibular head of the X-ray image is observed merely as its blurred profile, superimposed by images of cranial bones. In order to achieve the movement tracking in such image data of inferior quality, the authors developed an accurate estimation of the optical flow using the Gabor wavelet transform, and an iterative estimation of the motion parameters. Through experiments employing an actual image-sequence of the TMJ, the authors' methodology was verified to be effective for improving the estimation accuracy of the movement, thereby the successful tracking of the mandibular head being achieved.
Measurement time is of the essence for most ground test facilities, so measurement techniques capable of measuring planes of data are much more attractive than those which measure only single points. Many techniques a...
详细信息
Measurement time is of the essence for most ground test facilities, so measurement techniques capable of measuring planes of data are much more attractive than those which measure only single points. Many techniques already exist which allow for nonintrusive measurements of pressure, temperature and density; several of which are two-dimensional by design. Holography and holographic interferometry are well established techniques; however, for the most part they provide qualitative data. A computer program for achieving quantitative two-dimensional or axis-symmetric density maps using a phase-shifted holographic interferometer is currently being developed. Several phase unwrapping and data reduction methods were investigated in an effort to produce a computer program capable of correctly analyzing a flow field. Several test holographic interferograms of a candle flame and a heat gun, made in a laboratory setting were used to validate various aspects of the software algorithms. A discussion of the algorithms investigated and the determination of which algorithm best analyzes the flowfield structures will be presented.
Vis-à-Web was designed as a distributed visualization software on the word-wide-web (WWW). Using Vis-à-Web, owners of visualization algorithms may offer their visualization tool as a service on the WWW witho...
详细信息
Vis-à-Web was designed as a distributed visualization software on the word-wide-web (WWW). Using Vis-à-Web, owners of visualization algorithms may offer their visualization tool as a service on the WWW without giving away their code. Users can access such a service as simply as ordinary WWW-pages. data may be processed on a variety of servers and the resulting visualization may be combined to a single complex scene. A prototype application using two distributed servers at the DLR in Germany and the TU Delft in the Netherlands was successfully demonstrated.
This paper examines the implementation of pipelined applications using run-time reconfiguration. Throughput and latency of pipelined applications can be significantly improved when reconfiguration is performed at the ...
详细信息
This paper examines the implementation of pipelined applications using run-time reconfiguration. Throughput and latency of pipelined applications can be significantly improved when reconfiguration is performed at the level of individual pipeline stages, as opposed to configuration of the entire FPGA. If reconfiguration and execution can be performed simultaneously, the performance of a pipelined application approaches its theoretical maximum. This paper proposes a new FPGA configuration mechanism, called striping, that supports pipeline stage reconfiguration and simultaneous configuration and execution. Additionally, the use of the pipeline stage as the atomic unit of reconfiguration introduces a design abstraction that enables the development families of upwardly-compatible FPGAs and virtual hardware design.
暂无评论