Digital convergence is precipitating the addition of soft real-time applications to mainstream desktop and server operating environments. Most traditional debuggers for mainstream systems lack a notion of temporal cor...
详细信息
Digital convergence is precipitating the addition of soft real-time applications to mainstream desktop and server operating environments. Most traditional debuggers for mainstream systems lack a notion of temporal correctness, making them unsuitable for real-time system design and analysis. We propose leveraging complete system simulation to build a temporal debugger which is capable of analyzing mixed real-world workloads. Traditional real-time system debuggers based on simulation utilize slow, but accurate, simulators. Complete system simulators accept an approximate model of time in exchange for higher performance. the higher performance allows these simulators to analyze high-end commercial operating systems and applications. We describe a temporal debugger design based on complete system simulation and report on some early experiences in analyzing a simple workload. the tool offers a non-intrusive, predictable environment for debugging complex workloads with partial real-time constraints. the simulator foundation allows for interactive debugging of time-critical sequences while preserving a model of execution time flow.
In this paper, a new integration scheme with multilayer perceptron (MLP) networks is proposed to solve handwritten Chinese character recognition problem. the idea of meta-synthesis is emphasized in this scheme, human ...
详细信息
ISBN:
(纸本)0769507506
In this paper, a new integration scheme with multilayer perceptron (MLP) networks is proposed to solve handwritten Chinese character recognition problem. the idea of meta-synthesis is emphasized in this scheme, human intelligence and computer capabilities are combined together through a procedure of two-step supervised learning. Compared with previous integration schemes, this scheme has much better performance and provides a promising way of applying MLP to large vocabulary classification.
In this paper we outline a fully parallel and locally connected computation model for the spatio-temporal segmentation of motion events in video sequences. We are searching for a new algorithm, which can be easily imp...
详细信息
ISBN:
(纸本)0769507506
In this paper we outline a fully parallel and locally connected computation model for the spatio-temporal segmentation of motion events in video sequences. We are searching for a new algorithm, which can be easily implemented in one-pixel/one-processor cell-array VLSI architectures at high-speed. Our proposed algorithm starts from an oversegmented image, then the segments are merged by applying the information coming from the spatial and temporal auxiliary data: motion fields and motion history, which is calculated from consecutive image frames. this grouping process is defined through a similarity measure of neighboring segments, which is based on the values of intensity, speed and the time-depth of motion history. As for checking the merging process there is a feedback implemented, by that we can accept or refuse the cancellation of a segment-border. Our parallel approach is independent of the number of segments and objects, since instead of graph representation and serial processing of these components, image features are defined on the pixel-level. We use simple functions, easily realizable in VLSI, like arithmetic and logical operators, local memory transfers and convolution.
It is shown that, under a local clustering condition, a set of points of a given class, embedded in binary space by a set of randomly parameterized surfaces, is linearly separable from other classes, with arbitrarily ...
详细信息
ISBN:
(纸本)0769507506
It is shown that, under a local clustering condition, a set of points of a given class, embedded in binary space by a set of randomly parameterized surfaces, is linearly separable from other classes, with arbitrarily high probability. We call such a data set a local relative cluster. the size of the embedding set is linear in the input dimension and inversely proportional to the squared local clustering degree. A simple parameterization by embedding hyperplanes leads to the separation of multi-cluster data by a network with two internal layers. the computational complexity is linear in the number of relative clusters in the data. this represents a considerable reduction of the learning problem with respect to known techniques, resolving a long-standing question on the complexity of random embedding. Numerical tests show that the proposed method performs as well as state-of the-art methods, in a small fraction of the time.
All-to-all communication is one of the most dense communication patterns and occurs in many important applications in parallel computing. In this paper, we present a new all-to-all broadcast algorithm in all-port mesh...
详细信息
ISBN:
(纸本)0769500048
All-to-all communication is one of the most dense communication patterns and occurs in many important applications in parallel computing. In this paper, we present a new all-to-all broadcast algorithm in all-port mesh and torus networks. Unlike existing all-to-all broadcast algorithms, the new algorithm takes advantage of overlapping of message switching time and transmission time, and achieves optimal transmission time for all-to-all broadcast. In addition, in most cases, the total communication delay is close to the lower bound of all-to-ail broadcast within a small constant range. Finally, the algorithm is conceptually simple, and symmetrical for every message and every node so that it can be easily implemented in hardware and achieves the optimum in practice.
Taking advantage of the Grid for high-performance scientific computing requires a software framework layered above the Grid that supports a new style of programming, component-based programming. the component architec...
详细信息
Taking advantage of the Grid for high-performance scientific computing requires a software framework layered above the Grid that supports a new style of programming, component-based programming. the component architecture toolkit (CAT) facilitates such programming by providing a conceptually simple `port-based' component model together with a suite of developer tools for incorporating existing scientific codes into the framework, as well as a set of end user tools for locating, composing, building and running distributed component application over the Grid.
In this study, the impact of a variety of features provided by emerging system-area networks on the implementation of Software Distributed Shared Memory (SDSM), and on the Cashmere system in particular, is evaluated. ...
详细信息
In this study, the impact of a variety of features provided by emerging system-area networks on the implementation of Software Distributed Shared Memory (SDSM), and on the Cashmere system in particular, is evaluated. the results suggest that for systems of modest size, low latency is much more important for SDSM performancethan are remote writes, broadcast, or total ordering. In addition, results on an emulated 32-node system indicate that broadcast based on remote writes of widely-shared data may improve performance by up to 51% for some applications.
Seamless access to different systems of different vendors at different sites is an important prerequisite to effective and efficient use of distributed resources. Learning about new systems, new software, and new inte...
详细信息
Seamless access to different systems of different vendors at different sites is an important prerequisite to effective and efficient use of distributed resources. Learning about new systems, new software, and new interfaces is a time-consuming task for users who actually want to run their applications. UNICORE is a project to overcome these difficulties by providing a uniform interface for job preparation and control which gives seamless and secure access to supercomputer resources. It is an ambitious project delivering a production ready prototype within two years. the presentation will focus on the UNICORE architecture, especially the protocol and the underlying security mechanisms.
this paper describes work in progress to develop a standard for interoperability among high-performance scientific components. this research stems from growing recognition that the scientific community needs to better...
详细信息
this paper describes work in progress to develop a standard for interoperability among high-performance scientific components. this research stems from growing recognition that the scientific community needs to better manage the complexity of multidisciplinary simulations and better address scalable performance issues on parallel and distributed architectures. Driving forces are the need for fast connections among components that perform numerically intensive work and for parallel collective interactions among components that use multiple processes or threads. this paper focuses on the areas we believe are most crucial in this context, namely, an interface definition language that supports scientific abstractions for specifying component interfaces and a ports connection model for specifying component interactions.
RSD (Resource and Service Description) is a software architecture for specifying, registering and accessing resources and services in complex heterogeneous computing environments. It has three major components: a grap...
详细信息
RSD (Resource and Service Description) is a software architecture for specifying, registering and accessing resources and services in complex heterogeneous computing environments. It has three major components: a graphical resource editor, a compiler system for translating textual specifications, and an application programming interface. In this paper we focus on the language definition and the compiler system that generates portable data objects for registering and accessing distributed resources. We also present the mechanisms used by RSD to maintain information on the status of dynamic resources such as CPU load or network bandwidth. RSD is a purely descriptive tool that provides up-to-date information on system resources. Other tools, like resource brokers, mappers or job migrators, build on it.
暂无评论