Many dynamic reconfigurable processors are evolving rapidly that offer fast and run-time reconfiguration. To efficiently utilize run-time reconfiguration, we must consider additional memory access cycles due to reconf...
详细信息
Many dynamic reconfigurable processors are evolving rapidly that offer fast and run-time reconfiguration. To efficiently utilize run-time reconfiguration, we must consider additional memory access cycles due to reconfiguration. In general, dynamic reconfigurable processors read input data from external memory after reconfiguration and write output data to external memory before the next reconfiguration. To realize highperformance, we must schedule tasks considering the total execution cycles that include reconfiguration time and memory access overhead. In this paper, we propose a parameterized reconfigurable processor model (PRP-model) and a task partitioning optimization algorithm for the architecture exploration of multicontext dynamic reconfigurable processors. The proposed task partitioning algorithm corresponds to various dynamic reconfigurable architectures by changing parameters and can realize the evaluation of a lot of reconfigurable architectures. Experimental results showed that the proposed algorithm can quickly find near optimal solutions. Using the proposed algorithm, designers can easily evaluate the performance of various dynamic reconfigurable architectures for specific applications
3D surface matching is a fundamental issue in computer vision with many applications such as shape registration, 3D object recognition and classification. However, surface matching with noise, occlusion and clutter is...
详细信息
3D surface matching is a fundamental issue in computer vision with many applications such as shape registration, 3D object recognition and classification. However, surface matching with noise, occlusion and clutter is a challenging problem. In this paper, we analyze a family of conformal geometric maps including harmonic maps, conformal maps and least squares conformal maps with regards to 3D surface matching. As a result, we propose a novel and computationally efficient surface matching framework that uses least squares conformal maps. According to conformal geometry theory, each 3D surface with disk topology can be mapped to a 2D domain through a global optimization and the resulting map is a diffeomorphism, i.e., one-to-one and onto. This allows us to simplify the 3D surface-matching problem to a 2D image-matching problem, by comparing the resulting 2D conformal geometric maps, which are stable, insensitive to resolution changes and robust to occlusion and noise. Therefore, highly accurate and efficient 3D surface matching algorithms can be achieved by using conformal geometric maps. Finally, the performance of conformal geometric maps is evaluated and analyzed comprehensively in 3D surface matching with occlusion, noise and resolution variation. We also provide a series of experiments on real 3D face data that achieve high recognition rates.
We introduce some methods for constrained nonlinear programming that are widely used in practice and that are known under the names SQP for sequential quadratic programming and SCP for sequential convex programming. I...
详细信息
ISBN:
(纸本)1402077602
We introduce some methods for constrained nonlinear programming that are widely used in practice and that are known under the names SQP for sequential quadratic programming and SCP for sequential convex programming. In both cases, convex subproblems are formulated, in the first case a quadratic programming problem, in the second case a separable nonlinear program in inverse variables. The methods are outlined in a uniform way and the results of some comparative performance tests are listed. We especially show the suitability of sequential convex programming methods to solve some classes of very large scale nonlinear programs, where implicitly defined systems of equations seem to support the usage of inverse approximations. The areas of interest are structural mechanical optimization, i.e., topology optimization, and optimal control of partial differential equations after a full discretization. In addition, a few industrial applications and case studies are shown to illustrate practical situations under which the codes implemented by the authors are in use.
Distributed Genetic algorithms (DGAs) designed for the Internet have to take its high communication cost into consideration. For island model GAs, the migration topology has a major impact on DGA performance. This pap...
详细信息
ISBN:
(纸本)1595930108
Distributed Genetic algorithms (DGAs) designed for the Internet have to take its high communication cost into consideration. For island model GAs, the migration topology has a major impact on DGA performance. This paper describes and evaluates an adaptive migration topology optimizer that keeps the communication load low while maintaining high solution quality. Experiments on benchmark problems show that the optimized topology outperforms static or random topologies of the same degree of connectivity. The applicability of the method on real-world problems is demonstrated on a hard optimization problem in VLSI design.
We have implemented two hardware accelerated Thin Plate Spline (TPS) warping algorithms. The first algorithm is a hardware-software approach (HW-TPS) that uses OpenGL Vertex Shaders to perform a grid warp. The second ...
详细信息
ISBN:
(纸本)0819457183
We have implemented two hardware accelerated Thin Plate Spline (TPS) warping algorithms. The first algorithm is a hardware-software approach (HW-TPS) that uses OpenGL Vertex Shaders to perform a grid warp. The second is a Graphics Processor based approach (GPU-TPS) that uses the OpenGL Shading Language to perform all warping calculations on the GPU. Comparison with a software TPS algorithm was used to gauge the speed and quality of both hardware algorithms. Quality was analyzed visually and using the Sum of Absolute Difference (SAD) similarity metric. Warping was performed using 92 user-defined displacement vectors for 512x512x173 serial lung CT studies, matching normal-breathing and deep-inspiration scans. On a Xeon 2.2 Ghz machine with an ATI Radeon 9800XT GPU the GPUTPS required 26.1 seconds to perform a per-voxel warp compared to 148.2 seconds for the software algorithm. The HWTPS needed 1.63 seconds to warp the same study while the GPU-TPS required 1.94 seconds and the software grid transform required 22.8 seconds. The SAD values calculated between the outputs of each algorithm and the target CT volume were 15.2%, 15.4% and 15.5% for the HW-TPS, GPU-TPS and both softwarealgorithms respectively. The computing power of ubiquitous 3D graphics cards can be exploited in medical image processing to provide order of magnitude acceleration of nonlinear warping algorithms without sacrificing output quality.
Reliable measurements and parameter estimates are required in order to properly apply Model Based Control in industrial facilities. Powerful algorithms have been described in the literature for nonlinear Dynamic Data ...
详细信息
ISBN:
(纸本)9781629934693
Reliable measurements and parameter estimates are required in order to properly apply Model Based Control in industrial facilities. Powerful algorithms have been described in the literature for nonlinear Dynamic Data Reconciliation (NDDR) of chemical processes based on nonlinear programming techniques. In some instances, these have been shown to be more reliable than the commonly used Extended Kalman Filtering (EKF) approach. Often, efficient NDDR algorithms require the provision of analytical derivatives and are difficult to apply, requiring a high degree of coding. Here we present and evaluate, using a simple chemical reactor model, an easy to use NDDR methodology applying three general purpose nonlinear programming codes within Matlab environment. The NDDR problem is tackled using the simultaneous optimization solution approach, which has been shown to be more efficient than the sequential approach. The performance of the NDDR algorithm has been compared with that of a second order nonlinear filter and practical guidelines to simplify the application of these algorithms in real time are provided.
N-version Programming (NVP) is a programming approach to fault tolerant software systems employs functionally equivalent, yet independently developed software components. Generally, an optimization model for NVP is us...
详细信息
In this work we describe three sequential algorithms and their parallel counterparts for solving nonlinear systems, when the Jacobian matrix is symmetric and positive definite. This case appears frequently in unconstr...
详细信息
Reconfigurable computing has already confirmed a significant potential for accelerating certain computing tasks. However, the most successful applications relied on user expertise to design a specific architecture imp...
详细信息
ISBN:
(纸本)0819458325
Reconfigurable computing has already confirmed a significant potential for accelerating certain computing tasks. However, the most successful applications relied on user expertise to design a specific architecture implemented by the hardware structures of the reconfigurable computing device. Hence, one of the most challenging issues is to map, efficiently and automatically, computations (described in software programming languages) to reconfigurable computing devices. This paper presents CHIADO, a research project aiming a compiler framework to map efficiently software programs to reconfigurable computing platforms, especially the ones based on FPGA (Field-Programmable Gate Array) devices. The framework is also intended to support research of new optimization techniques. The project, based on our previous work on compiling Java bytecodes to FPGAs, focuses on high-performance solutions, schemes to estimate the impact of some transformations supported by the compiler (partial/full loop unrolling), and schemes to take advantage of dynamic reconfiguration (e.g., temporal partitioning). This paper gives an overview about the CHIADO project, shows the framework, and enumerates the main project goals.
An improved orthogonal multi-objective evolutionary algorithm (OMOEA), called OMOEA-II, is proposed in this paper. Two new crossovers used in OMOEA-II are orthogonal crossover and linear crossover. By using these two ...
详细信息
ISBN:
(纸本)3540249834
An improved orthogonal multi-objective evolutionary algorithm (OMOEA), called OMOEA-II, is proposed in this paper. Two new crossovers used in OMOEA-II are orthogonal crossover and linear crossover. By using these two crossover operators, only small orthogonal array rather than large orthogonal array is needed for exploiting optimal in the global space. Such reduction in orthogonal array can avoid exponential creation of solutions of OMOEA and improve the performance in robusticity without degrading precision and distribution of solutions. Experimental results show that OMOEA-II can solve problems with high dimensions and large number of local Pareto-optimal fronts better than some existing algorithms recently reported in the literatures.
暂无评论