Parallelism can be introduced into computers in a number of ways and at several levels within the processing operation. These include pipelining, array or vector processing, parallel processing of data, and multiple processors. Software algorithms are being reformulated to exploit more fully the potential of parallel computers.
We propose a new computational method for wavefront shaping holograms that is suitable for laser slicing of SiC crystals. This method enables multi-beam parallel processing and conducts aberration correction simultaneously, thereby ensuring processing quality while enhancing processing efficiency. It applies to wavefront shaping utilizing SLM. Grounded in the GSW method, while accomplishing laser beam splitting, it employs optical path compensation to correct the aberration arising from refraction when the laser beam is incident from air into the medium. The principle and algorithm of holograms for wavefront shaping are elaborated. Additionally, we analyze the effects of different design parameters, materials, and objective lens parameters on the holograms and provide a method for determining the optimal parameters. Finally, the feasibility of this method is verified through simulation and experiment. The results of slicing experiments on small-area n-type SiC samples show that the damage is less than 25 μm, the slicing time for a 6-inch ingot is 16 min, and the loss is less than 50 μm.
As a computer-integrated manufacturing system, cluster tools are widely used for semiconductor manufacturing. To tackle their scheduling problems with parallel processing chambers, existing studies assume that the parallel processing chambers at each step are identical. However, due to machine aging, some differences among them could appear in reality. This work is the first to report the scheduling problems of cluster tools with non-identical parallel processing chambers. By focusing on time-constrained single- and dual-armed cluster tools with non-identical parallel processing chambers, it derives the sufficient conditions under which a system is schedulable. Based on them, it develops two algorithms to obtain the feasible and optimal schedules. It also develops two linear programming models to find the feasible and optimal ones if the sufficient conditions are violated. The experimental results based on 5400 random instances show that the proposed scheduling method can be readily used to generate schedules for both time-constrained single- and dual-armed cluster tools.
ISBN (digital): 9798331542375
ISBN (print): 9798331542382
Meta-heuristic techniques have been popular in solving highly complex optimization problems. Parallel processing is very important in signal representation and real-time analysis, as it helps lower the time required for data processing so that algorithms can perform faster. The biggest challenge with the traditional parallel processing techniques is load balancing and higher communication overhead, which may need to scale up to a larger extent. Meta-heuristic techniques reduce the complexity of problems that are NP-complete or NP-hard by providing satisfactory solutions and approximate answers to difficult ones. This has helped in parallel processing for signal representation and real-time analysis. Parallel processing has also been useful in using meta-heuristic algorithms like genetic algorithms, ant colony optimization, and particle swarm optimization. Meta-heuristic algorithms produce flexible, adaptive mechanisms for solving complex problems.
A binarized neural-network (BNN) accelerator macro is developed based on a processing-in-memory (PIM) architecture having the ability of eight-parallel multiply-accumulate (MAC) processing. The parallel-processing PIM macro, referred to as a PPIM macro, is designed to perform the parallel processing with no use of multiport SRAM cells and to achieve the energy minimum point (EMP) operation for inference. The proposed memory array in the PPIM macro is configured with single-port Schmitt-trigger-type cells just by adding multiple bit lines with spatial address mapping modulation, resulting in a highly area-efficient cell array. The EMP operation of the developed PPIM macro can maximize the energy efficiency. As a result, an energy efficiency higher than 100 tera-operations-per-second per Watt (TOPS/W) can be achieved at around the EMP voltage. The EMP operation is also beneficial for enhancing the processing performance [measured in units of tera-operations per second (TOPS)] of the macro. The performance of fully connected-layer (FCL) networks configured with a multiple of the PPIM macro is also demonstrated.
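The abstract above does not say how each multiply-accumulate is computed. As a minimal sketch, assuming the common BNN formulation in which ±1 values are stored as single bits (1 for +1, 0 for -1), a MAC reduces to an XNOR followed by a population count. The function names `binarized_mac` and `parallel_mac` are hypothetical, and the eight-row loop only mimics the eight-parallel structure in software; it does not model the PPIM circuit.

```python
def binarized_mac(activations: int, weights: int, width: int) -> int:
    """Dot product of two ±1 vectors packed as bits (1 -> +1, 0 -> -1).

    Assumption: XNOR/popcount formulation, not taken from the abstract.
    """
    mask = (1 << width) - 1
    # XNOR marks the positions where activation and weight agree (product +1).
    agree = ~(activations ^ weights) & mask
    matches = bin(agree).count("1")
    # Each agreement contributes +1 and each disagreement -1.
    return 2 * matches - width


def parallel_mac(activations: int, weight_rows: list[int], width: int) -> list[int]:
    """Apply the same activation word to several weight rows.

    The rows are independent, so hardware could evaluate them concurrently;
    here they are simply evaluated one after another.
    """
    return [binarized_mac(activations, row, width) for row in weight_rows]


if __name__ == "__main__":
    rows = [0b1101, 0b1011, 0b0100, 0b0000, 0b1111, 0b1010, 0b0101, 0b0011]
    print(parallel_mac(0b1011, rows, width=4))
```

Because every row reads the same activation word and writes its own result, the eight evaluations share no state, which is what makes this kind of MAC easy to run in parallel.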
In H.264, computational complexity and memory access of deblocking filters are variable, dependent on video *** paper proposes a VLSI architecture of deblocking filters with adaptive dynamic power, which avoids redundant computations and memory accesses by precluding the blocks that can be *** vertical and horizontal edges are simultaneously processed in an advanced scan order to speed up the *** a result, dynamic power of the proposed architecture can be reduced adaptively (up to about 89%) for different videos, and the off-chip memory access is improved when compared to previous ***, the processing capability of the proposed architecture is in particular appropriate for real-time deblocking of high-definition television (HDTV, 1920×1080 pixels/frame, 60 frames/s video signals) video operation at 62 *** the proposed architecture, power can be reduced by up to about 89% and processing time by 25% to 81% compared with previous designs.
It is important to obtain the steady-state operation of a power electronic circuit for its design. The shooting method, which is one of the steady-state analysis methods, solves a boundary value condition which equates initial values to final values after a one-period transient. This paper proposes how to process the analysis in parallel by using a Parallel Virtual Machine (PVM3) system. In particular, the multiple shooting method (MSM), which divides the shooting time into subsections, is suitable because it can parallelize the numerical processes for the divided shooting times by using multiple CPU units independently. This parallel method is applied to converter examples which are processed by PVM3. The computed results are investigated for their convergence and CPU times to check their parallel effects. According to the MSM algorithm, CPU times are reduced almost to one-fifth by using five computer units from a PVM3 system example. (C) 2001 Scripta Technica.
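The idea of splitting one period into independently integrated subsections can be sketched on a toy problem. The example below is an illustration under stated assumptions, not the paper's converter model: the circuit is replaced by the scalar linear equation dx/dt = -x + sin(2πt), the integrator is a fixed-step RK4, and a `ThreadPoolExecutor` stands in for the PVM3 workers. All function names are hypothetical. Because the equation is linear, each subsection maps its start value to its end value as a·x + b, so the periodic condition can be solved directly; a nonlinear circuit would need a Newton iteration instead.

```python
import math
from concurrent.futures import ThreadPoolExecutor

OMEGA = 2 * math.pi  # forcing frequency; the period is T = 1


def f(t: float, x: float) -> float:
    """Toy linear 'circuit' equation (an assumption for illustration)."""
    return -x + math.sin(OMEGA * t)


def integrate(t0: float, t1: float, x: float, steps: int = 200) -> float:
    """Fixed-step classical RK4 from t0 to t1."""
    h = (t1 - t0) / steps
    t = t0
    for _ in range(steps):
        k1 = f(t, x)
        k2 = f(t + h / 2, x + h / 2 * k1)
        k3 = f(t + h / 2, x + h / 2 * k2)
        k4 = f(t + h, x + h * k3)
        x += h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        t += h
    return x


def segment_map(bounds: tuple[float, float]) -> tuple[float, float]:
    """Return (a, b) such that x_end = a * x_start + b on one subsection."""
    t0, t1 = bounds
    b = integrate(t0, t1, 0.0)
    a = integrate(t0, t1, 1.0) - b
    return a, b


def steady_state_initial_value(period: float = 1.0, segments: int = 5) -> float:
    """Find x(0) with x(period) = x(0) by composing per-subsection maps."""
    edges = [period * i / segments for i in range(segments + 1)]
    # Each subsection is integrated independently, so the work can be farmed out.
    with ThreadPoolExecutor(max_workers=segments) as pool:
        maps = list(pool.map(segment_map, zip(edges, edges[1:])))
    a_total, b_total = 1.0, 0.0
    for a, b in maps:
        # Chain the affine maps in time order.
        a_total, b_total = a * a_total, a * b_total + b
    # Periodicity: x0 = a_total * x0 + b_total.
    return b_total / (1.0 - a_total)


if __name__ == "__main__":
    print(steady_state_initial_value())
```

For this toy equation the periodic solution is known in closed form, x(0) = -ω/(1 + ω²), which gives a simple check on the result. Python threads share one interpreter lock, so this layout shows the structure of the decomposition rather than a real speedup.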
The problem of co-ordinating the setting of directional overcurrent relays in an interconnected power system is considered. The power system is decomposed into subsystems to give a constraint matrix of a diagonal structure with linking variables. Each subsystem is solved using the sparse dual revised simplex algorithm of linear programming. Benders algorithm for linear programming may be used to co-ordinate the solution from each subsystem and the master system. The authors propose a new parallel processing algorithm for this co-ordination procedure which depends on the interaction balance principle of linear programming. This algorithm has been applied to various size systems and the results compared with those obtained using a conventional linear programming algorithm and Benders algorithm. It is shown that the proposed algorithm is more computationally efficient than previous methods.
A classification of the objectives and types of parallel processing gives the context of the principal subjects of this paper. A compiler and machine organization which are capable of detecting opportunities for parallel processing in algebraic expressions and executing these by multiple processing units operating concurrently are described. The compiler algorithm, based on a tree structure of algebraic expressions, translates the expression into an object program containing concurrency groups of instructions. Within each such group, all of the instructions can be executed at the same time. The machine organization (described in a schematic way) includes a simple control for executing the object program by allocating instructions to processors within a group and interlocking advance to the next group until execution of the current group is completed.
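One simple way to form such concurrency groups is to place each operation at its height in the expression tree: an operation depends only on operations at lower heights, so everything at the same height can run together. The sketch below assumes this height-based grouping, which may differ from the paper's actual algorithm. It uses Python's `ast` module as the parser, handles binary operations only, and the function name `concurrency_groups` is hypothetical.

```python
import ast
from collections import defaultdict


def concurrency_groups(expression: str) -> list[list[str]]:
    """Group the binary operations of an expression by tree height.

    Operations in the same group do not depend on each other.
    """
    tree = ast.parse(expression, mode="eval").body
    groups: dict[int, list[str]] = defaultdict(list)

    def height(node: ast.AST) -> int:
        # Operands such as names and constants need no instruction: height 0.
        if isinstance(node, ast.BinOp):
            h = 1 + max(height(node.left), height(node.right))
            groups[h].append(ast.unparse(node))
            return h
        return 0

    height(tree)
    # Groups are executed in increasing height order.
    return [groups[h] for h in sorted(groups)]


if __name__ == "__main__":
    for level, ops in enumerate(concurrency_groups("(a + b) * (c - d) + e * f"), 1):
        print(level, ops)
```

For the sample expression, the three innermost operations form the first group, and the machine would advance to the next group only after all of them have finished, matching the interlocking described above.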
We apply optical parallel processing to operations for multiplication modulo, which is one of the key components of a factorization algorithm. With this method, optical phase modulation provides the results of modulo operations. We construct a prototype system based on a Michelson interferometer with a photodetector array. Mirrors are set at both object and reference arms to generate interference fringes. A mirror in the object arm is tilted slightly, whereas the reference arm is set perpendicular to the optical axis. The tilt angle is determined by parameters for the target modulo operations. The presented system can achieve massive data processing in parallel with only simple implementation. We present our experimental results to verify the usefulness of our method. (C) 2008 Optical Society of America.
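Phase can yield a modulo result because it is periodic: a phase of 2π·a·x/N is indistinguishable from 2π·(a·x mod N)/N. The sketch below is a numerical illustration of that wrap-around only, under the assumption that the product is encoded linearly in phase. It does not model the interferometer, the tilted mirror, or the photodetector array, and the function name `mod_mul_via_phase` is hypothetical.

```python
import cmath
import math


def mod_mul_via_phase(a: int, x: int, n: int) -> int:
    """Recover (a * x) mod n from the wrapped phase of a unit-amplitude field."""
    phase = 2 * math.pi * a * x / n
    field = cmath.exp(1j * phase)  # only the phase modulo 2*pi survives here
    wrapped = cmath.phase(field) % (2 * math.pi)  # map from [-pi, pi] to [0, 2*pi)
    # Rounding absorbs floating-point error; the final % n handles a wrap to n.
    return round(wrapped * n / (2 * math.pi)) % n


if __name__ == "__main__":
    n, a = 15, 7
    # Every x is independent, like separate detector elements read out together.
    print([mod_mul_via_phase(a, x, n) for x in range(n)])
```

The sketch relies on floating-point phases, so it is only reliable while a·x stays small enough for the rounding step to resolve neighbouring residues.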