检索结果-内蒙古大学图书馆

An adaptive mesh refinement method for indirectly solving optimal control problems

NUMERICAL ALGORITHMS 2022年第1期91卷 193-225页

作者： Yang, Chaoyi Fabien, Brian C. Univ Washington Mech Engn Seattle WA 98195 USA

The indirect solution of optimal control problems (OCPs) with inequality constraints and parameters is obtained by solving the two-point boundary value problem (BVP) involving index-1 differential-algebraic equations (DAEs) associated with its first-order optimality conditions. This paper introduces an adaptive mesh refinement method based on a collocation method for solving the index-1 BVP-DAEs. The paper first derives a method to estimate the relative error between the numerical solution and the exact solution. The relative error estimate is then used to guide the mesh refinement process. The mesh size is increased when the estimated error within a mesh interval is beyond the numerical tolerance by either increasing the order of the approximating polynomial or dividing the interval into multiple subintervals. In the mesh interval where the error tolerance has been met, the mesh size is reduced by either decreasing the degree of the approximating polynomial or merging adjacent mesh intervals. An efficient parallel implementation of the method is implemented using Python and CUDA. The paper presents three examples which show that the approach is more computationally efficient and robust when compared with fixed-order methods.

关键词： Optimal control Indirect method Collocation method Adaptive mesh refinement gpu parallel programming

来源：评论

学校读者我要写书评

暂无评论

Efficient Signal Processing Acceleration using OpenCL-based FPGA-gpu Hybrid Cooperation for Reconfigurable ECG Diagnosis 18

Efficient Signal Processing Acceleration using OpenCL-based ...

引用

18th International SoC Design Conference (ISOCC)

作者： Lee, Dongkyu Lee, Seungmin Park, Daejin Kyungpook Natl Univ Sch Elect & Elect Engn Daegu South Korea

ISBN: (纸本)9781665401746

With the development of Internet of things (IoT), where humans and machines interact, healthcare that measures and diagnoses bio-signals is advancing. The electrocardiogram (ECG) signal has different normal beat characteristics for each person, and it requires long-term data for detecting abnormalities. In this paper, we increased the detection rate of the normal signals by learning the reference signal, which is the standard for diagnosing ECG signals, as individual-specific signals from existing fixed data. In addition, we proposed an OpenCL-based FPGA-gpu hybrid cooperative platform to efficiently diagnose long-term, large-capacity ECG signals.

关键词： FPGA acceleration gpu parallel programming electrocardiogram

来源：评论

学校读者我要写书评

暂无评论

gpu-Based Blind Watermarking Scheme for 3D Multiresolution Meshes Using Unlifted Butterfly Wavelet Transformation

引用

CIRCUITS SYSTEMS AND SIGNAL PROCESSING 2020年第3期39卷 1533-1560页

作者： Hachicha, Soumaya Sayahi, Ikbel Elkefi, Akram Amar, Chokri Ben Zaied, Mourad Univ Sfax Natl Engn Sch Sfax ENIS Sfax Tunisia Private Natl Engn Sch Monastir ESPRIMS Monastir 5060 Tunisia Univ Gabes Natl Engn Sch Gabes ENIG Gabes Tunisia

3D mesh watermarking in the transform domain requires significant computational complexity. This is due mainly to the incessant use of high-resolution meshes which require more and more resources. Normally, this is an expensive work that harms the commercial chain of low computational cost applications requiring content protection or enrichment. To tackle this issue, we proposed herein a high-capacity and blind watermarking scheme for 3D multiresolution semi-regular meshes while maintaining a trade-off between efficiency and robustness. For this purpose, our solution uses an unlifted butterfly wavelet transform technique that explores the computing power of the Graphic Processing Units (gpu) architecture and the Open Computing Language (OpenCL) framework. The robustness was optimized by generating a turbo-encoded watermark. This latter is embedded in the wavelet coefficients after their spherical parametrization at various levels of details using the least significant bit technique. The method allows a better imperceptibility of the watermark and invariability to affine transformation. It also shows comparative robustness against most of the geometric attacks including additive noise, quantization, smoothing and compression. Moreover, the comparison with other serial watermarking schemes proves the effectiveness in terms of computational complexity of our method. OpenCL embedding implementation offers 3-9 x speedups with a low-power gpu architecture for different mesh sizes. In case of extraction procedure, the speedups obtained vary between 2 x and 12 x.

关键词： 3D watermarking Multiresolution mesh LSB OpenCL Unlifted BWT gpu parallel programming

来源：评论

学校读者我要写书评

暂无评论

Lossless Data Compression for Improving the Performance of a gpu-Based Beamformer

引用

ULTRASONIC IMAGING 2015年第2期37卷 135-151页

作者： Lok, U-Wai Fan, Gang-Wei Li, Pai-Chi Natl Taiwan Univ Grad Inst Biomed Elect & Bioinformat Taipei 10764 Taiwan Natl Taiwan Univ Dept Elect Engn Taipei 10764 Taiwan

The powerful parallel computation ability of a graphics processing unit (gpu) makes it feasible to perform dynamic receive beamforming However, a real time gpu-based beamformer requires high data rate to transfer radio-frequency (RF) data from hardware to software memory, as well as from central processing unit (CPU) to gpu memory. There are data compression methods (e.g. Joint Photographic Experts Group (JPEG)) available for the hardware front end to reduce data size, alleviating the data transfer requirement of the hardware interface. Nevertheless, the required decoding time may even be larger than the transmission time of its original data, in turn degrading the overall performance of the gpu-based beamformer. This article proposes and implements a lossless compression-decompression algorithm, which enables in parallel compression and decompression of data. By this means, the data transfer requirement of hardware interface and the transmission time of CPU to gpu data transfers are reduced, without sacrificing image quality. In simulation results, the compression ratio reached around 1.7. The encoder design of our lossless compression approach requires low hardware resources and reasonable latency in a field programmable gate array. In addition, the transmission time of transferring data from CPU to gpu with the parallel decoding process improved by threefold, as compared with transferring original uncompressed data. These results show that our proposed lossless compression plus parallel decoder approach not only mitigate the transmission bandwidth requirement to transfer data from hardware front end to software system but also reduce the transmission time for CPU to gpu data transfer.

关键词： beamformer compression parallel decoder gpu parallel programming

来源：评论

学校读者我要写书评

暂无评论

Gyro-aided feature tracking for a moving camera: fusion, auto-calibration and gpu implementation

引用

INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH 2011年第14期30卷 1755-1774页

作者： Hwangbo, Myung Kim, Jun-Sik Kanade, Takeo Carnegie Mellon Univ Inst Robot Pittsburgh PA 15213 USA

When a camera rotates rapidly or shakes severely, a conventional KLT (Kanade-Lucas-Tomasi) feature tracker becomes vulnerable to large inter-image appearance changes. Tracking fails in the KLT optimization step, mainly due to an inadequate initial condition equal to final image warping in the previous frame. In this paper, we present a gyro-aided feature tracking method that remains robust under fast camera-ego rotation conditions. The knowledge of the camera's inter-frame rotation, obtained from gyroscopes, provides an improved initial warping condition, which is more likely within the convergence region of the original KLT. Moreover, the use of an eight-degree-of-freedom affine photometric warping model enables the KLT to cope with camera rolling and illumination change in an outdoor setting. For automatic incorporation of sensor measurements, we also propose a novel camera/gyro auto-calibration method which can be applied in an in-situ or on-the-fly fashion. Only a set of feature tracks of natural landmarks is needed in order to simultaneously recover intrinsic and extrinsic parameters for both sensors. We provide a simulation evaluation for our auto-calibration method and demonstrate enhanced tracking performance for real scenes with aid from low-cost microelectromechanical system gyroscopes. To alleviate the heavy computational burden required for high-order warping, our publicly available gpu implementation is discussed for tracker parallelization.

关键词： Auto-calibration gpu parallel programming KLT feature tracking visual/inertial sensor fusion

来源：评论

学校读者我要写书评

暂无评论

A gpu-based hyperbolic SVD algorithm

引用

BIT NUMERICAL MATHEMATICS 2011年第4期51卷 1009-1030页

作者： Novakovic, Vedran Singer, Sanja Univ Zagreb Fac Mech Engn & Naval Architecture Zagreb 10000 Croatia

A one-sided Jacobi hyperbolic singular value decomposition (HSVD) algorithm, using a massively parallel graphics processing unit (gpu), is developed. The algorithm also serves as the final stage of solving a symmetric indefinite eigenvalue problem. Numerical testing demonstrates the gains in speed and accuracy over sequential and MPI-parallelized variants of similar Jacobi-type HSVD algorithms. Finally, possibilities of hybrid CPU-gpu parallelism are discussed.

关键词： One-sided Jacobi algorithm Hyperbolic singular value decomposition Symmetric indefinite eigenvalue problem gpu parallel programming

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：