Search results - Inner Mongolia University Library

36th Conference on Neural Information Processing Systems (NeurIPS)

Authors: Patel, Kumar Kshitij; Wang, Lingxiao; Woodworth, Blake; Bullins, Brian; Srebro, Nati. Affiliations: TTIC, Chicago, IL 60637, USA; INRIA SIERRA, Paris, France; Purdue Univ, W Lafayette, IN 47907, USA

ISBN: (print) 9781713871088

We study the problem of distributed stochastic non-convex optimization with intermittent communication. We consider the full participation setting where M machines work in parallel over R communication rounds and the partial participation setting where M machines are sampled independently every round from some meta-distribution over machines. We propose and analyze a new algorithm that improves existing methods by requiring fewer and lighter variance reduction operations. We also present lower bounds, showing our algorithm is either optimal or almost optimal in most settings. Numerical experiments demonstrate the superior performance of our algorithm.

Keywords: Convex optimization
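
The intermittent-communication setting in the abstract above (M machines working in parallel over R communication rounds) can be illustrated with plain local SGD. This is a minimal sketch under stated assumptions, not the algorithm proposed in the paper: it has no variance reduction, the per-machine objectives 0.5 * (x - c)^2 are a toy choice, and the names `local_sgd`, `centers`, and `noise` are hypothetical.

```python
import random


def local_sgd(centers, rounds, local_steps, lr, noise=0.01, seed=0):
    """Toy local SGD: each machine m minimizes 0.5 * (x - centers[m])**2.

    Machines take `local_steps` noisy gradient steps on their own, then
    average their models once per communication round.
    """
    rng = random.Random(seed)
    x = 0.0  # shared model at the start of every round
    for _ in range(rounds):
        local_models = []
        for c in centers:  # one loop iteration per machine
            x_m = x
            for _ in range(local_steps):
                # Stochastic gradient: exact gradient plus Gaussian noise.
                grad = (x_m - c) + rng.gauss(0.0, noise)
                x_m -= lr * grad
            local_models.append(x_m)
        # Communication step: average the local models.
        x = sum(local_models) / len(local_models)
    return x


result = local_sgd([1.0, 2.0, 3.0], rounds=20, local_steps=5, lr=0.1)
print(result)
```

With these toy objectives the averaged model approaches the mean of `centers`, which is the minimizer of the average objective.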

Research and application of image detection technology of pear tree pests and pests

6th International Conference on Telecommunications and Communication Engineering

Authors: Feng, Yuanyuan; Xu, Zhihan; Yang, Xiaoyu. Affiliation: Chengdu Univ Technol, Coll Mech & Elect Engn, Chengdu, Peoples R China

ISBN: (print) 9798400709630

This paper aims to achieve precise identification of diseases and pests affecting pear trees through the integration of YOLOv5, Jetson Nano, big data, and deep learning techniques. The objective is to facilitate timely detection of these issues, thereby enabling early prevention and control measures for pear tree health. The study employs YOLOv5 as the primary model, which is implemented on the embedded device Jetson Nano. By leveraging the GPU parallel processing capabilities of Jetson Nano's deep learning framework, this approach enhances picture analysis and detection speed while improving quality and efficiency [1]. Furthermore, it integrates big data with deep learning methodologies to bolster the accuracy of disease detection and identification. Utilizing these advanced technologies allows for accurate recognition of diseases and pests associated with pear trees through image analysis. This significantly reduces the complexity involved in detecting such conditions and lowers operational thresholds for practitioners in the field. In comparison to traditional detection methods, YOLOv5 technology exhibits no stringent requirements regarding environmental conditions or backgrounds; thus, it remains less susceptible to variations caused by weather factors, making it a superior choice for pest and disease detection in agricultural settings.

Keywords: Image analysis; Big data; Embedded equipment; Plant diseases and insect pests

Collaborative Segmentation Model for Colonoscopy Ulcers based on Fuzzy Labeling

9th IEEE International Conference on Advanced Robotics and Mechatronics (ICARM)

Authors: Lin, Yanning; Chen, Jie; Ding, Ziqian; Chen, Linxu; Li, Le; Zhu, Zhongsheng; Wang, Zhaoxia; Li, Jianqiang. Affiliations: Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen 518000, Peoples R China; Shenzhen Childrens Hosp, Gastroenterol, Shenzhen 518000, Peoples R China; Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518000, Peoples R China; Shenzhen Childrens Hosp, Shenzhen 518000, Peoples R China

ISBN: (print) 9798350385731; 9798350385724

Inflammatory Bowel Disease (IBD) is a global chronic intestinal inflammatory disease, and its incidence rate increases year by year with the progress of economic globalization. Currently, the diagnosis of IBD in children mainly relies on endoscopic examination, but scoring endoscopic images is a challenging issue, especially in distinguishing different types of ulcers. To address this issue, this article designs a mobile application to accelerate data annotation processing and may provide reference for other unlabeled datasets. In the context of image segmentation, blurring labels has become an important issue. Deep learning methods are widely used in medical image segmentation, but their accuracy depends on high-quality annotated data. However, there are low-quality noise areas in the annotated data, and obtaining accurate and high-quality annotations becomes more time-consuming with limited annotation budgets. This article proposes a collaborative training framework to improve learning of noisy pixels. This framework determines the label confidence of an image by calculating the similarity between image pixels and surrounding pixels. Then, two parallel deep networks were constructed for semantic prediction, which aimed to guide each other on pixels that may have noise. By applying consistency in dual network prediction, the semantic information of uncertain pixels is corrected as much as possible. Experimental results have shown that this framework is slightly superior to models trained with pixel level precise labels, thus more effectively utilizing existing annotated data in the case of fuzzy labels.

Keywords: Image segmentation; Fuzzy label; Data annotation

GPU Accelerated Modelling and Real-time Rendering of Fluid Motion

38th International Conference on Image and Vision Computing New Zealand, IVCNZ 2023

Authors: Valentine, William; Mukundan, Ramakrishnan. Affiliation: University of Canterbury, Dept. Computer Science and Software Engineering, Christchurch, New Zealand

ISBN: (print) 9798350370515

This paper proposes a fluid rendering pipeline that uses OpenGL-4 shaders to employ the parallel processing capabilities of the GPU. The fluid's surface mesh is produced using tessellation shader stages where the input patches are assigned tessellation levels based on the fluid heightmap's curvature. The curvature is stored using a texture buffer object which allows access by shaders, thus allowing the tessellation calculations to be carried out in parallel. Use of this adaptive tessellation method increases both the simulation's framerate as well as its capacity to handle a greater number of primitives. Furthermore, it more optimally distributes the mesh's vertices to effectively increase the level of detail without using more primitives. Polygon culling using the geometry shader further optimises the number of primitives used to define the fluid surface. The Phong-Blinn model is used for surface lighting. We propose two GPU-based fluid surface flow visualisation methods. Texture buffer objects can be used to store and update a surface texture. Alternatively, particle positions are updated each frame using the geometry shader and stored in a buffer object using transform feedback. These flow visualisation techniques are particularly effective for communicating the swirling motion of vortices. © 2023 IEEE.

Keywords: Graphics processing unit
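
The abstract above assigns tessellation levels from the curvature of the fluid heightmap. The sketch below shows one way such a mapping could look on the CPU. It is an illustration only: the paper does this in OpenGL shaders per patch, while this sketch works per heightmap sample, and the discrete Laplacian as the curvature measure, the `gain` factor, and the `min_level`/`max_level` caps are all assumptions not taken from the source.

```python
def curvature(height, x, y):
    """Discrete Laplacian of the heightmap at (x, y), with clamped borders."""
    rows, cols = len(height), len(height[0])

    def at(yy, xx):
        # Clamp coordinates so border samples reuse the nearest valid value.
        return height[min(max(yy, 0), rows - 1)][min(max(xx, 0), cols - 1)]

    return (at(y, x - 1) + at(y, x + 1) + at(y - 1, x) + at(y + 1, x)
            - 4 * at(y, x))


def tessellation_level(height, x, y, gain=8.0, min_level=1, max_level=64):
    """Map absolute curvature to a clamped integer tessellation level."""
    level = min_level + gain * abs(curvature(height, x, y))
    return int(min(max(level, min_level), max_level))


flat = [[0.0] * 3 for _ in range(3)]
bump = [[0.0, 0.0, 0.0],
        [0.0, 1.0, 0.0],
        [0.0, 0.0, 0.0]]
print(tessellation_level(flat, 1, 1), tessellation_level(bump, 1, 1))
```

Flat regions stay at the minimum level, while the sample at the top of the bump receives a higher one, which is the intended effect of curvature-driven refinement.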

Residual Hybrid Attention Enhanced Video Super-Resolution with Cross Convolution

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Yuan, Shiqian; Li, Boyue; Zhao, Xin; Lan, Rushi; Luo, Xiaonan. Affiliations: Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Peoples R China; Guilin Univ Elect Technol, Int Joint Res Lab Spatiotemporal Informat & Intel, Guilin 541004, Peoples R China

ISBN: (print) 9789819785070; 9789819785087

In video super-resolution reconstruction, traditional methods often fall short in capturing details, particularly in edges and occluded areas, which affects the realism and clarity of the images. To address this issue, we propose a novel model, the Residual Hybrid Attention-Enhanced Video Super-Resolution Model augmented by Cross Convolution techniques, denoted as RCVSR. The model integrates a residual hybrid attention mechanism, refining the learning of global and local features through parallel channel attention and self-attention mechanisms. Simultaneously, our model introduces overlapping cross-attention blocks to enhance dynamic interactions between frames, thereby boosting the model's performance. Furthermore, the design of the cross-convolution blocks allows for parallel processing of vertical and horizontal gradient information in images, effectively extracting edge details. In multiple benchmark tests, the RCVSR model demonstrated its excellent reconstruction effects and outstanding performance.

Keywords: Video super-resolution; Hybrid attention; Cross convolution
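
The cross-convolution idea in the abstract above (separate vertical and horizontal gradient branches whose outputs are combined) can be sketched with fixed 1x3 and 3x1 difference kernels. This is only an illustration: the RCVSR blocks presumably use learned kernels, and the fixed [-1, 0, 1] filters, border clamping, and function names here are assumptions.

```python
def horizontal_gradient(img):
    """Apply a 1x3 kernel [-1, 0, 1] along each row (borders clamped)."""
    rows, cols = len(img), len(img[0])
    return [[img[y][min(x + 1, cols - 1)] - img[y][max(x - 1, 0)]
             for x in range(cols)] for y in range(rows)]


def vertical_gradient(img):
    """Apply a 3x1 kernel [-1, 0, 1] along each column (borders clamped)."""
    rows, cols = len(img), len(img[0])
    return [[img[min(y + 1, rows - 1)][x] - img[max(y - 1, 0)][x]
             for x in range(cols)] for y in range(rows)]


def cross_response(img):
    """Combine the two independent branches into one edge map."""
    gx = horizontal_gradient(img)  # branch 1: responds to vertical edges
    gy = vertical_gradient(img)    # branch 2: responds to horizontal edges
    return [[abs(a) + abs(b) for a, b in zip(row_x, row_y)]
            for row_x, row_y in zip(gx, gy)]


# A 4x4 image with a vertical edge between columns 1 and 2.
image = [[0, 0, 1, 1] for _ in range(4)]
edges = cross_response(image)
print(edges[0])
```

The two branches do not depend on each other, so they can be evaluated in parallel before being summed.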

FERI: Feature Enhancement and Relational Interaction for Image-Text Matching

International Conference on Parallel and Distributed Systems (ICPADS)

Authors: Yu Zhang; Jianqiang Zhang; Gongpeng Song; Qin Lu; Shuo Zhao. Affiliations: Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Engineering Research Center of Big Data Applied Technology, Faculty of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China; Shandong Provincial Key Laboratory of Computer Networks, Shandong Fundamental Research Center for Computer Science, Jinan, China; Shandong Branch of China Mobile Communication Group Design Institute Co., Jinan, China

ISBN: (digital) 9798331515966

ISBN: (print) 9798331515973

Image-text matching is an important problem at the intersection of computer vision and natural language processing. It aims to establish the semantic link between image and text to achieve high-quality semantic alignment between the two modalities. However, existing methods cannot fully understand the meaning expressed in the image or the complex narrative in the text because of insufficient feature extraction. Moreover, due to the essential modal differences between images and texts, how to effectively and accurately align the semantic contents in images and texts has become a key research question. To solve these problems, this paper proposes a method based on feature enhancement and relationship interaction. When processing images, the proposed method fuses labeled features, region features and location features to represent images. When processing text, a combination of Bi-GRU and self-attention mechanism is used to represent the text. To further align the semantic content in images and texts accurately, this paper improves two relational interaction mechanisms by identifying connection relationships and learning association relationships. Thus, the relation-enhanced embedding is obtained. Finally, the similarity of the enhanced embeddings is calculated to judge the matching degree of the image and text. Extensive experiments on the public datasets Flickr30K and MSCOCO demonstrate the effectiveness of our method.

Keywords: Computer vision; Fuses; Image matching; Semantics; Feature extraction; Natural language processing
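
The final step described in the abstract above scores an image against a text by the similarity of their embeddings. The abstract does not say which similarity measure FERI uses; the sketch below assumes cosine similarity, a common choice for image-text retrieval, and uses made-up three-dimensional vectors and hypothetical names.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention for zero vectors, which have no direction
    return dot / (norm_u * norm_v)


def best_caption(image_embedding, caption_embeddings):
    """Return the caption key whose embedding is most similar to the image."""
    return max(caption_embeddings,
               key=lambda k: cosine_similarity(image_embedding,
                                               caption_embeddings[k]))


image_vec = [1.0, 2.0, 0.0]
captions = {
    "caption_a": [2.0, 4.0, 0.0],  # same direction as the image embedding
    "caption_b": [0.0, 0.0, 1.0],  # orthogonal to the image embedding
}
print(best_caption(image_vec, captions))
```

Because cosine similarity ignores vector length, `caption_a` matches even though its embedding is twice as long as the image embedding.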

A Heterogeneous KBA Parallel Algorithm for the Cartesian Discrete Ordinates for Multizone Heterogeneous System

8th International Conference on Computer and Communication Systems, ICCCS 2023

Authors: Li, Runhua; Liu, Jie. Affiliation: National University of Defense Technology, Science and Technology on Parallel and Distributed Processing Laboratory, Changsha, China

ISBN: (print) 9781665456128

Innovations in powerful high-performance computing (HPC) architecture are enabling high-fidelity whole-core neutron transport simulations in reasonable time. In particular, the currently fashionable heterogeneous architectures keep the cost of such simulations at a very low level. Neutron distribution of a reactor core is governed by the Boltzmann neutron transport equation (BTE), first viable solutions of which need tremendous computer resources. Among the high-fidelity numerical methods, the discrete ordinates method (SN) is becoming popular in the reactor design community by striking a good balance between computational cost and accuracy. Recently, MT-3000, a multizone heterogeneous architecture with a peak double precision performance of 11.6 TFLOPS, was proposed. In this work, the BTE is solved by the SN method with heterogeneous Koch-Baker-Alcouffe (KBA) parallel algorithms based on the MT-3000 architecture. A communication mechanism has been established to efficiently transmit data among the acceleration cores and the CPU cores. The kernel computation procedure is largely accelerated by the vectorization and instruction pipelining techniques. Numerical experiments show that our formulation could achieve 1.37 TFLOPS with a single MT-3000, that is, 11.8% of its peak performance. © 2023 IEEE.

Keywords: Parallel algorithms
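
KBA-style sweeps parallelize a discrete ordinates transport sweep along diagonal wavefronts: for one sweep direction, a cell needs the outgoing fluxes of its upstream neighbours, so cells on the same diagonal can be processed concurrently. The sketch below shows only that scheduling idea on a 2D grid for a sweep starting at cell (0, 0). It is a simplified illustration and says nothing about the MT-3000 implementation; the function name and grid sizes are hypothetical.

```python
def wavefronts(nx, ny):
    """Group the cells of an nx-by-ny grid into diagonal wavefronts.

    Cell (i, j) depends on its upstream neighbours (i - 1, j) and
    (i, j - 1), so every cell with the same i + j can run in parallel
    once the previous wavefront has finished.
    """
    fronts = [[] for _ in range(nx + ny - 1)]
    for i in range(nx):
        for j in range(ny):
            fronts[i + j].append((i, j))
    return fronts


schedule = wavefronts(3, 4)
for step, cells in enumerate(schedule):
    print(step, cells)
```

The number of wavefronts is nx + ny - 1, so the available parallelism ramps up towards the middle of the sweep and down again at the end, which is why pipelining several angles or energy groups is commonly used to keep processors busy.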

A Multi-level Parallel Integer/Floating-Point Arithmetic Architecture for Deep Learning Instructions

29th International Conference on Parallel and Distributed Computing (Euro-Par)

Authors: Tan, Hongbing; Zhang, Jing; Huang, Libo; He, Xiaowei; Dong, Dezun; Wang, Yongwen; Xiao, Liquan. Affiliation: Natl Univ Def Technol, Changsha 410073, Peoples R China

ISBN: (print) 9783031396977; 9783031396984

The extensive instruction-set for deep learning (DL) significantly enhances the performance of general-purpose architectures by exploiting data-level parallelism. However, it is challenging to design arithmetic units capable of performing parallel operations on a wide range of formats to perform DL instructions (DLIs) efficiently. This paper presents a multi-level parallel arithmetic architecture capable of supporting intra- and inter-operation parallelism for integer and a wide range of FP formats. For intra-operation parallelism, the proposed architecture supports multi-term dot-product for integer, half-precision, and Brain-Float16 formats using mixed-precision methods. For inter-operation parallelism, a dual-path execution is enabled to perform integer dot-product and single-precision (SP) addition in parallel. Moreover, the architecture supports the commonly used fused multiply-add (FMA) operations in general-purpose architectures. The proposed architecture strictly adheres to the computing requirements of DLIs and can efficiently implement them. When using benchmarked DNN inference applications where both integer and FP formats are needed, the proposed architecture can significantly improve performance by up to 15.7% compared to a single-path implementation. Furthermore, compared with state-of-the-art designs, the proposed architecture achieves higher energy efficiency and works more efficiently in implementing DLIs.

Keywords: Deep Learning instruction; Data-level parallelism; Arithmetic Architecture; Dot-Product; Mixed-Precision; Inter- and Intra-operation parallelism
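
A multi-term, mixed-precision dot product of the kind mentioned in the abstract above takes low-precision inputs and accumulates in a wider format. The sketch below emulates that in software: inputs are rounded to IEEE 754 half precision with the standard `struct` module, and the sum is kept in a Python float (double precision), which stands in for the wider accumulator. The accumulator width and rounding behaviour of the actual hardware are not given in the source, so these are assumptions.

```python
import struct


def to_half(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def mixed_precision_dot(a, b):
    """Dot product with half-precision inputs and a wider accumulator."""
    acc = 0.0  # double precision; stands in for the wide accumulator
    for x, y in zip(a, b):
        # The product of two half-precision values is exact in double.
        acc += to_half(x) * to_half(y)
    return acc


result = mixed_precision_dot([1.0, 2.0, 3.0, 4.0], [0.5, 0.25, 2.0, 1.0])
print(result, to_half(0.1))
```

Values such as 0.1 are not exactly representable in half precision, so the input rounding is where most of the error comes from; the wide accumulator keeps the summation itself from adding more.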

Enhancing Skin Disease Diagnosis with a Dual Vision Transformer Model

15th International Conference on Computing Communication and Networking Technologies, ICCCNT 2024

Authors: Gairola, Ajay Krishan; Kumar, Vidit; Sahoo, Ashok Kumar. Affiliations: Graphic Era Hill University, Department of Computer Science and Engineering, Dehradun, India; Graphic Era Deemed to be University, Department of Computer Science and Engineering, Dehradun, India

ISBN: (print) 9798350370249

An increasing number of people are experiencing skin problems, and it is never easy for a clinician to make a correct diagnosis. One potential solution to these problems is the use of deep learning for skin disease diagnosis. In this paper, a new kind of neural network is proposed for diagnosing skin diseases. We provide a new dual Vision Transformer (ViT) model that has two patches for image embedding and transformer encoding, as well as two multilayer perceptron (MLP) heads to combine the two models, in light of the fact that the research datasets are images of skin diseases. To get deep characteristics from images, the proposed framework employs a ViT model that is both appropriate and effective. As a first step in pre-processing, images are divided into uniformly sized patches, and then compressed into a string of tokens. The second step is to process the flattened patches using two transformer encoders that are connected in parallel. For the transformer encoder, there are Ni, Nj layers, where each layer has two sublayers. The first component is a self-attention unit with multiple heads, while the second component is a feedforward network with all connections enabled. After normalizing the inputs to each of the two sublayers, we evaluate the strength of the residual connections between them. The implementation of a classification block follows the concatenation of MLP heads. The block is made up of two layers: one that is flattened and one that is dense and optimized for batch processing. The experimental results of the method show classification accuracies of 88% (ISIC2016), 90% (ISIC2017), 91% (HAM10000), and 88% (Skin-disease-v1). The proposed model represents an advancement over state-of-the-art methods and signifies progress in identifying skin diseases. © 2024 IEEE.

Keywords: Deep learning; Image processing; Neural network; Skin disease classification; Transformer
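
The pre-processing step in the abstract above (dividing an image into uniformly sized patches and flattening them into a sequence of tokens) can be sketched as follows. This covers only the patch split for a single-channel image; the linear projection, position embeddings, and the two parallel encoders are omitted, and the function name and the 4x4 test image are hypothetical.

```python
def patchify(image, patch):
    """Split a 2D image into non-overlapping patch x patch blocks.

    Returns one flattened token (a list of pixel values) per block,
    ordered row by row across the image.
    """
    rows, cols = len(image), len(image[0])
    if rows % patch or cols % patch:
        raise ValueError("image size must be a multiple of the patch size")
    tokens = []
    for top in range(0, rows, patch):
        for left in range(0, cols, patch):
            tokens.append([image[top + dy][left + dx]
                           for dy in range(patch)
                           for dx in range(patch)])
    return tokens


# A 4x4 image whose pixel values are 0..15 in row-major order.
image = [[4 * r + c for c in range(4)] for r in range(4)]
tokens = patchify(image, 2)
print(tokens)
```

In a dual-encoder design like the one described, the same token sequence (or two sequences built with different patch sizes) would be fed to both encoders; which of these the paper uses is not stated in the abstract.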

Constant Time Median Filter using 2D Wavelet Matrix

ACM Transactions on Graphics, 2022, Vol. 41, No. 6, pp. 1-10

Authors: Moroto, Yuji; Umetani, Nobuyuki. Affiliation: Univ Tokyo, Tokyo, Japan

The median filter is a simple yet powerful noise reduction technique that is extensively applied in image, signal, and speech processing. It can effectively remove impulsive noise while preserving the content of the image by taking the median of neighboring pixels; thus, it has various applications, such as restoration of a damaged image and facial beautification. The median filter is typically implemented in one of two major approaches: the histogram-based method, which requires O(1) computation time per pixel when focusing on the kernel radius r, and the sorting-based method, which requires approximately O(r^2) computation time per pixel but has a light constant factor. These are used differently depending on the kernel radius and the number of bits in the image. However, the computation time is still slow, particularly when the kernel radius is in the mid to large range. This paper introduces a novel and efficient median filter with constant complexity O(1) for kernel size using the wavelet matrix data structure, which has been applied to query-based searches on one-dimensional data. We extended the original wavelet matrix to two-dimensional data for application to computer graphics problems. The objective of this study was to achieve high-speed median filter computation in a parallel computing environment with many threads (i.e., GPUs). Our implementation for the GPU is an order of magnitude faster than the histogram method for 8-bit images. Unlike traditional histogram methods, which suffer from significant computational overhead, the proposed method can handle images with high pixel depth (e.g., 16- and 32-bit high dynamic range images). When the kernel radius is greater than 12 for 8-bit images, the proposed method outperforms the other median filter computation methods.

Keywords: Median filters; Wavelet matrix; Parallelization; High dynamic range image
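
For reference, the sorting-based baseline mentioned in the abstract above can be written in a few lines. This is a naive sketch, not the paper's wavelet-matrix method: it gathers the full (2r + 1) x (2r + 1) window for every pixel and sorts it, so the per-pixel cost grows with the window area. Border clamping and the function name are assumptions.

```python
def median_filter(image, radius):
    """Naive sorting-based median filter with clamped borders."""
    rows, cols = len(image), len(image[0])
    out = [[0] * cols for _ in range(rows)]
    for y in range(rows):
        for x in range(cols):
            window = []
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    # Clamp coordinates so the window stays inside the image.
                    yy = min(max(y + dy, 0), rows - 1)
                    xx = min(max(x + dx, 0), cols - 1)
                    window.append(image[yy][xx])
            window.sort()
            # The window size is always odd, so the middle element is the median.
            out[y][x] = window[len(window) // 2]
    return out


# A flat 5x5 image with one impulsive-noise pixel in the centre.
noisy = [[10] * 5 for _ in range(5)]
noisy[2][2] = 255
filtered = median_filter(noisy, 1)
print(filtered[2])
```

The single outlier is removed because it never reaches the middle of any sorted window, which is the impulsive-noise behaviour the abstract describes.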
