检索结果-内蒙古大学图书馆

Retinal image Quality Assessment Using Sharpness and Connected Components 1

6th international conference on computer vision and image processing, cvip 2021

作者： Kiruthika, S. Masilamani, V. Indian Institute of Information Technology Design and Manufacturing Kancheepuram Chennai India

ISBN: (数字)9783031113499

ISBN: (纸本)9783031113482

Mobile application based diagnosis has become an aid nowadays. For better diagnosis, the quality of image needs to be good. Automatic assessment of images will help the ophthalmologists to focus more on the diagnosis. To assist the experts, an automated retinal image quality assessment method has been proposed. the proposed method make use of the features extracted from the sharpness and connected components of the fundus image. In particular, the image is divided into patches and the features are extracted. those extracted features are used to train a machine learning model. the proposed model has achieved comparable results on the private dataset and outperformed the existing methods on public datasets. © 2022, the Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Machine learning

来源：评论

学校读者我要写书评

暂无评论

DHFormer: A vision Transformer-Based Attention Module for image Dehazing 1

引用

8th international conference on computer vision and image processing (cvip)

作者： Wasi, Abdul Shiney, O. Jeba Chandigarh Univ Mohali India

ISBN: (数字)9783031581816

ISBN: (纸本)9783031581809;9783031581816

images acquired in hazy conditions have degradations induced in them. Dehazing such images is a vexed and ill-posed problem. Scores of prior-based and learning-based approaches have been proposed to mitigate the effect of haze and generate haze-free images. Many conventional methods are constrained by their lack of awareness regarding scene depth and their incapacity to capture long-range dependencies. In this paper, a method that uses residual learning and vision transformers in an attention module is proposed. It essentially comprises two networks: In the first one, the network takes the ratio of a hazy image and the approximated transmission matrix to estimate a residual map. the second network takes this residual image as input and passes it through convolution layers before superposing it on the generated feature maps. It is then passed through global context and depth-aware transformer encoders to obtain channel attention. the attention module then infers the spatial attention map before generating the final haze-free image. Experimental results including several quantitative metrics demonstrate the efficiency and scalability of the suggested methodology.

关键词： Residual Learning Transmission Matrix vision Transformer Attention Module

来源：评论

学校读者我要写书评

暂无评论

Combing color index and region growing with simple non-iterative clustering for plant segmentation 6

Combing color index and region growing with simple non-itera...

引用

6th international conference on image, vision and Computing, ICIVC 2021

作者： Mei, Jie Sun, Kaiqiong Xu, Xin School of Mathematics and Computer Wuhan Polytechnic University Wuhan China

ISBN: (纸本)9781665443685

Plant segmentation is an important application of computer vision processing technology in agriculture. Relying on plant segmentation technology, many crop problems can be significantly discovered and prevented. At the same time, plant conditions can be quickly and accurately understood, thereby increasing yield. this paper proposes a plant segmentation method based on the combination of color index and region growing. threshold on the color index image produces object mask and background mask. the rest of pixels are segmented by region growing with the mask as seed points. the region growing is implemented with a simple non-iterative clustering method. the experimental results on four sets of data sets show that the accuracy has been improved with our method compared to index-based method. © 2021 IEEE.

关键词： Iterative methods

来源：评论

学校读者我要写书评

暂无评论

Is Grad-CAM Explainable in Medical images? 1

引用

8th international conference on computer vision and image processing (cvip)

作者： Suara, Subhashis Jha, Aayush Sinha, Pratik Sekh, Arif Ahmed XIM Univ Bhubaneswar India

ISBN: (数字)9783031581816

ISBN: (纸本)9783031581809;9783031581816

Explainable Deep Learning has gained significant attention in the field of artificial intelligence (AI), particularly in domains such as medical imaging, where accurate and interpretable machine learning models are crucial for effective diagnosis and treatment planning. Grad-CAM is a baseline that highlights the most critical regions of an image used in a deep learning model's decision-making process, increasing interpretability and trust in the results. It is applied in many computer vision (CV) tasks such as classification and explanation. this study explores the principles of Explainable Deep Learning and its relevance to medical imaging, discusses various explainability techniques and their limitations, and examines medical imaging applications of Grad-CAM. the findings highlight the potential of Explainable Deep Learning and Grad-CAM in improving the accuracy and interpretability of deep learning models in medical imaging. the code is available in (https://***/ beasthunter758/GradEML).

关键词： Explainable Deep Learning Gradient-weighted Class Activation Mapping (Grad-CAM) Medical image Analysis

来源：评论

学校读者我要写书评

暂无评论

image Captioning with Visual Positional Embedding and Bi-linear Pooling 8th

Image Captioning with Visual Positional Embedding and Bi-lin...

引用

8th international conference on computer vision and image processing (cvip)

作者： Nair, Sidharth Guha, Prithwijit HCLTech Chennai Tamil Nadu India Indian Insitute Technol Guwahati Dept Elect & Elect Engn Gauhati India

ISBN: (纸本)9783031581809;9783031581816

Recent approaches to image captioning typically follow an encoder-decoder architecture. the feature vectors extracted from the region proposals obtained from an object detector network serve as input to encoder. Without any explicit spatial information about the visual regions, the caption synthesis model is limited to learn relationship from captions only. However, the structure between the semantic units in images and sentences is different. this work introduces a grid based spatial position encoding scheme to learn relationship from both domains. Furthermore, bi-linear pooling is used with attention for exploiting spatial and channel-wise attention distribution to capture second order interaction between multi-modal inputs. these are integrated within the Transformer architecture achieving a competitive CIDEr score.

关键词： Transformer Positional Embedding image Captioning Bi-linear Pooling

来源：评论

学校读者我要写书评

暂无评论

Dyadic Interaction Recognition Using Dynamic Representation and Convolutional Neural Network 1

引用

6th international conference on computer vision and image processing, cvip 2021

作者： Shebiah, R. Newlin Arivazhagan, S. Centre for Image Processing and Pattern Recognition Department of Electronics and Communication Engineering Mepco Schlenk Engineering College Tamilnadu Sivakasi626005 India

ISBN: (数字)9783031113468

ISBN: (纸本)9783031113451

Human interaction recognition can be used in video surveillance to recognise human behaviour. the goal of this research is to classify human interaction by converting video snippets into dynamic images and deep CNN architecture for classification. the human interaction input video is snipped into a certain number of smaller segments. For each segment, dynamic image is constructed that efficiently encodes a video segment into an image with an action silhouette, which plays an important role in interaction recognition. the discriminative features are learned and classified from dynamic image using Convolutional Neural Network. the efficacy of the proposed architecture for interaction recognition is demonstrated by the obtained results on the SBU Kinect Interaction dataset, IXMAS, and TV Human Interaction datasets. © 2022, the Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

Near-Infrared image Colorization Using Unsupervised Contrastive Learning 1

引用

8th international conference on computer vision and image processing (cvip)

作者： Rao, Devesh Jayaraj, P. B. Pournami, P. N. Natl Inst Technol Calicut Dept Comp Sci & Engn Kozhikode 673601 Kerala India

ISBN: (数字)9783031581816

ISBN: (纸本)9783031581809;9783031581816

Near-Infrared (NIR) images are widely used in a variety of low-light situations for security and safety applications. A colorised version of NIR images provide better image understanding and interpretation of features. Because the number of NIR-RGB paired datasets is limited and often unavailable, a method to convert a given NIR image to an RGB image is highly desirable. the present work proposes an unsupervised image to image translation technique for generating colorized images (UGCI) for transforming an input NIR image to an RGB image. UGCI outperforms present NIR-RGB colorizing models and have shown approximately 57% improvement in terms of Frechet inception distance (FID) with reduced training time and less memory usage. Finally, a thorough comparative study based on different datasets is carried out to confirm superiority over leading colorization approaches in qualitative and quantitative assessments.

关键词： near-infrared images colorization unsupervised learning

来源：评论

学校读者我要写书评

暂无评论

Generic Multispectral image Demosaicking Algorithm and New Performance Evaluation Metric 1

引用

6th international conference on computer vision and image processing, cvip 2021

作者： Rathi, Vishwas Goyal, Puneet Department of Computer Science and Engineering Indian Institute of Technology Ropar Punjab Rupnagar India

ISBN: (数字)9783031113468

ISBN: (纸本)9783031113451

Color image demosaicking is key in developing low-cost digital cameras using a color filter array(CFA). Similarly, multispectral image demosaicking can be used to develop low-cost and portable multispectral cameras using a multispectral filter array (MSFA). In this work, we propose a generic multispectral image demosaicking algorithm based on spatial and spectral correlation. We also propose a new image quality metric Average-Normalized-Multispectral-PSNR (ANMPSNR), which helps in easily comparing the relative performance of different demosaicking algorithms. In experimental results, we prove the efficacy of the proposed algorithm using two publicly available datasets as per different image quality metrics. © 2022, the Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Costs

来源：评论

学校读者我要写书评

暂无评论

Static global scheduling for optimal computer vision and image processing operations on distributed-memory multiprocessors 6th

引用

6th international conference on computer Analysis of images and Patterns, CAIP 1995

作者： Lee, Cheolwhan Wang, Yuan-Fang Yang, Tao Department of Computer Science University of California at Santa Barbara Santa BarbaraCA93106 United States

ISBN: (纸本)3540602682

In this paper, we develop a static global scheduling scheme for mapping computer vision and image procesaing (cvip) operations on distributed-memory multiprocessors. the scheduler operates on task graphs containing linear chains of tasks, loops, and data-dependent operations. the scheduler employs a shortest path algorithm to optimize the global parallel time, taking into consideration variations in task and resource parameters (such as the image size and number of processors used), and both the intra- and the inter-operation computation and communication costs. © Springer-Verlag Berlin Heidelberg 1995.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Parallel architectures for image processing

引用

ELECTRONICS & COMMUNICATION ENGINEERING JOURNAL 1998年第3期10卷 139-151页

作者： Downton, A Crookes, D Univ Essex Dept Elect Syst Engn Colchester CO4 3SQ Essex England Queens Univ Belfast Dept Comp Sci Belfast BT7 1NN Antrim North Ireland

image processing is often considered a good candidate for the application of parallel processing because of the large volumes of data and the complex algorithms commonly encountered. this paper presents a tutorial introduction to the field of parallel image processing. After introducing the classes of parallel processing a brief review of architectures for parallel image processing is presented. Software design for low-level image processing and parallelism in high-level image processing are discussed and an application of parallel processing to handwritten postcode recognition is described. the paper concludes with a look at future technology and market trends.

关键词： image processing image recognition Parallel architecture Digital signal processing chips software design Microprocessors and microcomputers parallel architectures handwritten postcode recognition Optical information, image and video signal processing digital signal processing chips optical character recognition computer vision and image processing techniques high-level image processing low-level image processing parallel processing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：