检索结果-内蒙古大学图书馆

IEEE/CVF Winter Conference on applications of Computer vision (WACV)

作者： Yan, Zizheng Wu, Yushuang Qin, Yipeng Han, Xiaoguang Cui, Shuguang Li, Guanbin CUHKSZ FNii Shenzhen Peoples R China CUHKSZ SSE Shenzhen Peoples R China Cardiff Univ Cardiff Wales Sun Yat Sen Univ Guangzhou Peoples R China Sun Yat Sen Univ Res Inst Shenzhen Peoples R China

ISBN: (纸本)9798350318920;9798350318937

In this paper, we introduce a realistic and challenging domain adaptation problem called Universal Semi-supervised Model Adaptation (USMA), which i) requires only a pre-trained source model, ii) allows the source and target domain to have different label sets, i.e., they share a common label set and hold their own private label set, and iii) requires only a few labeled samples in each class of the target domain. To address USMA, we propose a collaborative consistency training framework that regularizes the prediction consistency between two models, i.e., a pretrained source model and its variant pre-trained with target data only, and combines their complementary strengths to learn a more powerful model. The rationale of our framework stems from the observation that the source model performs better on common categories than the target-only model, while on target-private categories, the target-only model performs better. We also propose a two-perspective, i.e., sample-wise and class-wise, consistency regularization to improve the training. Experimental results demonstrate the effectiveness of our method on several benchmark datasets.

关键词： Algorithms Algorithms and algorithms formulations image recognition and understanding machine learning architectures

来源：评论

学校读者我要写书评

暂无评论

Analysis of Impact of image Restoration and Segmentation on Classification Model 7

Analysis of Impact of Image Restoration and Segmentation on ...

引用

2023 7th International Conference On Computing, Communication, Control And Automation, ICCUBEA 2023

作者： Vispute, Sushma Rahul Rajeswari, K. Nema, Aryan Jagtap, Arya Kulkarni, Mrugendra Mohite, Pranav PCCOE Department of Computer Engineering Pune India

ISBN: (纸本)9798350304268

A widely studied problem in computer science is the restoration, segmentation, and classification of images, which involves image processing, computer vision, and machine learning techniques. Deep learning has made significant contributions to this field, bringing machine learning closer to artificial intelligence. Deep learning has a broad range of applications, including in surveillance, healthcare, medicine, and sports. Convolutional neural networks (CNN), a combination of artificial neural networks (ANN) and deep learning techniques, have made incredible advancements in deep learning. This survey compares methods for restoring noisy images, such as wiener filter, wavelet method, and wiener filtering with BM3D technique, using Gaussian blurring and noising methods. The survey also examines the RGB colour model and YCbCr colour model for image segmentation. image classification is studied using CNN, where the survey discusses various parameters of convolutional neural networks, including activation functions and pooling methods. © 2023 IEEE.

关键词： computer vision convolutional neural network face detection face recognition image classification image restoration noise reduction segmentation

来源：评论

学校读者我要写书评

暂无评论

Proceedings - 2024 International Conference on Advances in Electrical Engineering and Computer applications, AEECA 2024

Proceedings - 2024 International Conference on Advances in E...

引用

5th International Conference on Advances in Electrical Engineering and Computer applications, AEECA 2024

ISBN: (纸本)9798350355253

The proceedings contain 127 papers. The topics discussed include: Advanced data storage and processing technologies in a next-generation electric information acquisition system;analyzing file access characteristics for deep learning workloads on mobile devices;optimal scheduling of distributed energy storage for electric vehicles based on evolutionary dissipation theory;a novel semi-supervised learning approach for referring expression comprehension;research and implementation of material image subject segmentation method based on machine vision;application of image recognition and 3D reconstruction technology in virtual museum system;knowledge graph technology-based active research and judgment technology for electric power customer complaint risk;and path planning for unmanned underwater vehicles based on improved ant colony algorithm.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Automated Fundus image Standardization Using a Dynamic Global Foreground Threshold Algorithm 8

Automated Fundus Image Standardization Using a Dynamic Globa...

引用

8th International Conference on image, vision and Computing, ICIVC 2023

作者： Kiefer, Riley Abid, Muhammad Ardali, Mahsa Raeisi Steen, Jessica Amjadian, Ehsan Florida Polytechnic University Department of Computer Science LakelandFL United States College of Optometry Nova Southeastern University Fort LauderdaleFL United States Cheriton School of Computer Science University of Waterloo WaterlooON Canada

ISBN: (纸本)9798350335231

A generic fundus foreground extractor is required for the standardization of fundus datasets in machine-learning applications due to the vast range of retinal fundus images. Some fundus images have a large amount of non-essential background data and others have missing data because of clipping. To standardize these varied images for machine learning applications while preserving the aspect resolution, a generalized threshold algorithm is needed to separate the foreground and background. Existing threshold algorithms fail to segment images with low contrast. There is a need for a generalized algorithm to handle varied image conditions in a dynamic manner. The proposed segmentation algorithm uses shifts in histogram frequency using intensity extrema to find the ideal threshold value. The proposed post-processing algorithm crops, pads, and resizes the image to a standardized size of 512x512 pixels using the segmentation map output. To demonstrate the effectiveness of this proposed standardization approach on downstream tasks, an ablation experiment of popular standardization strategies is evaluated on a newly proposed benchmark dataset, EyePACS-light. The experimental results demonstrate the benefits of using this standardization approach for resizing fundus images. © 2023 IEEE.

关键词： Standardization

来源：评论

学校读者我要写书评

暂无评论

Learning from Deep Stereoscopic Attention for Simulator Sickness Prediction

引用

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2023年第2期29卷 1415-1423页

作者： Du, Minghan Cui, Hui Wang, Yuan Duh, Henry Been-Lirn La Trobe Univ Dept Comp Sci & Informat Technol Bundoora Vic 3086 Australia

Simulator sickness induced by 360 & DEG;stereoscopic video contents is a prolonged challenging issue in Virtual Reality (VR) system. Current machine learning models for simulator sickness prediction ignore the underlying interdependencies and correlations across multiple visual features which may lead to simulator sickness. We propose a model for sickness prediction by automatic learning and adaptive integrating multi-level mappings from stereoscopic video features to simulator sickness scores. Firstly, saliency, optical flow and disparity features are extracted from videos to reflect the factors causing simulator sickness, including human attention area, motion velocity and depth information. Then, these features are embedded and fed into a 3-dimensional convolutional neural network (3D CNN) to extract the underlying multi-level knowledge which includes low-level and higher-order visual concepts, and global image descriptor. Finally, an attentional mechanism is exploited to adaptively fuse multi-level information with attentional weights for sickness score estimation. The proposed model is trained by an end-to-end approach and validated over a public dataset. Comparison results with state-of-the-art models and ablation studies demonstrated improved performance in terms of Root Mean Square Error (RMSE) and Pearson Linear Correlation Coefficient.

关键词： Stereoscopic video simulator sickness virtual reality attention mechanism 3D CNN I.4.9 [image processing and computer vision] applications H.5.1 [information interfaces and presentation] multimedia information systems

来源：评论

学校读者我要写书评

暂无评论

Development Of An image Restoration Algorithm Utilizing Generative Adversarial Networks (GAN’s) For Enhanced Performance In Engineering applications: A Comprehensive Approach To Improving image Quality And Clarity Through Advanced machine Learning Techniques

Development Of An Image Restoration Algorithm Utilizing Gene...

引用

2024 IEEE International Conference on Innovation and Novelty in Engineering and Technology, INNOVA 2024

作者： Manjunath, T.C. Pavithra, G. Samyama Gunjal, G.H. Ninawe, Swapnil S. Dept. of Electronics & Communication Engineering Rajrajeswari College of Engineering Bangalore India Department of Electronics & Communication Engineering Dayananda Sagar College of Engineering Karnataka Bangalore India Department of Computer Science & Engineering University Visvesvaraya College of Engineering Karnataka Bangalore India

ISBN: (纸本)9798331505134

image restoration, a critical task in computer vision and image processing, focuses on recovering degraded or damaged images to their original, high-quality state. This paper introduces an innovative approach to image restoration using Generative Adversarial Networks (GANs). GANs, a prominent deep learning framework, consist of two neural networks—a generator and a discriminator—that compete to produce and evaluate realistic images. The generator creates images, while the discriminator distinguishes between real and generated ones, refining the generator's capability through adversarial training. Leveraging GANs' ability to learn complex image features, the proposed algorithm restores degraded images affected by noise, blur, and low resolution, producing high-quality, realistic results. Simulation outcomes demonstrate significant advancements in image restoration, showcasing GANs as a powerful tool for addressing challenges in this domain. The study underscores the potential of GANs in generating visually appealing restorations and advancing the state-of-the-art in image processing and restoration tasks. © 2024 IEEE.

关键词： Generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

Research on the Application of Generative Adversarial Networks in Aerial image Generation

Research on the Application of Generative Adversarial Networ...

引用

International Conference on image processing, Computer vision and machine Learning (ICICML)

作者： Cai, H. X. Zhu, X. Y. Wen, P. C. Bai, L. T. Li, R. Q. Han, W. AVIC Xian Aeronaut Comp Tech Res Inst Xian Peoples R China

ISBN: (纸本)9781665464680

Computer vision is one of the important areas and directions of deep learning research, which requires different approaches to be chosen for different fields due to the complexity and diversity of vision tasks. In the field of aviation, the existing image resources are still far from the real needs due to the influence and constraints of realistic scenes and difficulties of image acquisition. More detailed and comprehensive images can better provide reliable technical support and basis for applications, and then make more accurate decisions on problems, which requires generating more effective images to expand the data. Generative Adversarial Networks (GAN) are the fastest growing and most effective generation method in recent years, so this experiment investigates the application of GAN on aviation data, taking images of airplanes, cars and ships as examples to conduct quantitative research. The effect on the effect of GAN is studied from the perspective of image size, number of images, number of iterations, and different categories of images, in order to obtain better parameter settings for generating effective images, which provides a theoretical and experimental basis for the subsequent application of GAN in the aviation field to generate more images with similar characteristics and solve the problem of insufficient data.

关键词： Generative Adversarial Networks image generation aerial applications quantitative research

来源：评论

学校读者我要写书评

暂无评论

Digital Information image Recognition and Acquisition System Based on Artificial Intelligence Technology

Digital Information Image Recognition and Acquisition System...

引用

作者： Gao, Qi Bai, Jinniu Department of Computer Science and Technology Baotou Medical College Inner Mongolia Baotou014040 China

As a very important branch of computer science and engineering, graphics, and image processing is a research topic of capturing, storing, and manipulating information from reflected electromagnetic waves from objects or scenes. Graphics and image processing technology has a wide range of applications in many important fields such as satellite navigation, military applications, machine vision, and Internet search. The main purpose of this paper is to study the acquisition system of digital information and image recognition based on artificial intelligence technology. This paper mainly designs and completes the multi-channel data acquisition system based on FPGA, including system hardware design, system digital logic design, and system measurement error calibration. Experiments show that referring to the data sheet of OV5640, it can reach 90 fps for transmitting pictures of 640 × 480 size. The hardware processing scheme actually takes 16,135 us to transmit a frame of pictures, and the actual frame rate is 61.98 fps, which is more than 6 times higher than that of the software scheme. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

关键词： image recognition

来源：评论

学校读者我要写书评

暂无评论

Revolutionizing image Recognition: Next-Generation CNN Architectures for Handwritten Digits and Objects 8

Revolutionizing Image Recognition: Next-Generation CNN Archi...

引用

8th IEEE Symposium on Wireless Technology and applications, ISWTA 2024

作者： Absur, Md Nurul Nasif, Kazi Fahim Ahmad Saha, Sourya Nova, Sifat Nawrin Department of Computer Science City University of New York New York United States College of Computing and Software Engineering Kennesaw State University GA United States Department of Computer Science Chalmers University of Technology Gothenburg Sweden

ISBN: (纸本)9798350351354

This study addresses the pressing need for computer systems to interpret digital media images with a level of sophistication comparable to human visual perception. By leveraging Convolutional Neural Networks (CNNs), we introduce two innovative architectures tailored to distinct datasets: the MNIST handwritten digit dataset and the Fashion MNIST dataset. Unlike traditional machine learning methods such as Support Vector machines (SVM) and Random Forests, our customized CNN models remarkably enhance image attribute comprehension and recognition accuracy. Specifically, the model developed for the MNIST dataset achieved an unprecedented accuracy of 98.71% without any bias, while the Fashion MNIST model reached 91.39%, marking significant advancements over conventional algorithms without any bias. This research showcases the superior efficiency of CNNs in processing and understanding digital images. It underscores the potential of deep learning technologies in bridging the gap between computational systems and human-like visual recognition. Through meticulous experimentation and analysis, we illustrate how deep CNNs require less preparatory work than other image-processing algorithms, setting a new benchmark in computer vision. © 2024 IEEE.

关键词： Support vector machines

来源：评论

学校读者我要写书评

暂无评论

MAPHIS-Measuring arthropod phenotypes using hierarchical image segmentations

引用

METHODS IN ECOLOGY AND EVOLUTION 2024年第1期15卷 36-42页

作者： Mraz, Radoslav Stepka, Karel Pekar, Matej Matula, Petr Pekar, Stano Masaryk Univ Fac Informat Dept Visual Comp Brno Czech Republic Masaryk Univ Fac Sci Dept Bot & Zool Brno Czech Republic

1. Animal phenotypic traits are utilised in a variety of studies. Often the traits are measured from images. The processing of a large number of images can be challenging;nevertheless, image analytical applications, based on neural networks, can be an effective tool in automatic trait collection.2. Our aim was to develop a stand-alone application to effectively segment an arthropod from an image and to recognise individual body parts: namely, head, thorax (or prosoma), abdomen and four pairs of appendages. It is based on convolutional neural network with U-Net architecture trained on more than a thousand images showing dorsal views of arthropods (mainly of wingless insects and spiders). The segmentation model gave very good results, with the automatically generated segmentation masks usually requiring only slight manual adjustments.3. The application, named MAPHIS, can further (1) organise and preprocess the images;(2) adjust segmentation masks using a simple graphical editor;and (3) calculate various size, shape, colouration and pattern measures for each body part organised in a hierarchical manner. In addition, a special plug-in function can align body profiles of selected individuals to match a median profile and enable comparison among groups. The usability of the application is shown in three practical examples.4. The application can be used in a variety of fields where measures of phenotypic diversity are required, such as taxonomy, ecology and evolution (e.g. mimetic similarity). Currently, the application is limited to arthropods, but it can be easily extended to other animal taxa.

关键词： arachnids arthropods convolutional neural networks hierarchical segmentation image analysis insects machine vision morphological traits

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：