The growth in the volume of data generated, consumed, and stored, estimated to exceed 180 zettabytes in 2025, represents a major challenge both for organizations and for society in general. In addition to being larger, datasets are increasingly complex, bringing new theoretical and computational challenges. Alongside this evolution, data science tools have exploded in popularity over the past two decades thanks to their many applications to complex data, their high accuracy, flexible customization, and excellent adaptability. When it comes to images, data analysis presents additional challenges, because as the quality of an image increases, which is desirable, so does the volume of data to be processed. Although classic machine learning (ML) techniques are still widely used in different research fields and industries, the scientific community has shown great interest in the development of new artificial intelligence (AI) techniques. The resurgence of neural networks has driven remarkable advances in areas such as image understanding and processing. In this study, we conduct a comprehensive survey of advances in AI design and the optimization solutions proposed to deal with image processing challenges. Despite the good results that have been achieved, many challenges remain in this field of study. In this work, we discuss the main and most recent improvements, applications, and developments targeting image processing applications, and we propose future research directions in this field of constant and fast evolution.
Graph neural networks (GNNs) are neural models that use message passing between graph nodes to represent the dependencies within graphs. Variants of GNNs, such as graph recurrent networks (GRNs), graph attention networks (GATs), and graph convolutional networks (GCNs), have shown remarkable results on a variety of deep learning tasks in recent years. In this study, we offer a generic design pipeline for GNN models, review the variations of each component, classify the applications in an organized manner, and suggest four outstanding research issues. Many learning tasks require dealing with graph data, which provides rich connection information among its elements. A model that learns from graph inputs is required for modelling physics systems, learning molecular fingerprints, predicting protein interfaces, and classifying diseases. Reasoning over extracted structures (such as the dependency trees of sentences and the scene graphs of images) is an important research topic that also requires graph reasoning models, even in domains that learn from non-structural data such as texts and images. GNNs are primarily designed for graph-structured data, where relationships between entities are modeled as edges in a graph. While GNNs are not traditionally applied to image classification problems, researchers have explored ways to leverage graph-based structures to enhance the performance of convolutional neural networks (CNNs) in certain scenarios. GNNs have been increasingly applied to natural language processing (NLP) tasks, leveraging their ability to model structured data and capture relationships between elements in a graph. GNNs are also applied to traffic-related problems, particularly modeling and optimizing traffic flow, analyzing transportation networks, and addressing congestion; they can be used for traffic flow prediction, dynamic routing and navigation, anomaly detection, and public transport networks.
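As a minimal sketch of the message-passing step this abstract refers to, the NumPy snippet below has each node aggregate its neighbours' features and apply a shared transform. The mean aggregation, the ReLU, and all names (message_passing_layer, the toy graph) are illustrative assumptions, not tied to any specific GNN variant from the survey.

```python
import numpy as np

def message_passing_layer(node_feats, adj, weight):
    """One illustrative message-passing step: each node averages the
    features of its neighbours and applies a shared linear transform
    followed by a ReLU non-linearity."""
    deg = adj.sum(axis=1, keepdims=True)          # node degrees
    deg[deg == 0] = 1.0                           # avoid division by zero
    messages = (adj / deg) @ node_feats           # mean-aggregate neighbour features
    return np.maximum(messages @ weight, 0.0)     # transform + ReLU

# Toy graph: 4 nodes with 3-dimensional features.
rng = np.random.default_rng(0)
adj = np.array([[0, 1, 1, 0],
                [1, 0, 0, 1],
                [1, 0, 0, 1],
                [0, 1, 1, 0]], dtype=float)
x = rng.normal(size=(4, 3))
w = rng.normal(size=(3, 8))
h = message_passing_layer(x, adj, w)              # updated node embeddings, shape (4, 8)
```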
ISBN: (Print) 1577358872
The Forward-Forward (FF) algorithm has recently been proposed to alleviate the issues of backpropagation (BP) commonly used to train deep neural networks. However, its current formulation exhibits limitations such as the generation of negative data, slower convergence, and inadequate performance on complex tasks. In this paper, we take the main ideas of FF and improve them by leveraging channel-wise competitive learning in the context of convolutional neural networks for image classification tasks. A layer-wise loss function is introduced that promotes competitive learning and eliminates the need for negative data construction. To enhance both the learning of compositional features and feature space partitioning, a channel-wise feature separator and extractor block is proposed that complements the competitive learning process. Our method outperforms recent FF-based models on image classification tasks, achieving testing errors of 0.58%, 7.69%, 21.89%, and 48.77% on MNIST, Fashion-MNIST, CIFAR-10, and CIFAR-100, respectively. Our approach bridges the performance gap between FF learning and BP methods, indicating the potential of our proposed approach to learn useful representations in a layer-wise modular fashion, enabling more efficient and flexible learning. Our source code and supplementary material are available at https://***/andreaspapac/CwComp.
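The exact CwComp formulation is defined in the paper itself; as a rough illustration of the general idea of a layer-wise, channel-wise competitive loss that needs no negative data, the PyTorch sketch below groups channels into one block per class and lets the true class's block compete to have the highest "goodness". The grouping scheme and the name channel_competitive_loss are assumptions for illustration, not the authors' loss.

```python
import torch
import torch.nn.functional as F

def channel_competitive_loss(feature_map, labels, num_classes):
    """Illustrative layer-wise loss: channels are split into one group per
    class, per-group goodness is the mean squared activation, and a
    cross-entropy over the groups pushes the true class's channels to win."""
    b, c, h, w = feature_map.shape
    assert c % num_classes == 0, "channels must divide evenly into class groups"
    grouped = feature_map.view(b, num_classes, c // num_classes, h, w)
    goodness = grouped.pow(2).mean(dim=(2, 3, 4))     # (batch, num_classes)
    return F.cross_entropy(goodness, labels)

# Toy usage: a 40-channel feature map for a 10-class problem.
feats = torch.randn(8, 40, 16, 16, requires_grad=True)
labels = torch.randint(0, 10, (8,))
loss = channel_competitive_loss(feats, labels, num_classes=10)
loss.backward()   # gradient comes only from this local, layer-wise loss
```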
This article explores the advancements and applications of Convolutional neural networks (CNNs) in image classification, focusing on the CIFAR-10 dataset. Since their inception in 2006, CNNs have revolutionized comput...
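For readers unfamiliar with the setup, a minimal PyTorch CNN for CIFAR-10-sized inputs (32x32 RGB images, 10 classes) might look like the sketch below. The architecture is an illustrative toy, not one taken from the article.

```python
import torch
import torch.nn as nn

# Minimal CNN classifier for 32x32 RGB inputs such as CIFAR-10 (10 classes).
model = nn.Sequential(
    nn.Conv2d(3, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),   # -> 32 x 16 x 16
    nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),  # -> 64 x 8 x 8
    nn.Flatten(),
    nn.Linear(64 * 8 * 8, 128), nn.ReLU(),
    nn.Linear(128, 10),
)

logits = model(torch.randn(4, 3, 32, 32))   # (4, 10) class scores for a toy batch
```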
ISBN: (Print) 1577358872
Recently, patch-wise contrastive learning has been drawing attention for image translation by exploring the semantic correspondence between the input and output images. To further explore the patch-wise topology for high-level semantic understanding, here we exploit a graph neural network to capture topology-aware features. Specifically, we construct the graph based on patch-wise similarity from a pretrained encoder, whose adjacency matrix is shared to enhance the consistency of the patch-wise relation between the input and the output. We then obtain node features from the graph neural network and enhance the correspondence between nodes by increasing mutual information using the contrastive loss. In order to capture the hierarchical semantic structure, we further propose graph pooling. Experimental results demonstrate state-of-the-art results for image translation thanks to the semantic encoding provided by the constructed graphs.
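The paper defines its own graph construction; as a rough sketch of building a patch-wise adjacency matrix from pretrained-encoder features via cosine similarity, one might write the following. The k-nearest-neighbour rule and the helper name patch_similarity_adjacency are assumptions for illustration; the returned matrix would then be shared between the input and output patches, as the abstract describes.

```python
import torch
import torch.nn.functional as F

def patch_similarity_adjacency(patch_feats, k=4):
    """Build a symmetric k-nearest-neighbour adjacency matrix from patch
    features of shape (num_patches, dim) using cosine similarity."""
    feats = F.normalize(patch_feats, dim=1)
    sim = feats @ feats.t()                        # (N, N) cosine similarities
    sim.fill_diagonal_(-float("inf"))              # ignore self-similarity
    topk = sim.topk(k, dim=1).indices              # k most similar patches per patch
    adj = torch.zeros_like(sim)
    adj.scatter_(1, topk, 1.0)
    return ((adj + adj.t()) > 0).float()           # symmetrise the graph

# Toy usage: 64 patches with 256-dimensional features from a pretrained encoder.
adj = patch_similarity_adjacency(torch.randn(64, 256), k=4)
```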
With the continuous development of higher education in our country, the number of college students is increasing year by year. There are more and more campus accidents caused by college students' psychological pro...
This article presents an artificial intelligence model capable of identifying actions strongly related to trichotillomania, a psychiatric disorder that causes people to have a desire to pull their hair. The model was ...
ISBN: (Print) 9798350349405; 9798350349399
Ensuring the secure and dependable deployment of deep neural networks hinges on their ability to withstand distributional shifts and distortions. While data augmentation enhances robustness, its effectiveness varies across different types of data corruption. It tends to excel in cases where corruptions share perceptually similar traits or have a high-frequency nature. In response, one strategy is to encompass a broad spectrum of distortions. Yet, it is often impractical to incorporate every conceivable modification that images may undergo within the augmented data. Instead, we show that providing the model with a stronger inductive bias to learn the underlying concept of "change" offers a more reliable approach. To this end, we develop Virtual Fusion (VF), a technique that treats corruptions as virtual labels. Diverging from conventional augmentation, when an image undergoes any form of transformation, its label becomes linked with the specific name attributed to the distortion. The findings indicate that VF effectively enhances both clean accuracy and robustness against common corruptions. On previously unseen corruptions, it shows an 11.90% performance improvement and a 12.78% increase in accuracy. In similar corruption scenarios, it achieves a 7.83% performance gain and a significant accuracy improvement of 22.04% on robustness benchmarks.
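The abstract does not spell out VF's training pipeline; the sketch below only illustrates the stated idea of treating corruptions as virtual labels by attaching the distortion's name to the target whenever a transform is applied. The CORRUPTIONS table, the placeholder transforms, and the virtual_label_sample helper are illustrative assumptions, not the authors' implementation.

```python
import random

CLASSES = ["cat", "dog", "ship"]
CORRUPTIONS = {
    "gaussian_noise": lambda img: img,   # placeholder transforms for illustration
    "motion_blur":    lambda img: img,
    "fog":            lambda img: img,
}

def virtual_label_sample(img, class_name):
    """Pick a corruption at random, apply it, and attach the corruption's
    name to the target, so the model is explicitly told what kind of
    change the image underwent instead of treating it as a clean sample."""
    name, transform = random.choice(list(CORRUPTIONS.items()))
    corrupted = transform(img)
    # Joint target: original class index plus a virtual label for the corruption.
    target = (CLASSES.index(class_name), list(CORRUPTIONS).index(name))
    return corrupted, target

img = [[0.0] * 4] * 4                       # stand-in for an image tensor
x, y = virtual_label_sample(img, "cat")     # y == (class_id, corruption_id)
```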
ISBN: (Print) 1577358872
Structured pruning techniques have achieved great compression performance on convolutional neural networks for image classification tasks. However, the majority of existing methods are sensitive to the model parameters, and their pruning results may be unsatisfactory when the original model is trained poorly. That is, they need the original model to be fully trained to obtain useful weight information. This is time-consuming and makes the effectiveness of the pruning results dependent on the degree of model optimization. To address this issue, we propose a novel metric named Average Filter Information Entropy (AFIE). It decomposes the weight matrix of each layer into a low-rank space and quantifies the filter importance based on the distribution of the normalized eigenvalues. Intuitively, the eigenvalues capture the covariance among filters and therefore can be a good guide for pruning. Since the distribution of eigenvalues is robust to the updating of parameters, AFIE can yield a stable evaluation of the importance of each filter regardless of whether the original model is fully trained. We implement our AFIE-based pruning method for three popular CNN models, AlexNet, VGG-16, and ResNet-50, and test them on three widely used image datasets, MNIST, CIFAR-10, and ImageNet, respectively. The experimental results are encouraging. We surprisingly observe that, for our method, even when the original model is trained for only one epoch, the AFIE score of each filter remains identical to the result obtained when the model is fully trained. This clearly indicates the effectiveness of the proposed pruning method.
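The abstract does not give the exact decomposition or the per-filter scoring rule, so the NumPy sketch below only illustrates the flavor of the metric: take the singular spectrum of a layer's flattened weight tensor, normalize it into a distribution, and compute its entropy. The function name and the layer-level (rather than per-filter) score are assumptions for illustration, not the paper's definition of AFIE.

```python
import numpy as np

def average_filter_information_entropy(weight):
    """Rough AFIE-style score for one convolutional layer: flatten the
    (out_channels, in_channels, kH, kW) weight tensor to a 2-D matrix,
    take its singular values, normalise them into a distribution, and
    return the entropy of that distribution."""
    w = weight.reshape(weight.shape[0], -1)            # (filters, fan-in)
    singular_values = np.linalg.svd(w, compute_uv=False)
    p = singular_values / singular_values.sum()        # normalised spectrum
    p = p[p > 0]                                       # drop zeros before the log
    return float(-(p * np.log(p)).sum())

# Toy layer: 64 filters of shape 3x3 over 32 input channels.
score = average_filter_information_entropy(np.random.randn(64, 32, 3, 3))
```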
Digital security in modern systems very often relies on biometrics, and new implementations keep appearing. Such applications can be found everywhere; even when picking up a package from a courier, we certify its receipt through our signature on a tablet. However, verification of this form is not one of the simplest elements of information processing systems. Given the different sizes, angles, or writing conditions that may affect its stability, new methods to evaluate signatures are constantly needed. In this article, we propose the use of spline interpolation and two types of artificial neural networks to verify the identity of a person based on selected local and global features extracted from the image of a signature. Global features are extracted using interpolation and graphic processing methods, while local features are verified using convolutional neural networks. Both sets of features are used in the identity verification process. The article presents the model of operation together with experiments taking into account various parameters of the proposed extraction. We reached an accuracy of 87.7% on the SVC2004 database.
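As a hedged illustration of the kind of spline-based preprocessing the abstract mentions, the SciPy sketch below fits a parametric spline through raw pen coordinates and resamples the signature at a fixed number of points, giving a normalised trajectory from which global features could be computed. The helper name resample_signature and the toy stroke are assumptions, not the authors' exact pipeline.

```python
import numpy as np
from scipy.interpolate import splprep, splev

def resample_signature(x, y, num_points=200):
    """Fit a parametric B-spline through the raw pen coordinates of a
    signature and resample it at a fixed number of points, producing a
    normalised representation for global-feature extraction."""
    tck, _ = splprep([x, y], s=0)                  # spline through the stroke
    u = np.linspace(0.0, 1.0, num_points)
    xi, yi = splev(u, tck)
    return np.stack([xi, yi], axis=1)              # (num_points, 2)

# Toy stroke: a noisy arc standing in for captured pen coordinates.
t = np.linspace(0, np.pi, 40)
pts = resample_signature(np.cos(t) + 0.01 * np.random.randn(40),
                         np.sin(t) + 0.01 * np.random.randn(40))
```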