Search Results - Inner Mongolia University Library

Morlet wavelet transformation based deep similarity structured neural learning for image quality assessment

Multimedia Tools and Applications, 2025, vol. 84, no. 16, pp. 16789-16807

Authors: Balakrishnan, N.; Robinson, Y. Harold. Affiliations: Department of MCA, Sona College of Technology, Salem, India; Department of Computer Science and Engineering, Francis Xavier Engineering College, Tirunelveli, India

Quality assessment is a key problem in image processing. A few research works have analyzed image quality using different techniques, but the accuracy of the assessment was not sufficient. A novel method, Morlet Wavelet Transformation Based Deep Similarity Structured Neural Learning (MWT-DSSNL), is proposed to enhance the performance of image quality assessment with minimal peak signal-to-noise ratio. The MWT-DSSNL method is based on artificial neural networks with representation learning. It first takes a number of images from a given dataset as input and uses multiple layers to extract higher-level features from them. It is a feed-forward network in which each layer takes the output of the previous layer as input, and its design is modeled on the biological neural network of the human brain. In contrast to existing methods, the MWT-DSSNL method uses multiple hidden layers and the Morlet wavelet transformation to analyze input images in depth and extract features such as luminance, contrast, and structure with minimal time consumption. Using the extracted features, the method then determines structural similarity and thereby accurately identifies the quality of the input images in less time. As a result, the MWT-DSSNL method achieves better image quality assessment performance than existing works. The simulation is evaluated on PSNR, average processing time, quality detection accuracy, and false positive rate with different numbers of input images. The simulation results show that the MWT-DSSNL method increases accuracy and reduces the time of image quality assessment compared with conventional works. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024

Keywords: Network layers
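
The abstract names PSNR and structural similarity (built from luminance, contrast, and structure) without giving formulas. As a rough illustration only, the sketch below computes PSNR and the standard SSIM index over a single window that covers the whole image. It is not the MWT-DSSNL method: the function names, the flat pixel lists, the population (divide-by-n) statistics, and the constants `0.01` and `0.03` are assumptions taken from the commonly used SSIM definition, not from the paper.

```python
import math


def psnr(ref, test, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    mse = sum((a - b) ** 2 for a, b in zip(ref, test)) / len(ref)
    if mse == 0:
        return math.inf  # identical images have no noise
    return 10.0 * math.log10(max_value ** 2 / mse)


def global_ssim(x, y, max_value=255.0):
    """SSIM computed once over the whole image (no sliding window)."""
    n = len(x)
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    # Population variance and covariance (assumption: divide by n).
    var_x = sum((a - mu_x) * (a - mu_x) for a in x) / n
    var_y = sum((b - mu_y) * (b - mu_y) for b in y) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    # Small stabilizing constants from the usual SSIM definition.
    c1 = (0.01 * max_value) ** 2
    c2 = (0.03 * max_value) ** 2
    num = (2 * mu_x * mu_y + c1) * (2 * cov + c2)
    den = (mu_x * mu_x + mu_y * mu_y + c1) * (var_x + var_y + c2)
    return num / den
```

Practical SSIM implementations usually average the index over many local windows; the single-window form is kept here only to make the luminance, contrast, and structure terms easy to see.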

Innovation and Improvement of Traditional Chinese Medicine Preparation Technology Combined with Artificial Intelligence

2nd IEEE International Conference on Image Processing and Computer Applications, ICIPCA 2024

Authors: Chu, Hang; Li, Qiuju. Affiliations: School of Biomedicine, Beijing City University, Beijing, China; Heilongjiang Agricultural Engineering Vocational College, Heilongjiang, Harbin, China

ISBN: 9798350360240 (print)

Traditional Chinese medicine preparation techniques have limitations in terms of efficiency and quality, which seriously restrict the development of the Chinese medicine industry. This article adopts an intelligent preparation method for traditional Chinese medicine compound formulations based on artificial neural networks and genetic algorithms, and studies the formulation and preparation process of compound formulations. By constructing a sensor network, key technical parameters during the processing of traditional Chinese medicine are monitored in real-time, achieving real-time monitoring and adjustment of parameters, and connecting various links in the monitoring and adjustment process. Cost analysis can be conducted on the production costs of traditional preparation and artificial intelligence technology preparation, including considerations of raw material costs, labor costs, energy consumption, and other aspects. The preparation method of artificial intelligence technology has also reduced energy consumption, from 800 yuan/batch to 600 yuan/batch. Combining traditional Chinese medicine preparation techniques with artificial intelligence can help reduce the cost of traditional Chinese medicine preparation. © 2024 IEEE.

Keywords: Cost benefit analysis
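
The only figures the abstract reports are the energy costs of 800 and 600 yuan per batch. A minimal worked calculation of the relative saving they imply is sketched below; the helper name `percent_reduction` is hypothetical and nothing here comes from the paper's own cost model.

```python
def percent_reduction(before, after):
    """Relative reduction from `before` to `after`, in percent."""
    return (before - after) / before * 100.0


# Energy cost per batch quoted in the abstract (yuan).
energy_saving = percent_reduction(800.0, 600.0)
print(f"{energy_saving:.1f}%")  # prints 25.0%
```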

PPLNs: Parametric Piecewise Linear Networks for Event-Based Temporal Modeling and Beyond

38th Conference on Neural Information Processing Systems, NeurIPS 2024

Authors: Song, Chen; Liang, Zhenxiao; Sun, Bo; Huang, Qixing. Affiliations: Department of Computer Science, The University of Texas at Austin, Austin, TX 78712, United States

We present Parametric Piecewise Linear Networks (PPLNs) for temporal vision inference. Motivated by the neuromorphic principles that regulate biological neural behaviors, PPLNs are ideal for processing data captured by event cameras, which are built to simulate neural activities in the human retina. We discuss how to represent the membrane potential of an artificial neuron by a parametric piecewise linear function with learnable coefficients. This design echoes the idea of building deep models from learnable parametric functions recently popularized by Kolmogorov-Arnold Networks (KANs). Experiments demonstrate the state-of-the-art performance of PPLNs in event-based and image-based vision applications, including steering prediction, human pose estimation, and motion deblurring. The source code of our implementation is available at https://***/chensong1995/PPLN. © 2024 Neural Information Processing Systems Foundation. All rights reserved.
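
To make the phrase "parametric piecewise linear function" concrete, here is a small sketch of evaluating such a function of time from per-segment slopes and intercepts. It is an illustration under stated assumptions, not the authors' implementation: the function name, the knot layout, and the fixed float coefficients (which a real model would learn) are all hypothetical.

```python
from bisect import bisect_right


def piecewise_linear(t, knots, slopes, intercepts):
    """Evaluate slopes[i] * t + intercepts[i] on the segment containing t.

    Segment i covers [knots[i], knots[i + 1]); inputs outside the knot
    range reuse the first or last segment.
    """
    i = bisect_right(knots, t) - 1
    i = max(0, min(i, len(slopes) - 1))
    return slopes[i] * t + intercepts[i]


# Two segments: rising on [0, 1), falling on [1, 2].
knots = [0.0, 1.0, 2.0]
slopes = [1.0, -1.0]        # hypothetical coefficients, fixed by hand here
intercepts = [0.0, 2.0]
potential = [piecewise_linear(t, knots, slopes, intercepts) for t in (0.5, 1.0, 1.5)]
```

With these hand-picked coefficients the function is continuous at the knot `t = 1.0`; nothing in the sketch enforces continuity in general.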

TargetSR: Towards Semantic Location Real-World Image Super-Resolution with Diffusion Prior

International Joint Conference on Neural Networks (IJCNN)

Authors: Wu, Shouhao; Zhang, Junjie. Affiliations: Wuhan Text Univ Sch Comp Sci & Artificial Intelligence Wuhan Peoples R China Hubei Prov Engn Res Ctr Intelligent Text & Fash Engn Res Ctr Hubei Prov Clothing Informat Wuhan Peoples R China

ISBN: 9798350359329 (print); 9798350359312

In Real-World super-resolution, the intricate degradation of images can result in artifacts or textures that are not sufficiently rich in the generated high-resolution image. Several studies have progressively embraced Stable Diffusion as a prior for generating more textured outcomes. Nevertheless, the diffusion model faces challenges in preserving the fidelity of the image during denoising. To tackle this issue, we introduce TargetSR. This framework enhances the network's ability to recognize and localize objects in Real-World degraded images, resulting in the generation of high-resolution images with both reasonable and rich textures. This contributes to improving the semantic and visual fidelity of the images. Our Object-CLIP module identifies and locates objects in an image, retrieves corresponding text, and integrates the encoded text and object location information into the denoising network. This integration enhances the network's effective utilization of information from text features. During the preparation stage, we employ correction-recovery processing to manipulate the degraded images. The correction of degradation enhances the network's capacity to address various types of degradation in the real world, while recovery contributes to enhancing the fidelity of the image during the denoising stage of image generation. Experimental results demonstrate the enhanced effectiveness of our approach in leveraging the textual features of an image to generate textures that are more reasonable and rich.

Keywords: Super-Resolution; Stable Diffusion; ControlNet; Latent Space

Hand-written Digit Recognition using Convolutional Neural Network in Python with Tensorflow

5th IEEE International Conference on Recent Trends in Computer Science and Technology, ICRTCST 2024

Authors: Anand, Prakash; Ranjan, Piyush; Srivastava, Priyanka. Affiliations: Jharkhand Rai University, Department of Computer Science, Ranchi, India; Sarala Birla University, Department of Computer Science, Ranchi, India

ISBN: 9798350351378 (print)

Recently, deep learning has transformed machine learning by significantly enhancing its artificial intelligence, as artificial neural networks (ANN) have become increasingly prevalent. Due to their extensive range of applications in fields such as intelligence, healthcare, medicine, athletics, and robotics, machine learning algorithms are widely used across industries. At the core of the advancements in deep learning are convolutional neural networks (CNN), which integrate artificial neural networks (ANN) and contemporary deep learning algorithms. CNNs have been used across a variety of applications, including pattern classification, phrase classification, voice recognition, image identification, text summarization, document analysis, scene recognition, and handwritten digit recognition. Our research objective is to develop a model that can accurately compare images and identify handwritten numbers. We used the Modified National Institute of Standards and Technology (MNIST) dataset to conduct our CNN experiment. © 2024 IEEE.

Keywords: Convolutional neural networks
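
The abstract does not describe the network layers, so the sketch below only illustrates the core operation a CNN applies to a digit image: sliding a small kernel over the pixels and passing the result through a ReLU. It is deliberately library-free rather than written with TensorFlow, and the function names, the stride of 1, and the absence of padding are assumptions for illustration.

```python
def conv2d_valid(image, kernel):
    """Cross-correlate a 2D list with a kernel (no padding, stride 1)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


def relu(feature_map):
    """Clamp negative activations to zero, element by element."""
    return [[max(0, v) for v in row] for row in feature_map]


image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
summed = conv2d_valid(image, [[1, 1], [1, 1]])   # 2x2 box filter
edges = relu(conv2d_valid(image, [[1, 0], [0, -1]]))
```

In a trained CNN the kernel values are learned from data such as MNIST; here they are fixed by hand.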

Convolutional Neural Network Image Classification Method Integrating Classic Image Features

9th IEEE International Conference on Computational Intelligence and Applications, ICCIA 2024

Authors: Hao, Qi; Huang, Jie; Wang, Bin; Zhou, Feng. Affiliations: Cyber Science and Engineering, Southeast University, Nanjing, China; Zhejiang Key Laboratory of Artificial Intelligence of Things Network and Data Security, Hangzhou, China

ISBN: 9798350352214 (print)

Classic image features were once widely used in image classification but have been almost entirely replaced by neural networks today. While the performance of neural networks, especially convolutional neural networks (CNNs), is indisputable, their lack of interpretability has become a significant limitation in recent years. This paper explores the effective combination of classic image features and convolutional image features, investigating their differences and similarities. Through a series of experiments, a method is proposed to effectively integrate these two types of features by employing a multiplicative attention mechanism. This approach combines standardized MPEG-7 descriptors with convolutional features before feeding them into the fully connected layer for classification. The final model demonstrates an improvement in classification accuracy and indicates the potential of learning traditional image features from the integrated features. © 2024 IEEE.

Keywords: Convolutional neural networks

Research on Image Segmentation Methods of Highway Pavement Distress based on Semantic Segmentation Convolutional Neural Network

9th International Conference on Signal and Image Processing (ICSIP)

Authors: Shao, Yongjun; Zhang, Ziyi; Wang, Xingang; Zhao, Chihang; Zheng, Youfeng; Ma, Xinyi; Deng, Wenhao; Huang, Yaxin. Affiliations: Shaanxi Expressway Engn Testing & Testing Co Ltd, Xian, Peoples R China; Southeast Univ, Sch Transportat, Nanjing, Peoples R China

ISBN: 9798350350920 (print)

To address the problem that the image segmentation accuracy of highway pavement distress is easily affected by complex texture, noisy background, uneven illumination, and external environmental interference, this paper studies image segmentation methods for highway pavement distress based on semantic segmentation convolutional neural networks (CNN). First, image segmentation methods for highway pavement distress based on FCN-DenseNet, DeepLabv3+, and MobileNet are compared and analyzed. Second, four CNN model variants are investigated for the image segmentation of highway pavement distress: FCN-DenseNet121 for Pavement Distress Segmentation (FCN-D121-PDS), DeepLabv3-DRN for Pavement Distress Segmentation (DL-D-PDS), DeepLabv3-MobilenetV3 for Pavement Distress Segmentation (DL-M-PDS), and DeepLabv3-Mobilenet1 for Pavement Distress Segmentation (DL-M1-PDS). Finally, comparative experiments were conducted, and the results showed that the average performance of the DL-M1-PDS network is superior to that of the other three methods, with an image segmentation accuracy of 98.20%.

Keywords: Highway Pavement Distress; Image Segmentation; Convolutional Neural Networks (CNN); Semantic Segmentation
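
The abstract reports a segmentation accuracy of 98.20% but does not say how it is computed. The sketch below shows two metrics commonly used for segmentation masks, pixel accuracy and intersection over union (IoU), as one possible reading. The function names, the nested-list mask format, and the use of label `1` for distress pixels are assumptions, not details from the paper.

```python
def pixel_accuracy(pred, truth):
    """Fraction of pixels whose predicted label matches the ground truth."""
    total = correct = 0
    for p_row, t_row in zip(pred, truth):
        for p, t in zip(p_row, t_row):
            total += 1
            correct += (p == t)
    return correct / total


def iou(pred, truth, label=1):
    """Intersection over union for one class label."""
    inter = union = 0
    for p_row, t_row in zip(pred, truth):
        for p, t in zip(p_row, t_row):
            inter += (p == label and t == label)
            union += (p == label or t == label)
    # Convention chosen here: an absent class counts as a perfect match.
    return inter / union if union else 1.0


pred = [[1, 0],
        [1, 1]]
truth = [[1, 0],
         [0, 1]]
acc = pixel_accuracy(pred, truth)
overlap = iou(pred, truth)
```

Pixel accuracy can look high when distress covers only a small part of the image, which is why IoU is often reported alongside it.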

Touching the limit of Rolling Multilayer Perceptron for efficient two-dimensional medical image segmentation

Engineering Applications of Artificial Intelligence, 2025, vol. 153

Authors: Liu, Yutong; Zhu, Haijiang; An, Ning; Xu, Li. Affiliations: Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing 100029, Peoples R China; China Coal Res Inst, Res Inst Mine Artificial Intelligence, Beijing 100013, Peoples R China; State Key Lab Intelligent Coal Min & Strata Contro, Beijing 100013, Peoples R China; Capital Med Univ, Beijing Jishuitan Hosp, Dept Radiol, Beijing 100035, Peoples R China

Significant progress has been made in medical image segmentation using deep learning techniques, with the U-shaped architecture being a classic choice. However, effectively capturing and integrating both local features and remote dependencies remains a key challenge for improving deep learning-based segmentation methods. In this paper, we propose a flexible Rolling Multilayer Perceptron (Rolling-MLP) module to address this issue. Building upon this concept, we present the Rolling-Unet network, which combines the strengths of Multilayer Perceptrons (MLPs) with Convolutional Neural Networks (CNNs) to efficiently extract and fuse local features and remote dependencies. Furthermore, to explore the potential of Rolling-MLP for two-dimensional medical image segmentation, we propose Rolling-MLP configurations with distinct receptive field shapes (linear and area-shaped) and summarize the influence of Rolling-MLP's key parameters on the shape of receptive fields. We conducted extensive experiments on four datasets, surpassing a variety of state-of-the-art methods in accuracy. Moreover, Rolling-MLP is far ahead in Central Processing Unit (CPU) inference speed, indicating its potential in medical cyber-physical systems engineering applications. This paper demonstrates the strong comprehensive ability of Rolling-MLP in two-dimensional medical image segmentation tasks, providing a novel approach for constructing medical image segmentation networks, alternative to CNNs and Transformers.

Keywords: Cyber-physical systems engineering application; Medical image segmentation; Multilayer perceptron; Remote dependency

Enhancing Image Captioning Accuracy through Hybrid Deep Learning Models

1st IEEE International Conference on Advances in Computing, Communication and Networking, ICAC2N 2024

Authors: Kaushik, Priyanka; Rameshchandra, Patel Saileshchandra; Kol, Mitali; Narayan, Ritushree; Gupta, Abhishek Kumar; Rajalakshmi, R. Affiliations: Chandigarh University, Dept. of Comp. Science Engineering, Chandigarh, India; Madhav University, Dept. of Humanities and S. Science, Pindwara, India; Usha Martin University, Dept. of Computing and IT, Jharkhand, India; Mangalayatan University, Dept. of Comp. Eng. and Applications, Aligarh, India; Dhanalakshmi Srinivasan College, Dept. of ECE, Coimbatore, India

ISBN: 9798350356816 (print)

This paper introduces an Artificial Neural Network model that integrates advanced deep learning techniques from the computer vision and natural language processing domains. The model focuses on automating the captioning process for images, a crucial task in artificial intelligence. By employing Convolutional Neural Networks (CNNs) and Long Short-Term Memory networks (LSTMs), the model is trained to optimize the likelihood estimation of descriptive labels corresponding to each image in the dataset. Evaluation of the model's performance includes both quantitative metrics and qualitative assessment using the state-of-the-art BLEU-1 scoring method. © 2024 IEEE.

Keywords: Convolutional neural networks
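
BLEU-1, the metric named in the abstract, is clipped unigram precision multiplied by a brevity penalty. The sketch below computes it for one candidate caption against a single reference. The whitespace tokenization, the single-reference setup, and the function name `bleu1` are simplifying assumptions; the paper's evaluation pipeline is not described.

```python
import math
from collections import Counter


def bleu1(candidate, reference):
    """BLEU-1 for one candidate sentence against one reference sentence."""
    cand = candidate.split()
    ref = reference.split()
    if not cand:
        return 0.0
    ref_counts = Counter(ref)
    # Clip each candidate token count by its count in the reference.
    clipped = sum(min(count, ref_counts[tok])
                  for tok, count in Counter(cand).items())
    precision = clipped / len(cand)
    # Brevity penalty: penalize candidates shorter than the reference.
    if len(cand) > len(ref):
        bp = 1.0
    else:
        bp = math.exp(1 - len(ref) / len(cand))
    return bp * precision
```

For example, the candidate `"the the the"` against the reference `"the cat"` keeps only one of its three tokens after clipping, so repeating a common word does not inflate the score.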

Malware image classification based on Mamba

2nd International Conference on Artificial Intelligence, Systems and Network Security, AISNS 2024

Authors: Wang, Haoxuan. Affiliations: Jiangsu University of Science and Technology, Jiangsu, Zhenjiang, China

ISBN: 9798400711237 (print)

Malicious code detection is one of the important research directions in the field of cybersecurity. Approaches that convert code into images and apply convolutional neural networks (CNN) for malicious code detection have already been used. However, although converting the sequential information in the code into two-dimensional information can fully utilize the computational efficiency of CNNs, the impact of this two-dimensional processing is unknown. This article proposes a malicious code recognition method based on Mamba, which utilizes Mamba's advantages in processing long sequence data for malicious code recognition. The effectiveness and robustness of the model were evaluated on the Malimg dataset, achieving a recognition accuracy of 98.56%. © 2024 Copyright held by the owner/author(s).

Keywords: Convolutional neural networks
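
The abstract contrasts treating a binary as a two-dimensional image with treating it as a long one-dimensional sequence. The sketch below shows both views of the same bytes. It is only an illustration: the helper name `bytes_to_image`, the row width of 4, and the zero padding of the last row are assumptions and are not taken from the paper or the Malimg dataset.

```python
def bytes_to_image(data, width=4):
    """Reshape raw bytes into rows of `width` grayscale pixels (0-255).

    The last row is padded with zero bytes so every row has equal length.
    """
    padded = data + bytes((-len(data)) % width)
    return [list(padded[i:i + width]) for i in range(0, len(padded), width)]


sample = bytes([1, 2, 3, 4, 5])
as_sequence = list(sample)          # 1D view, the form a sequence model reads
as_image = bytes_to_image(sample)   # 2D view, the form a CNN reads
```

The 2D view places bytes that are `width` positions apart next to each other vertically, which is the kind of reshaping effect the abstract describes as not yet understood.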
