检索结果-内蒙古大学图书馆

5th International conference on Industrial Engineering and artificial Intelligence, IEAI 2024

ISBN: (纸本)9798350386363

The proceedings contain 19 papers. The topics discussed include: bacterial colony counter using different image processing algorithms;detection of facial expressions based on three feature points using image processing with artificial neural networks;YOLO-based helmet detection system for safety compliance in oil and gas industry;virtual sample generation using conditional adversarial network with latent spaces as noise inputs;IoT integrated conveyor centralized system;weighted subgraph knowledge distillation for graph model compression;bacterial colony counter using different image processing algorithms;detection of facial expressions based on three feature points using image processing with artificial neural networks;and verifying the effectiveness of using virtual characters for the promotion of a university department.

关键词：

来源：评论

学校读者我要写书评

暂无评论

artificial Intelligence Based On-Board image Compression for the Φ-Sat-2 Mission

引用

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING 2023年 16卷 8063-8075页

作者： Guerrisi, Giorgia Del Frate, Fabio Schiavon, Giovanni Tor Vergata Univ Rome Dept Civil Engn & Comp Sci Engn I-00133 Rome Italy

The growing amount of data collected by Earth Observation (EO) satellites requires new processing procedures able to manage huge quantity of information. artificial intelligence (AI) and deep learning (DL) can provide advanced information also because of their ability to extract valuable information from complex data. Thanks to specific hardware platforms, these algorithms can be used also in space, opening the possibility for new procedures for intelligent data processing. The European Space Agency phi-Sat-2 mission was designed with the purpose of demonstrating the benefits of using AI in space by running AI-based applications on-board a CubeSat. We present here the convolutional autoencoder-based algorithm developed for on-board lossy image compression of the phi-Sat-2 mission and provide a first benchmark addressing a real space mission and a new image compression end-to-end architecture based on AI. image compression is a crucial application that allows to save transmission bandwidth and storage. In fact, images acquired by the sensor can be compressed on-board and sent to the ground where they are reconstructed. DL algorithms have already been successfully applied for image compression however performance degradation may occur in the context of a representative on-board environment. Therefore, besides analyzing the results for the local hardware environment, this article investigates the performance variation for the on-board setting. An additional piece of innovation is the introduction of an applicative metric for the evaluation of the compression to assess the applicability of the reconstructed images for other tasks. Such metric completes those more traditional based on the original-reconstructed image similarity.

关键词： artificial intelligence (AI) convolutional neural networks (CNNs) CubeSat image compression on-board processing

来源：评论

学校读者我要写书评

暂无评论

DDCBlock: parallel lightweight modules that focus more on long-distance information

DDCBlock: parallel lightweight modules that focus more on lo...

引用

2024 International conference on image processing and artificial Intelligence, ICIPAl 2024

作者： Liu, Yunda Liu, Haokun Chongqing Normal University Chongqing China

ISBN: (纸本)9781510681514

In recent years, artificial intelligence technology has become increasingly closely connected with various fields. However, due to the high requirements of traditional convolutional neural networks for memory and computing resources, it is relatively difficult to deploy them on mobile devices. Therefore, the demand for lightweight neural networks that can be deployed on mobile intelligent terminals is becoming increasingly urgent. This article is inspired by dilated convolution and GhostNetV2, and proposes a new lightweight module-DDC Block. It uses dilated convolution with a larger receptive field structure, combined with depthwise separable convolution, to generate more feature maps through these inexpensive operations, achieving the goal of lightweight. And introduce a decoupled fully connected attention mechanism to ensure high accuracy of the module. We conducted experiments on the CIFAR-10 and CIFAR-100 datasets and compared them with other neural networks. The results showed that this module not only reduced a significant amount of parameter and computational complexity, but also ensured high accuracy. © 2024 SPIE.

关键词： image classification

来源：评论

学校读者我要写书评

暂无评论

Mixed-decomposed convolutional network:A lightweight yet efficient convolutional neural network for ocular disease recognition

引用

CAAI Transactions on Intelligence Technology 2024年第2期9卷 319-332页

作者： Xiaoqing Zhang Xiao Wu Zunjie Xiao Lingxi Hu Zhongxi Qiu Qingyang Sun Risa Higashita Jiang Liu Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering Southern University of Science and TechnologyShenzhenChina Tomey Corporation NagoyaJapan Guangdong Provincial Key Laboratory of Brain‐inspired Intelligent Computation Department of Computer Science and EngineeringSouthern University of Science and TechnologyShenzhenChina Singapore Eye Research Institute SingaporeSingapore

Eye health has become a global health concern and attracted broad *** the years,researchers have proposed many state-of-the-art convolutional neural networks(CNNs)to assist ophthalmologists in diagnosing ocular diseases efficiently and ***,most existing methods were dedicated to constructing sophisticated CNNs,inevitably ignoring the trade-off between performance and model *** alleviate this paradox,this paper proposes a lightweight yet efficient network architecture,mixeddecomposed convolutional network(MDNet),to recognise ocular *** MDNet,we introduce a novel mixed-decomposed depthwise convolution method,which takes advantage of depthwise convolution and depthwise dilated convolution operations to capture low-resolution and high-resolution patterns by using fewer computations and fewer *** conduct extensive experiments on the clinical anterior segment optical coherence tomography(AS-OCT),LAG,University of California San Diego,and CIFAR-100 *** results show our MDNet achieves a better trade-off between the performance and model complexity than efficient CNNs including MobileNets and ***,our MDNet outperforms MobileNets by 2.5%of accuracy by using 22%fewer parameters and 30%fewer computations on the AS-OCT dataset.

关键词： artificial intelligence deep learning deep neural networks image analysis image classification medical applications medical image processing

来源：评论

学校读者我要写书评

暂无评论

Half-Split ResUNet Denoiser Based Deep Unrolling For Photon Limited image Deblurring 15

Half-Split ResUNet Denoiser Based Deep Unrolling For Photon ...

引用

15th International conference on Signal processing and Communications (SPCOM)

作者： Kumar, Koyyada Dinesh Sahoo, Sujit Kumar Indian Inst Technol Goa Sch Elect Sci Ponda 403401 India

ISBN: (纸本)9798350350463;9798350350456

Photon-limited deblurring is a complex and demanding problem encountered in various applications where low-light conditions prevail. The scarcity of photons in such situations leads to the introduction of shot noise, resulting in a degradation of image quality. Solving this problem with neural networks often involves constructing models empirically, making the behavior of the underlying architecture challenging to comprehend. A recent technique known as algorithm unrolling has enabled the connection of iterative algorithms with neural networks, where the Convolutional neural Network (CNN) acts as a denoiser. This paper introduces a reduced parameter denoiser to enhance image quality and preserve finer details or avoid over-smoothing of the image during reconstruction. As a result, the unrolled model surpasses existing deblurring methods for improving image quality in low-light conditions. The proposed denoiser reduces the number of parameters by a factor of 3.84 and preserves the finer details while reconstructing. Our model improves computational efficiency and storage requirements compared to the state-of-the-art.

关键词： Photon limited Poisson deconvolution non-blind deblurring algorithm unfolding plug and play

来源：评论

学校读者我要写书评

暂无评论

A Modern Approach to High Dynamic Range image processing with Machine Learning Architectures 27

A Modern Approach to High Dynamic Range Image Processing wit...

引用

27th International conference on Soft Computing and Measurements, SCM 2024

作者： Kovtun, Roman S. Chuprov, Sergej S. Gataullin, Ruslan I. Ruchkan, Aleksandr D. Alhasan, Ali Wixnin, IlaJa I. Saint Petersburg Electrotechnical University 'LETI' Dept. Computer Science St. Petersburg Russia Rochester Institute of Technology Golisano College of Computing and Information Sciences Rochester United States

ISBN: (纸本)9798350363708

In this paper, we analyze modern approaches and methods of neural networks employment in the tasks of capturing and demonstrating a wide dynamic sound stage (HDR). We highlight the essence of the problem and its relevance to the modern image processing applications. We identify the limitations of the conventional HDR image processing and discuss how the use of neural networks allows to address these limitations. We overview the use of various types of Machine Learning architectures in HDR image processing, such as convolutional neural networks, generative adversarial networks, and recurrent neural networks. In addition, we discuss unsupervised learning-based methods also considered by modern research. Based on our review, we develop recommendations of employing various HDR image processing methods in distinct real-world scenarios. © 2024 IEEE.

关键词： Sound stages

来源：评论

学校读者我要写书评

暂无评论

AIM: Additional image Guided Generation of Transferable Adversarial Attacks 39

AIM: Additional Image Guided Generation of Transferable Adve...

引用

39th Annual AAAI conference on artificial Intelligence, AAAI 2025

作者： Li, Teng Ma, Xingjun Jiang, Yu-Gang Shanghai Key Lab of Intell. Info. Processing School of CS Fudan University China

ISBN: (纸本)157735897X

Transferable adversarial examples highlight the vulnerability of deep neural networks (DNNs) to imperceptible perturbations across various real-world applications. While there have been notable advancements in untargeted transferable attacks, targeted transferable attacks remain a significant challenge. In this work, we focus on generative approaches for targeted transferable attacks. Current generative attacks focus on reducing overfitting to surrogate models and the source data domain, but they often overlook the importance of enhancing transferability through additional semantics. To address this issue, we introduce a novel plug-and-play module into the general generator architecture to enhance adversarial transferability. Specifically, we propose a Semantic Injection Module (SIM) that utilizes the semantics contained in an additional guiding image to improve transferability. The guiding image provides a simple yet effective method to incorporate target semantics from the target class to create targeted and highly transferable attacks. Additionally, we propose new loss formulations that can integrate the semantic injection module more effectively for both targeted and untargeted attacks. We conduct comprehensive experiments under both targeted and untargeted attack settings to demonstrate the efficacy of our proposed approach. © 2025, Association for the Advancement of artificial Intelligence (***). All rights reserved.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

Design of area-speed efficient Anurupyena Vedic multiplier for deep learning applications

引用

ANALOG INTEGRATED CIRCUITS AND SIGNAL processing 2024年第3期119卷 521-533页

作者： Kalaiselvi, C. M. Sabeenian, R. S. Sona Coll Technol Salem India Sona Coll Technol Dept ECE Salem India

Hardware such as multipliers and dividers is necessary for all electronic systems. This paper explores Vedic mathematics techniques for high-speed and low-area multiplication. In the study of multiplication algorithms, various bits-width ranges of the Anurupyena sutra are used. Parallelism is employed to address challenging problems in recent studies. Various designs have been developed for the Field Programmable Gate Array (FPGA) implementation employing Very Large-Scale integration (VLSI) design approaches and parallel computing technology. Signal processing, machine learning, and reconfigurable computing research should be closely monitored as artificial intelligence develops. Multipliers and adders are key components of deep learning algorithms. The multiplier is an energy-intensive component of signal processing in Arithmetic Logic Unit (ALU), Convolutional neural networks (CNN), and Deep neural networks (DNN). For the DNN, this method introduces the Booth multiplier blocks and the carry-save multiplier in the Anurupyena architecture. Traditional multiplication methods like the array multiplier, Wallace multiplier, and Booth multiplier are contrasted with the Vedic mathematics algorithms. On a specific hardware platform, Vedic algorithms perform faster, use less power, and take up less space. Implementations were carried out using Verilog HDL and Xilinx Vivado 2019.1 on Kintex-7. The area and propagation delay were reduced compared to other multiplier architectures.

关键词： Vedic mathematics algorithms Anurupyena sutra Multiplier optimization Priority encoder Bit Reduction

来源：评论

学校读者我要写书评

暂无评论

CT manifestations of gallbladder carcinoma based on neural network

引用

neural COMPUTING & applications 2023年第3期35卷 2039-2044页

作者： Chang, Yigang Wu, Qian Chi, Limin Huo, Huaying Shanxi Prov Peoples Hosp Taiyuan 030012 Shanxi Peoples R China Shanxi Tumor Hosp Taiyuan 030013 Shanxi Peoples R China Shanxi Med Univ Hosp 1 Taiyuan 030001 Shanxi Peoples R China

Gallbladder cancer is a relatively rare but highly malignant tumor. This study mainly explores the CT findings of gallbladder cancer based on neural networks. This study designed a gallbladder cancer LDCT image denoising network. Ability to process different doses of gallbladder cancer LDCT images with significant differences in noise and artifact distribution, this study designed the noise level estimation sub-network as a codec structure;the decoding part is used to generate the noise level of the gallbladder cancer LDCT image Artifact image. artificial neural network is a kind of artificial neural network that simulates the behavior characteristics of animal neural network and achieves the purpose of processing information by adjusting the interconnection between a large number of internal nodes. In order to meet the requirements of medical diagnosis for gallbladder cancer LDCT image quality, this study designed the backbone noise reduction network as a GAN framework that can be internally optimized. The discriminator network structure of this study is a multi-scale inception structure. As a sub-network of GAN, the discriminator network is used to distinguish true and false images and constrain the generator to make the generated images close to real images. In addition, it can be used as a noise evaluation sub-network to evaluate the noise gallbladder cancer LDCT. The treatment methods of gallbladder cancer include surgery, chemotherapy, radiation therapy, arterial interventional perfusion therapy, targeted therapy, etc. Surgery is currently the first choice for the treatment of gallbladder cancer, and the choice of surgery depends on the stage and growth site of gallbladder cancer. The image denoising network was used to evaluate the quality of the noise-reduced image. The average precision of GAN network for gallbladder cancer area is 91.0%, and the highest value is 95.2%. This study will provide a reliable reference value for the auxiliary diagnosis of gallbl

关键词： neural network Gallbladder cancer CT appearance LDCT image information GAN network

来源：评论

学校读者我要写书评

暂无评论

Unveiling the Power of Convolutional neural networks: A Comprehensive Study on Remote Sensing image Captioning and Encoder Selection

Unveiling the Power of Convolutional Neural Networks: A Comp...

引用

International Joint conference on neural networks (IJCNN)

作者： Das, Swadhin Khandelwal, Akshat Sharma, Raksha Indian Inst Technol Comp Sci & Engn Roorkee Haridwar India Indian Inst Technol Chem Engn Roorkee Roorkee India

ISBN: (纸本)9798350359329;9798350359312

Extracting semantic information from remote sensing (RS) images has gained attention for its wide applications in defense, disaster management, and urban planning. Captioning RS images is challenging due to intricate properties like resolutions, color bands, and object types. Generating precise captions requires domain expertise, and manual annotation is timeconsuming. The common approach involves using an encoderdecoder-based framework for RS image captioning, where an input image is encoded into a feature vector and decoded into a caption. Selecting the right image encoder is vital for optimizing caption prediction systems in specific domains. While Convolutional neural Network (CNN) based encoders are acknowledged for extracting crucial image features, it's important to assess variations in their mechanisms and architectures carefully. This paper thoroughly examines various CNNs to evaluate their effectiveness in RS image captioning. We also explore the performance of two caption generation techniques, viz., greedy search and beam search. The encoders are clustered as good, medium, and bad, with ResNet (CNN) emerging as the preferred choice in the good cluster across all considered datasets. The impact of choosing between beam search and greedy search is minimal. Additionally, we conduct a subjective evaluation of leading models to address limitations associated with purely numerical assessments. The paper is a novel contribution, providing the first-of-its-kind subjective evaluation of CNN-based encoders for the RS image captioning task.

关键词： Remote Sensing (RS) images captioning CNN encoder-decoder beam search greedy search and subjective evaluation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：