检索结果-内蒙古大学图书馆

Classification of Animal Species Using a Deep Neural Network-Based Feature Extraction Method 21

Classification of Animal Species Using a Deep Neural Network...

21st IEEE International Conference on Smart Communities: Improving Quality of Life using AI, Robotics and IoT, HONET 2024

作者： Ibrahim, Mohammed Al-Kubaise, Khamis Alkapti, Ali Almusa, Abdullah Abdelaziz, Osama Al-Maadeed, Somaya Sadasivuni, Kishor Kumar Qatar University Computer Science and Engineering Department Doha Qatar Center for Advanced Materials Qatar University PoBox 2713 Doha Qatar

ISBN: (纸本)9798350378078

This study presents an innovative approach to animal classification and recognition utilizing machine learning and deep learning methodologies. Leveraging advanced algorithms, the proposed system achieves remarkable accuracy in identifying diverse animal species. By integrating sophisticated image processing techniques, the system enhances image quality, improving overall performance. The research demonstrated that the SvM model combined with deep neural network-based feature extraction achieved the highest accuracy of 95.65%. This paper represents a significant stride toward improving the precision and efficiency of animal classification, offering promising applications in biodiversity conservation and ecological monitoring by using advanced feature extraction approach with deep learning. © 2024 IEEE.

关键词： Animal Recognition Computer vision Deep Learning Feature Extraction KNN

来源：评论

学校读者我要写书评

暂无评论

Recommendation Systems using Artificial Intelligence and using machine Learning 15

Recommendation Systems using Artificial Intelligence and usi...

引用

15th International Conference on Advances in Computing, Control, and Telecommunication Technologies, ACT 2024

作者： Turukmane, Anil v. Adhikari, Aviraj Das Chandini, Kollati School of Computer Science and Engineering VIT-AP University AP Vijaywada India

ISBN: (纸本)9798331300579

Recommender systems make individualized suggestions for users based on their actions and preferences by utilizing machine learning (ML) and artificial intelligence (AI).. These systems have evolved significantly, incorporating various AI techniques like fuzzy techniques, transfer learning, genetic algorithms, neural networks, deep learning, and more. The use of AI in recommender systems aims to enhance prediction accuracy and address data sparsity issues.1 Key methodologies in recommender systems include deep neural networks, transfer learning, active learning, fuzzy techniques, evolutionary algorithms, natural language processing, and computer vision.1 These techniques play crucial roles in knowledge representation, reasoning, planning, communication, perception, and image processing within recommender systems.1 machine Learning plays a vital role in recommendation systems by utilizing algorithms like KNN clustering, Naive Bayes, collaborative filtering and content filtering to suggest products to users overwhelmed by information on e-commerce platforms.4 Additionally, Recommender Systems (RSs) are widely used across various domains such as e-commerce, tourism, health, and e-learning to enhance user experience and increase sales through personalized recommendations based on user preferences.5 RSs have become integral in guiding decisions for users in online transactions and improving the quality of their interactions with platforms like Amazon, Netflix, YouTube, Spotify, Facebook, and Twitter5 used to pinpoint people who are most at risk for developing complications from an illness or who are most likely to have poor treatment outcomes. These data can be used to develop personalized treatment plans for patients. © Grenze Scientific Society, 2024.

关键词： Knowledge representation

来源：评论

学校读者我要写书评

暂无评论

Novel coronavirus (COvID-19) diagnosis using computer vision and artificial intelligence techniques: a review

引用

MULTIMEDIA TOOLS AND applications 2021年第13期80卷 19931-19946页

作者： Bhargava, Anuja Bansal, Atul GLA Univ Mathura India

The universal transmission of pandemic COvID-19 (Coronavirus) causes an immediate need to commit in the fight across the whole human population. The emergencies for human health care are limited for this abrupt outbreak and abandoned environment. In this situation, inventive automation like computer vision (machine learning, deep learning, artificial intelligence), medical imaging (computed tomography, X-Ray) has developed an encouraging solution against COvID-19. In recent months, different techniques using image processing are done by various researchers. In this paper, a major review on image acquisition, segmentation, diagnosis, avoidance, and management are presented. An analytical comparison of the various proposed algorithm by researchers for coronavirus has been carried out. Also, challenges and motivation for research in the future to deal with coronavirus are indicated. The clinical impact and use of computer vision and deep learning were discussed and we hope that dermatologists may have better understanding of these areas from the study.

关键词： Computer vision Computed tomography machine learning Coronavirus COvID-19

来源：评论

学校读者我要写书评

暂无评论

Robust Optimization-based Neural Architectures Against Adversarial Attacks 22

Robust Optimization-based Neural Architectures Against Adver...

引用

22nd IEEE International Multi-Conference on Systems, Signals and Devices, SSD 2025

作者： Ben Ali, Khaoula Messaoud, Seifeddine Ali Hajjaji, Mohamed Atri, Mohamed Liouane, Noureddine Image Processing National Engineering School Monastir University Laboratory of Automatic Signal Monastir Tunisia Monastir University Faculty of Sciences of Monastir Monastir Tunisia Higher Institute of Applied Sciences and Technology of Sousse Sousse University Sousse Tunisia College of Computer Science King Khalid University Computer Engineering Department Abha Saudi Arabia

ISBN: (纸本)9798331542726

Recent years have seen a rapid development in machine Learning, which has profoundly influenced many areas of science and engineering. Among them, computer vision takes the leading place, where important tasks are image classifications powered by CNNs. Despite the great performance of CNNs in complicated scenarios, they remain sensitive to so-called adversarial attacks, and deliberate perturbations leading them to incorrect predictions. Besides more innocuous consequences, this has serious security implications for critical applications, in-cluding medical diagnostics, where misclassifications might result in disastrous outcomes. This research work discusses adversarial attacks on CNNs and other DNNs in computer vision, studying a full range of the generation and detection methods with details while discussing intrinsic vulnerability and robustness. It also proposes a learning framework that will enhance the robustness and security of DNNs and CNNs against such adversarial perils. The ultimate goal is directed to an improvement in the reliability of such models in absolutely critical scenarios for safe deployment into applications where accuracy is crucial. © 2025 IEEE.

关键词： Adversarial Attacks Deep Neural Networks (DNNs) machine Learning Security

来源：评论

学校读者我要写书评

暂无评论

visual Question Answering Optimized Framework using Mixed Precision Training

Visual Question Answering Optimized Framework using Mixed Pr...

引用

2023 International Conference on Artificial Intelligence and applications, ICAIA 2023 and Alliance Technology Conference, ATCON-1 2023

作者： Chowdhury, Souvik Soni, Badal National Institute of Technology Silchar Department of Computer Science and Engineering Silchar India

ISBN: (纸本)9781665456272

Thanks to the emergence and continued devel-opment of machine learning, particularly deep learning, the research on visual question and answer, also known as vQA, has advanced dramatically, with great theoretical research significance and practical application value. This field of study makes use of multimodal learning, computer vision, and natural language processing techniques. Except for a few academics who presented different types of optimized bi-linear fusion approaches that integrate text and image characteristics in an efficient way, there haven't been many efforts to optimize the vQA framework. In order to optimize the vQA problem, we offer a unique visual Question Answering framework in this research. Because both 16-bit and 32-bit floating points provide automatic mixed precision, deep learning architectures can now be optimized with less computation and execution time. Using the vQA 2.0 and CLEvR datasets, the proposed framework has been tested against two models. In terms of overall accuracy and execution time, the experimental findings demonstrated a significant improvement. © 2023 IEEE.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Multi-level receptive field feature reuse for multi-focus image fusion

引用

machine vision AND applications 2022年第6期33卷 92-92页

作者： Jiang, Limai Fan, Hui Li, Jinjiang Chinese Acad Sci Shenzhen Inst Adv Technol Shenzhen Peoples R China Univ Chinese Acad Sci Shenzhen Coll Adv Technol Shenzhen Peoples R China Shandong Technol & Business Univ Sch Comp Sci & Technol Yantai Peoples R China Coinnovat Ctr Shandong Coll & Univ Future Intelli Yantai Peoples R China

Multi-focus image fusion, which is the fusion of two or more images focused on different targets into one clear image, is a worthwhile problem in digital image processing. Traditional methods are usually based on frequency domain or space domain, but they cannot guarantee the accurate measurement of all the image details of the activity level, and also cannot perfect the selection of image fusion rules. Therefore, the deep learning method with strong feature representation ability is called the mainstream of multi-focus image fusion. However, until now, most of the deep learning frameworks have not balanced the relationship between the two input features, the shallow features and the feature fusion. In order to improve the defects of previous work, we propose an end-to-end deep network, which includes an encoder and a decoder. Encoder is a pseudo-Siamese network. It extracts the same and different feature sets by using the features of double encoder, then reuses the shallow features and finally forms the coding. In decoder, the coding will be analyzed and dimensionally reduced enough to generate high-quality fusion image. We carried out extensive experiments. The results show that our network structure is better. Compared with various image fusion methods based on deep learning and traditional multi-focus image fusion methods in recent years, our method is slightly better than theirs in both objective metric contrast and subjective visual contrast.

关键词： Multi-focus image fusion Deep learning Regression model Feature reuse

来源：评论

学校读者我要写书评

暂无评论

End-to-end optimized image compression with the frequency-oriented transform

引用

machine vision AND applications 2024年第2期35卷 27-27页

作者： Zhang, Yuefeng Lin, Kai Beijing Inst Comp Technol & Applicat 51th Yongding Rd Beijing 100039 Peoples R China Peking Univ Sch Comp Sci Beijing 100871 Peoples R China

image compression constitutes a significant challenge amid the era of information explosion. Recent studies employing deep learning methods have demonstrated the superior performance of learning-based image compression methods over traditional codecs. However, an inherent challenge associated with these methods lies in their lack of interpretability. Following an analysis of the varying degrees of compression degradation across different frequency bands, we propose the end-to-end optimized image compression model facilitated by the frequency-oriented transform. The proposed end-to-end image compression model consists of four components: spatial sampling, frequency-oriented transform, entropy estimation, and frequency-aware fusion. The frequency-oriented transform separates the original image signal into distinct frequency bands, aligning with the human-interpretable concept. Leveraging the non-overlapping hypothesis, the model enables scalable coding through the selective transmission of arbitrary frequency components. Extensive experiments are conducted to demonstrate that our model outperforms all traditional codecs including next-generation standard H.266/vvC on MS-SSIM metric. Moreover, visual analysis tasks (i.e., object detection and semantic segmentation) are conducted to verify the proposed compression method that could preserve semantic fidelity besides signal-level precision.

关键词： image compression image processing Computer vision machine learning

来源：评论

学校读者我要写书评

暂无评论

A computer vision approach to calculate diameter, volume, velocity and flow rate of bubble leaks in offshore wells 20

A computer vision approach to calculate diameter, volume, ve...

引用

20th ACS/IEEE International Conference on Computer Systems and applications (AICCSA)

作者： Chagas, Joao v. S. Texeira, Gleber T. Araujo, Adriel S. Goncalves, Andre Passos, Fernanda G. O. Conci, Aura Univ Fed Fluminense Inst Comp Niteroi RJ Brazil Inst Politecn Lisboa ISEL Lisbon Portugal Lusofona Univ COPELABS Lisbon Portugal Petrobras SA Rio De Janeiro RJ Brazil

ISBN: (纸本)9798350319439

This paper presents an approach for detection and quantification, with low latency, of the flow of leakage bubbles, in sub-surfaces, making use of video recorded by remote underwater vehicle using only image analysis and under the premise of no overlapping bubbles. Implementation details are presented allowing its trial and reproduction. Results are confronted with videos acquired in a laboratory under controlled conditions and in real operational situation from literature, showing great efficiency in terms of processing time and all other important aspects for pipeline inspections, considering environment and safety in the oil industry.

关键词： Offshore oil well production

来源：评论

学校读者我要写书评

暂无评论

Streamlining Crop Segmentation with Multispectral Imaging and Foundation Models: Minimizing Manual Annotation 20

Streamlining Crop Segmentation with Multispectral Imaging an...

引用

20th IEEE International Conference on Intelligent Computer Communication and processing Conference, ICCP 2024

作者： Aszkowski, Przemyslaw Kraft, Marek Institute of Robotics and Machine Intelligence Poznań University of Technology Poznań Poland

ISBN: (纸本)9798331539979

Deep learning advancements have significantly enhanced computer vision applications in precision agriculture. While RGB cameras operating in visible light are affordable, they provide limited information compared to multispectral equipment. This research analyses methods to reduce the need for manual annotation when training a model using only RGB images, without compromising the model's accuracy. We propose a semi-supervised approach where a teacher model, trained on multispectral images, generates artificial ground truth data to train a student model that operates solely on RGB images. This strategy has enabled us to achieve nearly a tenfold reduction in the required training data while maintaining similar performance metrics. Additionally, we explore the potential of segmentation foundation models to simplify the manual annotation process, reducing the need for full segmentation masks to just bounding boxes. Our findings also indicate that using multispectral images as input for the Segment Anything Model is more effective than using RGB images. © 2024 IEEE.

关键词： image segmentation

来源：评论

学校读者我要写书评

暂无评论

Multichannel Object Detection with Event Camera

Multichannel Object Detection with Event Camera

引用

International image processing, applications and Systems Conference (IPAS)

作者： Rafael Iliasov Alessandro Golkar Chair of Spacecraft Systems Technical University of Munich Munich Germany

ISBN: (数字)9798331506520

ISBN: (纸本)9798331506537

object detection based on event vision has been a dynamically growing field in computer vision for the last 16 years. In this work, we create multiple channels from a single event camera and propose an event fusion method (EFM) to enhance object detection in event-based vision systems. Each channel uses a different accumulation buffer to collect events from the event camera. We implement YOLOv7 for object detection, followed by a fusion algorithm. Our multichannel approach outperforms single-channel-based object detection by 0.7% in mean Average Precision (mAP) for detection overlapping ground truth with IOU = 0.5.

关键词： Computer vision Event detection machine vision AI accelerators Object detection Cameras Feature extraction Real-time systems

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：