Search Results - Inner Mongolia University Library

3rd International Conference on Machine Learning and Big Data Analytics for IoT Security and Privacy, SPIoT 2023

Authors: Tan, Guihua; Liu, Yiran. Affiliation: College of Information Engineering, Loudi Xiaoxiang Vocational College, Loudi 417000, Hunan, China

Picture processing is applied in all kinds of fields, such as space science research, medical imaging, and photographic art. Because the human vision system is a complex nonlinear dynamic system, traditional image enhancement methods often cannot meet the real-time requirements of practical applications. Faced with a large number of images, it is of great practical significance to obtain useful information from these data. With the rapid development of computer technology, image compression coding technology has also made great progress. Effective segmentation and image recognition through the Mask R-CNN algorithm can help people facing massive image information to achieve efficient retrieval, analysis, and induction according to their own needs. The traditional methods of image classification by manually selecting features or manually extracting template matching have some defects. In this paper, a Mask R-CNN image optimization processing technology is proposed, which can accurately identify images and has higher accuracy for image feature extraction. It is meaningful to achieve automatic and accurate segmentation of microscopic images. Through training on the COCO data set, it is verified through experiments that there are problems in image segmentation and recognition. In the Mask R-CNN model, resource consumption is reduced and the efficiency of image segmentation and recognition is improved. Compared with the current cutting-edge algorithms, image object detection based on the Mask R-CNN model has significant advantages. © 2023 The Authors. Published by Elsevier B.V.

Keywords: image retrieval


Precision Agriculture Advancements: A Comprehensive Integrated System for Disease Prediction and Crop Yield Estimation Using Image Analysis and Environmental Data


2nd International Conference on Artificial Intelligence and Machine Learning Applications, AIMLA 2024

Authors: Nithiya, A.; Navina, N.; Thoshitha, D.; Suvetha, R.; Thirilosana, J. Affiliation: M. Kumarasamy College of Engineering, Department of Information Technology, Karur 639113, Tamil Nadu, India

ISBN (digital): 9798350349221

ISBN (print): 9798350349221

The primary problem facing agriculture, which is essential to ensuring the world's food security, is maximizing crop productivity while reducing the effects of plant diseases. Advanced technologies have the potential to completely transform agricultural methods, particularly in the areas of computer vision and machine learning. This study uses meteorological datasets and fruit image analysis to create an integrated agricultural decision support system for crop yield estimation and disease prediction. By offering early plant disease detection and precise crop yield estimates, the system seeks to improve precision agriculture techniques. A variety of datasets with plant photos labelled with disease information are gathered for the study, and meteorological data is integrated to capture environmental variables. The technology includes advanced image processing techniques to extract relevant features from plant pictures. The suggested method analyses images using a convolutional neural network technique to forecast the disease in affected fruits, and makes recommendations for natural fertilizers based on the disease identified. The Multilayer Perceptron algorithm is used to train the model using a large dataset that contains historical meteorological data, allowing it to identify patterns and connections between environmental conditions. Lastly, farmers receive an SMS notice with prediction specifics. © 2024 IEEE.

Keywords: Fruits


12th EAI International Conference on Context-Aware Systems and Applications, ICCASA 2023


12th EAI International Conference on Context-Aware Systems and Applications, ICCASA 2023

ISBN (print): 9783031588778

The proceedings contain 14 papers. The special focus in this conference is on Context-Aware Systems and Applications. The topics include: User-Based Collaborative Filtering Multi-criteria Recommender System Based on Interaction Between Criteria, Criteria Set with Choquet Integral; Application of Machine Learning Techniques to Classify Intention to Pay for Forest Ecosystem Services; Anomaly Detection in Univariate Time Series: HOT SAX versus LSTM-Based Method; Application of Machine Learning Models for Predicting Glucose-Level in the Pure Fluid with Algorithm for Reducing Data Dimension Based on Data Series Extraction; Comprehensive Survey on Remote Sensing Image Processing Techniques for Image Classification; Item-Based Energy Clustering Recommendation; General Evaluation of EtherCAT-Based Techniques in Various Industrial Systems: Review and Applications; Towards an IoT-Based Unmanned Surface Vehicle Design for Environment Monitoring in Mekong Delta; 3D CNN with BERT and Vision Transformer for Video Recognition; Identify Tumors on Lung CT Images; A Context-Aware Application to Monitor the Air Quality; Applying Guided Discovery Learning to Enhance the Achievement of Information Technology Team.


Multi-Modal Learning with Joint Image-Text Embeddings and Decoder Networks


IEEE 7th International Conference on Industrial Cyber-Physical Systems (ICPS)

Authors: Chemmanam, Ajai John; Jose, Bijoy A.; Moopan, Asif. Affiliations: Cochin Univ Sci & Technol, CPS Lab, Dept Elect, Cochin, Kerala, India; Vuelogix Technol Pvt Ltd, Kochi, Kerala, India

ISBN (print): 9798350363029; 9798350363012

Advances in machine learning and neural networks have transformed natural language processing (NLP) and computer vision (CV) applications. Recent research efforts have begun to bridge the gap between the two domains. In this work, we propose a semi-supervised Multi-Modal Encoder Decoder Network (MMEDN) to capture the relationship between images and textual descriptions, allowing us to generate meaningful descriptions of images and retrieve images from a database using cross-modality search. The semi-supervised training approach, which combines ground truth text descriptions and pseudotext generated by the text decoder within the model, requires far fewer image-text pairs in the training data and can directly add new raw images without manual text labelling for training. This approach is particularly useful for active learning environments, where labels are expensive and hard to obtain. We show that our model performs well with qualitative evaluations. We applied our model for finding images of a person from large databases and generating descriptions of people involved in an event for adding to an automatically generated report. The model was able to retrieve relevant images and generate accurate descriptions, demonstrating its applicability to more practical use cases.

Keywords: Multi-modal learning; Cross-modal retrieval; Encoder-decoder architectures; Computer vision; Natural language processing


Enhanced Feature Extraction for Image Dehazing: A Comparative Study between Deep Learning Architectures and FFA-NET


2nd International Conference on Inventive Computing and Informatics (ICICI)

Authors: Chaudhary, Sarthak; Gupta, Samridh; Iniyan, S. Affiliations: SRM Inst Sci & Technol, Dept Comp Sci & Engn, Chennai 603203, Tamil Nadu, India; SRM Inst Sci & Technol, Dept Comp Technol, Sch Comp, Chennai 603203, Tamil Nadu, India

ISBN (print): 9798350373301; 9798350373295

The problem of poor visibility in foggy images has spurred various image de-hazing strategies. As the need for high-quality images grows, especially for autonomous systems, this research aims to leverage different Deep Learning (DL) architectures to draw out key details from images, localizing this retrieved data to mitigate the impact of haze. The work explores using DL methods, particularly contrasting the regression and classification models of Convolutional Neural Networks (CNN), to remove haze from foggy images. This work sets the stage for further developments in image processing, particularly in conditions with poor visibility. It opens opportunities for improving image quality in various applications, such as autonomous driving and outdoor robotics, where clarity of vision is crucial. The final stage of the proposed model involves three specific pre-processing methods: contextual regularization, air light estimation and boundary constraint for optimal results. The next stage sets out to determine the best DL model for producing clear images from de-hazed ones.

Keywords: Machine Learning (ML); Convolutional Neural Networks (CNN); Feature Fusion Attention Network (FFA-NET); Deep Learning; Dark Channel Prior (DCP); Artificial Intelligence (AI)


Portable High-Speed Optical Gaze Controller with Vision Chip


Journal of Robotics and Mechatronics, 2022, Vol. 34, No. 5, pp. 1133-1140

Authors: Miyashita, Leo; Ishikawa, Masatoshi. Affiliations: Univ Tokyo, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan; Tokyo Univ Sci, Shinjuku Ku, 1-3 Kagurazaka, Tokyo 1628601, Japan

It is important to miniaturize robot systems while maintaining advantages such as high responsiveness and functionality for human-machine interactions and for achieving integration with other robotic systems such as drones. In this research, we focused on the miniaturization of a high-speed visual feedback system, and developed a "portable saccade mirror," which is a system that can realize active target tracking using 1000 Hz image capturing, processing, and feedback actuation with only 3 ms latency in a hand-held device. By using a three-dimensionally-stacked vision chip, the proposed system achieved high speed, low latency, low power consumption and compact size, and therefore, can be considered as a good example of a miniaturized high-speed visual feedback system. In this study, we evaluated the performance of the proposed system in comparison with the conventional optical gaze controller, and demonstrated some applications, such as tracking field scope and panorama target scanning.

Keywords: vision chip; visual feedback; optical gaze controller; high-speed image processing


Advancing Multi-Class Arc Welding Defect Classification: DEEPTLWELD Intelligent System Utilizing Computer Vision, Deep Learning, and Transfer Learning on Radiographic X-ray Images for Bangladesh's Manufacturing Sector


2024 IEEE International Conference on Computing, Applications and Systems, COMPAS 2024

Author: Chowdhury, Avijit. Affiliation: Chittagong University of Engineering and Technology, Department of Mechanical Engineering, Chattogram 4369, Bangladesh

ISBN (print): 9798331529765

Welding defects are a crucial problem in the manufacturing industry, and the industry faces enormous losses from these defects. Condition monitoring and quality control can reduce this loss. In Industry 4.0, artificial intelligence has revolutionized every field. This research aims to improve industrial automation by creating a multi-class classification system through digital image processing, deep learning, and transfer learning: a real-time application integrated with deep learning, transfer learning, image processing, and MLOps operations. Accurate predictions were obtained with various models, such as a custom CNN and pre-trained models (InceptionV3, EfficientNetB7, DenseNet201, and ResNet152). The DEEPTLWELD system was developed using a bespoke dataset of 4,000 radiographic X-ray pictures with four frequent categories of defects that are essential for tackling the challenges faced by local industry. All radiographic images in this dataset were processed with image processing techniques (distance transform, watershed transform, edge detection). In the transfer learning part, weights were loaded manually, as those weights performed better with a particular model. Of the networks, the custom CNN achieved a high accuracy of 98% after fine-tuning. The deployment of the produced model in a production environment developed with the Streamlit framework is ensured by its smooth integration into a machine learning pipeline hosted on Amazon EC2 servers. This intelligent system has the potential for broader applications in inclusive technology. © 2024 IEEE.

Keywords: Convolutional neural networks


Underwater Binocular Meta-lens


ACS Photonics, 2023, Vol. 10, No. 7, pp. 2382-2389

Authors: Liu, Xiaoyuan; Chen, Mu Ku; Chu, Cheng Hung; Zhang, Jingcheng; Leng, Borui; Yamaguchi, Takeshi; Tanaka, Takuo; Tsai, Din Ping. Affiliations: City Univ Hong Kong, Dept Elect Engn, Hong Kong 999077, Peoples R China; RIKEN, Ctr Adv Photon, Innovat Photon Manipulat Res Team, Saitama 3510198, Japan; City Univ Hong Kong, Ctr Biosyst Neurosci & Nanotechnol, Kowloon, Hong Kong 999077, Peoples R China; City Univ Hong Kong, State Key Lab Terahertz & Millimeter Waves, Kowloon, Hong Kong 999077, Peoples R China; RIKEN, Cluster Pioneering Res, Metamat Lab, Saitama 3510198, Japan; Tokushima Univ, Inst Postled Photon, Tokushima 7708506, Japan

Underwater optics in all-aquatic environments is vital for environmental management, biogeochemistry, phytoplankton ecology, benthic processes, global change, etc. Many optical techniques of observational systems for underwater sensing, imaging, and applications have been developed. To meet the demands for compact, miniaturized, portable, lightweight, and low-energy devices, a novel underwater binocular depth-sensing and imaging meta-optic device is developed and reported here. A GaN binocular meta-lens is specifically designed and fabricated to demonstrate underwater stereo vision and depth sensing. The diameter of each meta-lens is 2.6 mm, and the measured distance between the two meta-lens centers is 4.04 mm. The advantage of our binocular meta-lens is that it needs no distortion correction or camera calibration, which is necessary for traditional two-camera stereo vision systems. Based on the experimental results, we developed the generalized depth calculation formula for all-size binocular vision systems. With deep-learning support, this stereo vision system can realize fast computation of an underwater object's depth and image for real-time processing. Our artificial intelligent imaging results show that depth measurement accuracy is down to 50 μm. Besides the aberration-free advantage of flat meta-optic components, the intrinsic superhydrophobicity of our nanostructured GaN meta-lens enables an antiadhesion, stain-resistant, and self-cleaning novel underwater imaging device. This stereo vision binocular meta-lens will significantly benefit underwater micro/nanorobots, autonomous submarines, machine vision in the ocean, marine ecological surveys, etc.

Keywords: meta-lens; binocular vision; underwater depth sensing; stereo vision; deep learning


RNCE: A New Image Segmentation Approach


International Conference on Computer Vision and Machine Intelligence, CVMI 2022

Authors: Kumar, Vikash; Ali, Asfak; Chaudhuri, Sheli Sinha. Affiliation: Electronics and Telecommunication Engineering, Jadavpur, Kolkata 700032, West Bengal, India

ISBN (print): 9789811978661

Semantic image segmentation based on deep learning is gaining popularity because it is giving promising results in medical image analysis, automated land categorization, remote sensing, and other computer vision applications. Many algorithms have been designed in recent years, yet there is scope for further improvement in computer vision research. We have proposed a unique ensemble method called Ranking and Nonhierarchical Comparison Ensemble (RNCE) for semantic segmentation of landcover images based on the Ranking and Nonhierarchical Comparison methodology. Our approach has been tested on pretrained models showing improved accuracy and mean IoU with respect to the existing method. The code is available at: https://***/vekash2021/***. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

Keywords: Remote sensing


A Paradigm Shift towards Computer Vision


2023 IEEE International Conference on Device Intelligence, Computing and Communication Technologies, DICCT 2023

Authors: Chaithra, N.; Jha, Janhvi; Sayal, Anu; Gupta, Veethika; Gupta, Ashulekha. Affiliations: Bangalore, Karnataka, India; Taylor's University, Department of Mathematics, Malaysia; Doon University, School of Social Sciences, Department of Economics, Uttarakhand, India; Department of Management, Dehradun, Uttarakhand, India

ISBN (print): 9781665474924

In today's world, machine learning, artificial intelligence, IoT, deep learning and several other techniques have become the need of the moment. One such division of artificial intelligence is computer vision. The main goal of computer vision development is to create paradigms for extracting data and information from images. It has various applications in the fields of industry, agriculture, automations, healthcare, e-commerce and much more. The study examines the most recent events and conceptual frameworks governing the progress of computer vision, with a focus on pattern recognition and image processing, using a variety of applications from the t field. This article attempts to discuss the most current results and applications in computer vision. © 2023 IEEE.

Keywords: Computer vision
