检索结果-内蒙古大学图书馆

Performance Evaluation of Median Filter Hardware Chip for image and Signal processing

Performance Evaluation of Median Filter Hardware Chip for Im...

Cognitive Computing in Engineering, Communications, Sciences and Biomedical Health Informatics (IC3ECSBHI), International Conference on

作者： Sreesh Gaur Akash Goel Pawan Kumar Pal Amit Kumar Singh Sanger Department of Computer Science KIET Group of Institutions Ghaziabad India

ISBN: (数字)9798331518523

ISBN: (纸本)9798331518530

The median filter is a valuable image processing tool that can be used in many applications to advance picture quality, and inferior noise, and acquire data ready for more analysis. The median filter, a non-linear image processing technique, has excessive application in many diverse fields because of its capacity to remove noise while preserving edges. The filter is significant for both simple and complex imageprocessing occupations since it is together easy to comprehend and fast to use. An active method to attain real-time performance is to devise a median filter on a FieldProgrammable Gate Array (FPGA) for image signal processing and machine vision applications. FPGAs are well-suited for photo filtering due to their capability to achieve parallel processing, which is one of their prominent advantages. The research article meets with the design and simulation of the average filter in HDL and synthesis on the FPGA for assessing the performance indices.

关键词： Performance evaluation Filtering machine vision Noise Parallel processing Logic gates Hardware Real-time systems Delays Field programmable gate arrays

来源：评论

学校读者我要写书评

暂无评论

CapNet: An Encoder-Decoder based Neural Network Model for Automatic Bangla image Caption Generation

引用

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND applications 2022年第8期13卷 752-759页

作者： Rahman, Rashik Saha, Aloke Kumar Murad, Hasan Al Masud, Shah Murtaza Rashid Rahman, Nakiba Nuren Momtaz, A. S. Zaforullah Univ Asia Pacific Comp Sci & Engn Dhaka Bangladesh Chittagong Univ Engn & Technol Comp Sci & Engn Chattogram Bangladesh

Automatic caption generation from images has become an active research topic in the field of Computer vision (CV) and Natural Language processing (NLP). machine generated image caption plays a vital role for the visually impaired people by converting the caption to speech to have a better understanding of their surrounding. Though significant amount of research has been conducted for automatic caption generation in other languages, far too little effort has been devoted to Bangla image caption generation. In this paper, we propose an encoder-decoder based model which takes an image as input and generates the corresponding Bangla caption as output. The encoder network consists of a pretrained image feature extractor called ResNet-50, while the decoder network consists of Bidirectional LSTMs for caption generation. The model has been trained and evaluated using a Bangla image captioning dataset named BanglaLekhaimageCaptions. The proposed model achieved a training accuracy of 91% and BLEU-1, BLEU-2, BLEU-3, BLEU-4 scores of 0.81, 0.67, 0.57, and 0.51 respectively. Moreover, a comparative study for different pretrained feature extractors such as VGG-16 and Xception is presented. Finally, the proposed model has been deployed on an embedded device for analysing the inference time and power consumption.

关键词： -Bangla image caption generation encoder-decoder bidirectional long short term memory (LSTM) bangla natural language processing (NLP)

来源：评论

学校读者我要写书评

暂无评论

Rapid detection of imperfect wheat grains based on deep learning technique 2

Rapid detection of imperfect wheat grains based on deep lear...

引用

2nd International Conference on image processing, Computer vision and machine Learning, ICICML 2023

作者： Hu, Kui Zhang, Hongming Lyu, Bo Shen, Yongcai Fu, Jia Wang, Fudi Lin, Zichao Anhui University Institute of Physical Science and Information Technology Hefei China Chinese Academy of Sciences Institute of Plasma Physics HFIPS Hefei China University of Science and Technology of China Science Island Branch Graduate School Hefei China Hefei Normal University Hefei China

ISBN: (纸本)9798350331417

The detection and identification of imperfect wheat grains are of great significance in evaluating their quality. Manual inspection and separation of imperfect grains in wheat are time-consuming and expensive. Therefore, there is a need for a fast, automated, and accurate method to detect imperfect grains in wheat. In this study, we created an image acquisition platform to capture images of wheat grains. Each image was labeled as either spotted, sprouted, moldy, broken, or perfect grains. To balance calculation efficiency and prediction accuracy, it is necessary to identify a suitable model. Thus, we applied several mainstream deep-learning algorithms, including ResNet50, ResNet50 with an attention module, MobileNetv1, MobileNetv2, MobileNetv3-small, and MobileNetv3-large, to construct a model suitable for practical application. It was found that the MobileNetV3-Small model achieved a good balance between computational efficiency and prediction accuracy. The model based on the MobileNetV3-Small architecture achieved high accuracy, recall rate, and F1-score, all exceeding 96%. Although the MobileNetV3-Small model's accuracy is slightly lower than that of the ResNet50 model with an added attention mechanism, it significantly reduces computational costs, improves detection speed, and cuts prediction time by 50%. Compared with other models, the MobileNetv3-small-based model is more suitable for practical applications due to its advantages of high speed, high precision, and stable prediction performance. This study can provide technical guidance for intelligent recognition and detection of wheat grains in the future. © 2023 IEEE.

关键词： deep learning image pattern recognition MobileNetV3-small Wheat imperfect grains

来源：评论

学校读者我要写书评

暂无评论

Deep Learning for HDR Imaging: State-of-the-Art and Future Trends

引用

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND machine INTELLIGENCE 2022年第12期44卷 8874-8895页

作者： Wang, Lin Yoon, Kuk-Jin Korea Adv Inst Sci & Technol Dept Mech Engn Visual Intelligence Lab Daejeon 34141 South Korea

High dynamic range (HDR) imaging is a technique that allows an extensive dynamic range of exposures, which is important in image processing, computer graphics, and computer vision. In recent years, there has been a significant advancement in HDR imaging using deep learning (DL). This study conducts a comprehensive and insightful survey and analysis of recent developments in deep HDR imaging methodologies. We hierarchically and structurally group existing deep HDR imaging methods into five categories based on (1) number/domain of input exposures, (2) number of learning tasks, (3) novel sensor data, (4) novel learning strategies, and (5) applications. Importantly, we provide a constructive discussion on each category regarding its potential and challenges. Moreover, we review some crucial aspects of deep HDR imaging, such as datasets and evaluation metrics. Finally, we highlight some open problems and point out future research directions.

关键词： Imaging image reconstruction Loss measurement Cameras Deep learning Visualization Dynamic range High-dynamic-range (HDR) imaging deep learning (DL) convolutional neural networks (CNNs)

来源：评论

学校读者我要写书评

暂无评论

machine Learning in Computer vision: A Review

引用

EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS 2021年第32期8卷 1-11页

作者： Khan, Abdullah Ayub Laghari, Asif Ali Awan, Shafique Ahmed Sindh Madressatul Islam Univ Fac Comp Sci Karachi Sindh Pakistan Benazir Bhutto Shaheed Univ Lyari Fac Comp Sci & Informat Technol Karachi Pakistan

INTRODUCTION: Due to the advancement in the field of Artificial Intelligence (AI), the ability to tackle entire problems of machine intelligence. Nowadays, machine learning (ML) is becoming a hot topic due to the direct training of machines with less interaction with a human. The scenario of manual feeding of the machine is changed in the modern era, it will learn automatically. Supervised and unsupervised ML techniques are used as a distinct purpose like feature extraction, pattern recognition, object detection, and classification. OBJECTIVES: In Computer vision (CV), ML performs a significant role to extract crucial information from images. CV successfully contributes to multiple domains, surveillance system, optical character recognition, robotics, suspect detection, and many more. The direction of CV research is going toward healthcare realm, medical imaging (MI) is the emerging technology, play a vital role to enhance image quality and recognized critical features of binary medical image, covert original image into grayscale and set the threshold values for segmentation. CONTRIBUTION: This paper will address the importance of machine learning, state-of-the-art, and how ML is utilized in computer vision and image processing. This survey will provide details about the type of tools and applications, datasets, and techniques. Limitations of previous work and challenges of future work also discussed. Further, we identify and discuss a set of open issues yet to be addressed, for efficiently applying of ML in Computer vision and image process. METHODS, RESULTS, AND CONCLUSION: In this review paper, we have discussed the techniques and various types of supervised and unsupervised algorithms of ML, general overview of image processing and the results based on the impact;neural network enabled models, limitations, tools and application of CV, moreover, highlight the critical open research areas of ML in CV.

关键词： machine Learning Computer vision Supervised and Unsupervised Learning Medical Imaging Pattern Recognition Feature Extraction Neural Network

来源：评论

学校读者我要写书评

暂无评论

Maize Leaf Healthy and Unhealthy Classification Using image processing Technique and machine Learning Classifiers 4th

Maize Leaf Healthy and Unhealthy Classification Using Image ...

引用

4th International Conference on Communications and Cyber-Physical Engineering, ICCCE 2021

作者： Khade, Vishnu C. Patil, Sanjay B. Jadhav, Sachin B. BSCOER Pune India AGCE Satara India SCSOCE Dhangawadi Pune India BVCOE Kolhapur India

ISBN: (纸本)9789811679841

Automatic detection of the healthy and unhealthy maize plant leaf is a prevalent machine vision learning task and has significant applications in the Food Industry. In this paper, effective machine learning technique for maize leaf healthy and unhealthy classifications based on leaf images that have been presented. This study estimates color feature extraction using RGB mean and standard deviation and the classification, using PNN and KNN methods. A new Five-stage image processing method is presented (including image pre-processing, image segmentation, feature extraction, classification, and performance analysis). The Experimental results show that a small set of RGB color features reach an accuracy of 92.5% and 90% using PNN and KNN classifier respectively, while doing classification the KNN classifier requires more computational time as compared to PNN Classifier. © 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

关键词： machine learning

来源：评论

学校读者我要写书评

暂无评论

AI based Plant Growth Monitoring System using Computer vision

AI based Plant Growth Monitoring System using Computer Visio...

引用

2023 IEEE Technology and Engineering Management Conference - Asia Pacific, TEMSCON-ASPAC 2023

作者： Bhamare, Archana Upadhyay, Vivek Bansal, Payal Poornima University Dept of Electronics & Communication Engineering Jaipur India

ISBN: (纸本)9798350384659

This research work aims to develop an AI-based plant growth monitoring system using computer vision. By leveraging computer vision algorithms and artificial intelligence techniques, the system will enable real-time and automated assessment of plant growth and health. Traditional methods for plant growth monitoring are time-consuming and labor-intensive. The proposed system will address these limitations by analyzing images of plants using image processing algorithms and extracting relevant features such as leaf area, plant height and width. machine learning algorithms will be employed to train models capable of recognizing patterns and predicting plant growth based on the extracted features. The system will be designed to accommodate different plant types and environmental conditions. By providing accurate and timely insights into plant health and growth, this AI-based plant growth monitoring system will benefit farmers, researchers, and plant enthusiasts, enabling informed decision-making for plant care and optimizing growth conditions. This research work aims to revolutionize plant growth monitoring by offering a cost-effective, scalable, and efficient solution with wide applications in agriculture, horticulture and environmental research. © 2023 IEEE.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Normalized Determinant Pooling Layer in CNNs for Multi-Label Classification 7

Normalized Determinant Pooling Layer in CNNs for Multi-Label...

引用

Computational Imaging Vii 2023

作者： Giuliano, Alessandro Hilal, Waleed Alsadi, Naseem Yawney, John Gadsden, S. Andrew Lab HamiltonONL8S 4L8 Canada Adastra Corporation TorontoONM5J 2J2 Canada

ISBN: (数字)9781510661615

ISBN: (纸本)9781510661608

Convolutional neural networks (CNNs) are a widely researched neural network architecture that has demonstrated exemplary performance in image processing tasks and applications compared to other popular deep learning and machine learning methods resulting in state-of-the-art performance in many image processing tasks such as image classification and segmentation. CNNs operate on the principle of automated learning of filters or kernels in contrast with hand-crafted digital filters to extrapolate features from images effectively. This paper aims to investigate whether a matrix's determinant can be used to preserve information in CNN convolutional layers. Geometrically the absolute value of the determinant is defined as a scaling factor of the linear transformation resulting from matrix multiplication. When an image's size is reduced into a feature space through a convolutional layer of a CNN, some information is lost. The intuition is that the scaling factor that results from the determinant of the pooling layer matrix can enhance the feature space introducing scaling as a piece of information in the feature space as well as lost relations between adjacent pixels. © 2023 SPIE. All rights reserved.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Imagined Speech and Visual imagery as Intuitive Paradigms for Brain-Computer Interfaces 13

Imagined Speech and Visual Imagery as Intuitive Paradigms fo...

引用

13th International Winter Conference on Brain-Computer Interface, BCI 2025

作者： Lee, Seo-Hyun Park, Ji-Ha Kim, Deok-Seon Dept. of Brain and Cognitive Engineering Korea University Seoul Korea Republic of Dept. of Artificial Intelligence Korea University Seoul Korea Republic of

ISBN: (纸本)9798331521929

Brain-computer interfaces (BCIs) have shown promise in supporting communication for individuals with motor or speech impairments. Recent advancements such as brain-to-speech or brain-to-image technology aim to reconstruct speech from neural activity. However, robust decoding of communication-related paradigms, such as imagined speech and visual imagery, using non-invasive techniques still remains challenging. This study analyzes brain dynamics in these two paradigms by examining neural synchronization and functional connectivity through phase-locking values (PLV) in noninvasively collected EEG data. Results show that visual imagery produces higher PLV values in visual cortex, engaging spatial networks, while imagined speech demonstrates consistent synchronization, primarily engaging language-related regions. These findings suggest that imagined speech is suitable for language-driven BCI applications, while visual imagery can complement BCI systems for users with speech impairments. Furthermore, the brain connectivity results implies that personalized calibration is crucial for optimizing BCI performance. © 2025 IEEE.

关键词： vision

来源：评论

学校读者我要写书评

暂无评论

Drone-Based applications for Tailings Dam Monitoring

Drone-Based Applications for Tailings Dam Monitoring

引用

APCOM 2023 Conference: Intelligent Mining: Innovation, vision, and Value

作者： Gomez, Jose A. Sattarvand, Javad Department of Mining & Metallurgical Engineering Mackay School of Earth Sciences & Engineering University of Nevada RenoNV United States

ISBN: (纸本)9780873355216

Failures of tailings dams have been happening lately. Due to the lack of laws on particular design criteria and stability requirements related monitoring during construction and maintenance, they are thought to be more fragile than hydraulic dams. Monitoring the dam is therefore necessary to understand its current condition and guarantee its safety. The physical condition of the dam could be evaluated with the early identification of seepage. Additionally, due to their adaptability and capacity for high-resolution data collecting, UAVs are an excellent choice for efficiently covering the tailings dam site. UAVs may capture high-quality photos when equipped with a high-resolution RGB camera, thermal sensors, or multispectral sensors. When these sensors are paired with image processing and machine learning algorithms, the result is a reliable estimate of the dam condition. © 2023 Society for Mining, Metallurgy & Exploration Inc. All rights reserved.

关键词： Learning algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：