检索结果-内蒙古大学图书馆

An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM

MULTIMEDIA TOOLS AND applications 2023年第18期82卷 28043-28065页

作者： Enireddy, vamsidhar Anitha, J. Mahendra, N. Kishore, G. Koneru Lakshmaiah Educ Fdn Dept Comp Sci & Engn Guntur 522502 Andhra Pradesh India Malla Reddy Engn Coll Dept Comp Sci & Engn Hyderabad 500100 Telangana India Miracle Educ Soc Grp Inst Miracle City 535216 Andhra Pradesh India RISE Krishna Sai Prakasam Grp Inst Dept CSE Ongole 523272 Andhra Pradesh India

In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%.

关键词： Baby sign language Automated recognition Computer vision Optimization

来源：评论

学校读者我要写书评

暂无评论

Enhancing Digital Manufacturing with Affordable vision Systems: Exploring Low-Cost applications

Enhancing Digital Manufacturing with Affordable Vision Syste...

引用

2023 Low-Cost Digital Solutions for Industrial Automation, LoDiSA 2023

作者： Ling, Zhengyang Hawkridge, Gregory McFarlane, Duncan Thorne, Alan Institute for Manufacturing Department of Engineering University of Cambridge CB3 0FS United Kingdom

vision systems play a pivotal role in the digitalization of manufacturing processes. They offer various benefits, such as quality control, process monitoring, and digitizing analog data. Developing vision systems can be a complicated, and expensive process, tailored to specific applications. However, not every problem requires a high-end solution. This paper aims to identify the scope of low-cost solutions that can be effectively addressed using simple vision systems. It adopts a'Shoestring' design approach, focusing on developing vision systems tailored to the needs of small and medium-sized companies (SMEs). Essential service modules and building blocks are proposed, utilizing off-the-shelf components and open-source image processing software. A simplified development procedure is outlined and applied to two industrial case studies: legacy panel status monitoring and braid materials quality inspection. Both cases originate from SMEs and have been successfully tested and deployed on the shop floor. The study demonstrates the feasibility of implementing these cost-effective vision systems in SMEs, providing a valuable development guideline for low-cost vision applications in the industrial sector. © 2023 IET Conference Proceedings. All rights reserved.

关键词： Process monitoring

来源：评论

学校读者我要写书评

暂无评论

Application of the image processing Technique for Powerline Robot 11th

Application of the Image Processing Technique for Powerline ...

引用

11th EAI International Conference on Context-Aware Systems and applications, ICCASA 2022

作者： Ngo, Ha Quang Thinh Bui, The Tri 268 Ly Thuong Kiet Street District 10 Ho Chi Minh City Viet Nam Linh Trung Ward Thu Duc City Ho Chi Minh City Viet Nam

ISBN: (纸本)9783031288159

Applying image processing to electromechanical systems is a problem of interest to scientists, in order to serve humans in many fields. To do that, there needs to be a connection between image processing and mechanical construction to create complete mobile cameras. One of the research directions is about mobile cameras, specifically a system consisting of dual cameras that detect and track moving objects, and at the same time calculate the distance from the dual camera system to the target, this system can be application in object tracking robot. In this paper, the research object includes the camera system designed according to the pan-tilt structure, the algorithm used for object detection is YOLO-based on CNN, estimating the distance from the camera system to the object. By means of stereo vision, control the pan-tilt system to automatically track objects. © 2023, ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering.

关键词： machine learning

来源：评论

学校读者我要写书评

暂无评论

Pigment Epithelial Detachment Detection: A Review of Imaging Techniques and Algorithms

Pigment Epithelial Detachment Detection: A Review of Imaging...

引用

2022 International Conference on Advanced Computing Technologies and applications, ICACTA 2022

作者： Sheeba, T.M. Albert Antony Raj, S. Anand, M. SRM Institute of Science and Technology College of Science and Humanities Department of Computer Applications Chennai India SRM Institute of Science and Technology College of Engineering and Technology Department of Networking and Communications Chennai India

ISBN: (纸本)9781665495158

Pigment epithelial detachment(PED) is a disorder in retina that happens when RPE layers of cells at the back side of the eye come apart, or get teared. The bend of layers in the retina, as well as fluid, proteins, tissue, or blood vessels, is a defining feature of PED disease, which occurs most frequently in the macula. PED can disturb the vision of the people which is often depict dark shadow, blurry vision or partial loss of vision. The optical coherence tomography (OCT) is a trend set of high resolution and non-invasive imaging modality that expedite the structure of the retina. OCT non-invasively yields cross-sectional volume of images with tissues. The major objective of this research paper is to study, state of art and to classify the retinal layer segmentation techniques, PED fluid segmentation and classification of diseases in retinal OCT images. The medical industry is suffering with more critical patients and the cases are increasing in eye diseases double the number as of now. The artificial intelligence (AI) techniques help the health sector with a great and accurate automatic detection of disease. The image classification and pattern recognition are transforming the industry with artificial intelligence techniques. Many studies are being conducted employing image processing to aid in the early diagnosis of this disease. image processing techniques have advanced as a result of the introduction of artificial intelligence and machine learning. In this review paper, the structure classification methods and the image segmentation method that are best available existing research is discussed. This review summarizes all the recent algorithms that suits for the application of machine learning algorithms for predicting retinal diseases in OCT images. The algorithms discussed from existing research paper, produce the readers to identify the best accurate algorithm for retinal classification of infected eye and normal eye, precision and less processing time for la

关键词： Optical tomography

来源：评论

学校读者我要写书评

暂无评论

Multilevel Crop image Segmentation Using Firefly Algorithm and Recursive Minimum Cross Entropy 4th

Multilevel Crop Image Segmentation Using Firefly Algorithm a...

引用

4th International Conference on machine Intelligence and Signal processing, MISP 2022

作者： Kumar, Arun Kumar, A. vishwakarma, Amit PDPM Indian Institute of Information Technology Design and Manufacturing Jabalpur India

ISBN: (纸本)9789819900466

image segmentation plays an important role in computer vision technology and agriculture is one of their applications. The crop images present near the vicinity are complex and dense. Hence, multilevel thresholding of such crop images is a tedious task. In this paper, we propose multilevel thresholding of crop images using recursive minimum cross entropy and firefly algorithm. The firefly is based on the social behavior of the swarm of fireflies and bioluminescent information-sharing phenomena. It is a swarm-based algorithm, which offers a better search mechanism to find the optimum threshold value. The performance of the proposed method is observed over ten complex background crop images and compared with the wind-driven optimization algorithm. The better fidelity parameters evidence the superiority of the proposed method. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

关键词： image segmentation

来源：评论

学校读者我要写书评

暂无评论

A 40nm 2TOPS/W Depth-Completion Neural Network Accelerator SoC With Efficient Depth Engine for Realtime LiDAR Systems

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS 2023年第5期70卷 1704-1708页

作者： Sun, Miao Cao, Yingjie Qian, Jian Li, Jie Zhou, Sifan Zhao, Ziyu Wu, Yifan Xia, Tao Qin, Yajie Qiu, Lei Ma, Shunli Chiang, Patrick Yin Zhuo, Shenglong Fudan Univ State Key Lab AS & Syst Shanghai 200433 Peoples R China Dept IC Design TiMESiNTELLi Technol Shanghai 201203 Peoples R China Tongji Univ Coll Elect Informat & Engn Shanghai 200082 Peoples R China

Light Detection and Ranging (LiDAR) is becoming a critical requirement for future computer vision applications, such as AR/vR (iPhone-LiDAR) and ADAS (Automotive-LiDAR). A depth point-cloud input has different characteristics than a conventional RGB image input, such that the CNN depth-inference implementation is unique when compared with a standard super-resolution CNN(SR-CNN). In this brief, we present a heterogeneous AI-accelerator SoC, which is specific to depth image completion computation. Three key innovations are introduced to improve SoC's performance. First, to accommodate the unique input data structure of a depth input, a fully-filled dataflow management engine is proposed to pre-process the RGB+Depth input, significantly improving processing element utilization (PEU). Second, to improve the efficiency of the instruction configurations of the CNN accelerator, a hardware-tiling co-processor is proposed that performs the tiling strategy of the CNN accelerator, assigning each sub-job to the PE array directly, therefore reducing the time for task assignments. Third, due to the large number of vector operations required for the post-process in the neural network, a RISC-v core is incorporated to execute vector computations better. The SoC is implemented in 40nm CMOS process, achieving 2TOPs/W energy efficiency with 34fps throughput under vGA-resolution output for real-time LiDAR systems.

关键词： Engines Single-photon avalanche diodes Neural networks Laser radar Random access memory Convolution Costs Depth completion depth engine RISC-v extended vector DSA on-chip co-processor scheduler

来源：评论

学校读者我要写书评

暂无评论

Cross-Dataset Generalization in -Based Plant Disease Recognition 2

Cross-Dataset Generalization in -Based Plant Disease Recogni...

引用

2nd International Conference on Artificial Intelligence and machine Learning applications, AIMLA 2024

作者： Sathiyapriya, N. Ram Shankar, S. Suresh Krishna, R. viknesh, v. K.S.Rangasamy College of Technology Department of Information Technology Namakkal Tamil Nadu Tiruchengode India

ISBN: (数字)9798350349221

ISBN: (纸本)9798350349221

Plant diseases recognition large crop losses and have negative economic effects, which makes them a serious danger to the world's food security. Early and accurate disease diagnosis is essential for efficient disease control and mitigation. Convolutional neural networks (CNNs) are a viable method for diagnosing plant diseases based on visual symptoms because of their impressive performance in image recognition tasks in recent years. In this work, we present a novel CNN algorithm created especially for the diagnosis of plant diseases. The accuracy and resilience of disease classification are increased by the suggested CNN architecture's tuning, which is designed to extract the intricate patterns and variations of plant illnesses from photos. To enhance the model's capacity for generalisation, extensive datasets including various plant species and disease symptoms are gathered and subjected to pre-processing techniques that standardise image quality. Using a stochastic gradient descent technique on the pre-processed CNN weights, the training procedure The model's hyper parameters are fine-tuned to prevent overfitting and improve performance on a separate validation set. An independent test data set is used to thoroughly assess the trained CNN in order to calculate a confusion matrix and quantify accuracy, recall, F1 score, and precision. © 2024 IEEE.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

A Comparison Between CCTv and Industrial Cameras for vehicle Attribute Recognition

A Comparison Between CCTV and Industrial Cameras for Vehicle...

引用

Iranian machine vision and image processing (MvIP)

作者： Mohammadreza Asadi Mohammad Yasin Fakhar Seyedeh Sogand Hashemi Safiyeh Rezaei Mohamad Kiani Abari Seyed Alireza Abbaspour AI Research Center Hoopad Vision Company Isfahan Iran

In machine/computer vision, cameras serve a major role in image acquisition. Surveillance scenarios typically rely on Closed-Circuit Television (CCTv) cameras. This study aims to evaluate industrial cameras within a surveillance application, contrasting their performance with that of CCTv cameras. We explore the comparative analysis of CCTv and industrial cameras for vehicle attribute recognition, specifically concentrating on the recognition of vehicle color and model using deep learning techniques. To train and evaluate the models, we have created datasets from images captured by both a CCTv and an industrial camera. Our findings indicate that the industrial camera outperforms the CCTv. However, employing advanced processing algorithms has the potential to minimize the performance gap between these two cameras. Our research represents one of the initial comparative analyses between these camera types, offering valuable guidance in selecting the most suitable camera for specific applications.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Multi-dilation Convolutional Neural Network for Automatic Handwritten Signature verification

引用

SN Computer Science 2023年第5期4卷 1-12页

作者： Upadhyay, Rashmi Rathi Mehta, Ravishankar Singh, Koushlendra Kumar Machine Vision and Intelligence Lab Dept. of CSE National Institute of Technology Jamshedpur Jharkhand 831014 India

With the recent advancements in deep learning techniques, the application areas of unstructured data analytics are emerging in multiple domains. One of the popular applications is analyzing image unstructured data. Multiple image analytics-based solutions have been developed to establish an autonomous, cutting-edge computer-based approach for identification and verification. The exponential growth in text documents, images, and videos in every domain is driving the development of multiple image analytics-based applications to get insights and improve solutions. Secure authentication and verification of handwritten signatures play an important role in security and authentication, particularly in financial institutions, legal transactions, etc. One of the exciting applications of Deep Learning is automated signature verification for person identifications. Since signature verification is the most commonly accepted biometric attribute by law enforcement officials and agencies, making it more secure is a major challenging task. In the proposed work, the author developed an efficient multi-dilation convolutional neural network-based model for handwritten signature verification. It has been observed that the proposed model is memory efficient and does not require many pre-processing and hardware resources like GPU. The proposed model is validated on the CEDAR dataset which contains 24 genuine and 24 forgery off-line signatures for each of 55 writers. The authors have made a comparative analysis of the proposed method with other state-of-the-art methods discussed in the literature. The model achieves more than 99% accuracy with an equal error rate of 6.00 which is a good improvement over the other existing methods. © 2023, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.

关键词： Deep learning Multi-dilation Offline signature verification ResNet

来源：评论

学校读者我要写书评

暂无评论

A Black-Box Targeted Misclassification Attack on Edge AI with Adversarial Examples Generated from RAW image Sensor Data

A Black-Box Targeted Misclassification Attack on Edge AI wit...

引用

2024 Asian Hardware Oriented Security and Trust Symposium, AsianHOST 2024

作者： Hu, Bowen He, Weiyang Wang, Kuo Chang, Chip-Hong School of Electrical and Electronic Engineering Nanyang Technological University Singapore

ISBN: (纸本)9798350368062

Deep learning has witnessed pervasive deployment on edge devices over the past decade, especially for computer vision applications. However, its vulnerability to adversarial attacks, where visually imperceptible patterns cause machine learning models to malfunction, has raised significant security concerns. The connection between the image sensor and the application processors is typically not encrypted nor signed for data integrity verification, leaving the data link exposed to tampering threats. Previous works have demonstrated how these threats can be exploited. However, these methods typically inject attack patterns into the RAW image data without considering the effect of the image signal processing (ISP) pipeline, which can undesirably weaken the adversarial effects. Insofar such attacks have not succeeded in more powerful targeted misclassification fraud attacks where a selected target can be misclassified into the attacker's intended output. In this work, we propose a novel RAW image domain black-box attack that incorporates a differentiable ISP to train a knowledge-distilled substitute classifier to generate adaptive adversarial perturbations that survive the ISP. We show that such an attack is feasible by attacking the edge implementations of ResNet18 and MobileNetv2 with adversarial examples generated from their knowledge-distilled models by applying differentiable ISP on RAW formatted GTSRB test images captured by a Raspberry Pi camera. Our results demonstrate that its attack success rate surpasses previous direct mapping techniques by 10.37% and 13.07%, respectively for ResNet18 and MobileNetv2 in untargeted misclassification attack tasks with greater stealthiness when the adversarial examples are displayed on an LCD monitor for comparison. More importantly, it can achieve the targeted misclassification at an attack success rate of 95.09 % and 51.98 % respectively, which is currently impossible with existing camera-link attack methods. The results are r

关键词： Adversarial machine learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：