In this paper three methods for determining displacements on images are compared. Two of them are neuralnetworks designed for Particle image Velocimetry images processing. The third method is classic cross-correlatio...
详细信息
In the realm of AI-enabled Wireless Sensor networks (WSNs) and Internet of Things (IoT) integration, efficient resource allocation is paramount for enhancing energy efficiency and optimizing data utilization. The dyna...
详细信息
A Convolutional neural Network (CNN) are a class of artificialneuralnetworks specifically designed to process data with a grid-like topology, such as images, making them well-suited for tasks like image recognition ...
详细信息
ISBN:
(数字)9798350376685
ISBN:
(纸本)9798350376692
A Convolutional neural Network (CNN) are a class of artificialneuralnetworks specifically designed to process data with a grid-like topology, such as images, making them well-suited for tasks like image recognition and classification, object detection, and speech recognition. However, their current software implementations leave much to be desired regarding energy efficiency, speed, performance and scalability. Hardware-based implementation of CNNs has several significant advantages over software, including faster data processing due to the parallel execution of hardware-based FPGAs and ASICs, which are necessary for real-time applications. They are more energy-efficient and have consistent, predictable performance. In this paper, we present the implementation of CNN on Verilog. We implemented a highly optimized CNN regression model architecture for object detection having an accuracy of 97%. The entire design was made on Verilog, allowing easy transferability to both FPGA and ASIC platforms. The proposed work is compared to the current standard for software implementations – Google Colab. The results obtained show considerable speed up and improved performance.
As one of the key problems in computer-aided medical image analysis, learning how to model global relationships and extract local details is crucial to improve the performance of abdominal multi-organ segmentation. Wh...
详细信息
The proceedings contain 28 papers. The special focus in this conference is on Climbing and Walking Robots and the Support Technologies for Mobile Machines. The topics include: 27 Years of Climbing and Walking Robots ...
ISBN:
(纸本)9783031707216
The proceedings contain 28 papers. The special focus in this conference is on Climbing and Walking Robots and the Support Technologies for Mobile Machines. The topics include: 27 Years of Climbing and Walking Robots – Are We There?;Torque Controlled or Intrinsically Compliant? DLR’s Perspective on Robust and Efficient Biped and Quadruped Locomotion;Efficient Stream-Based Active Learning Initialization for Legged Robots Based on a PCA/K-Means image Selection Approach;precision Vehicle Pose Estimation with Uncertainty-aware neural Network;HAPmamba: Linear-Time Sequence Modeling for Terrain Classification by Legged Robots;neural-Based Self-collision Checking for a Quadruped Robot;omnidirectional Climbing Robot for Maintenance Services on Hard to Reach Places of Ship Hulls;demonstration of a Micro Wall-Climbing Robot Moving on Metal Surfaces;linkage Length Optimization of a Climbing Inspection Robot Using an Area Overlap Method;Rotary Push-in Mechanism for Variable Outer Diameter PIGs with Multiple-Connected Using Pneumatic artificial Muscles;basic Study on a Peristaltic Motion-Type In-Pipe Inspection Robot Using a Hyper-Extension Unit for Improving Locomotion Speed;proposal of Operation Methods of the Square-Duct Cleaning Machine with Multistage Planetary Gear Mechanism;earth-Shaping with Heterogeneous Robotic Teams: From Sim to Real;mobile Victim Signs Monitoring Through Non-invasive Robotic System;Multi-UAV Coverage Path Planning for Agricultural applications;autonomous Landing Pad with a Closed Cover for a Medium-Sized Drone to Support Typical Research and Reconaissance Tasks in the Local Environment;concept of Pneumatic Soft Robot: Suction-Driven Locomotion;climbing Robot Inspired by Inchworms: Adaptable for Tubular and Flat Surfaces with Multi-plane Work Capability;high-Propulsive Trunk Flexion–Extension Mechanism Using Cheetah-Inspired S-Shaped Spine;self-organized Locomotion with Multiple Stepping Frequencies in an Insect-Like Robot Under Decentralized Adaptive
Advanced artificial intelligence (AI) algorithms, particularly those based on artificialneuralnetworks, have garnered significant attention for their potential applications in areas such as image recognition and nat...
详细信息
image captioning, which exists at the point of intersection of computer vision and natural language processing, is essential for enhancing image comprehension, allowing applications like content discovery, visual aid ...
详细信息
ISBN:
(数字)9798350394474
ISBN:
(纸本)9798350394481
image captioning, which exists at the point of intersection of computer vision and natural language processing, is essential for enhancing image comprehension, allowing applications like content discovery, visual aid for the blind, and more. The hunt for more precise and reliable picture captioning models continues to be an important research goal as technology develops quickly. The two prominent image captioning techniques used in this study image Captioning Using LSTM+CNN and image Captioning Using VisionGPT2 are thoroughly compared. We examine these models' internal workings, assess their effectiveness, and offer insights into their advantages and disadvantages for diverse application *** neuralnetworks (CNNs) for extracting visual features and long short-term memory (LSTM) networks for producing sequential language are combined in the LSTM+CNN model, a tried-and-true methodology. It has shown adept in creating insightful descriptions for a variety of photographs. On the other hand, VisionGPT2, a GPT-2 architectural extension, makes use of transformers and pretrained language models to provide cutting-edge outcomes in a range of natural language processingapplications. We analyze the viability of each technique by taking into account elements like model complexity, training data needs, and deployment simplicity. This comprehensive comparison enlightens academics, programmers, and businesses on the ideal picture captioning solution for their particular requirements, fostering development in this area and its numerous uses.
The state-of-the-art neural network-based (NN-based) in-loop filters for video coding are built on convolutional neuralnetworks. The Joint Video Experts Team (JVET) activities investigate NN-based in-loop filters for...
详细信息
ISBN:
(数字)9798350349399
ISBN:
(纸本)9798350349405
The state-of-the-art neural network-based (NN-based) in-loop filters for video coding are built on convolutional neuralnetworks. The Joint Video Experts Team (JVET) activities investigate NN-based in-loop filters for two operation points, the high operation point (HOP) which provides highest possible gains at a high complexity and the low operation point (LOP) which is constrained on a low complexity. This paper focuses on the LOP network. We apply a DCT and reshaping to the inputs and an inverse DCT and inverse reshaping to the outputs of LOP. The spatial resolution inside the network is reduced by a factor of four while the final output still has the same number of pixels. The complexity in MAC/pixel (multiplyaccumulate operations per pixel) is therefore also reduced by a factor of four. This freed-up complexity is instead spent on increasing the number of backbone blocks and channels so the LOP complexity is matched. Our network has a complexity of $16.9 \mathrm{kMAC} /$ pixel and 0.2 M parameters (LOP: 17 kMAC/pixel, 0.05 M parameters). The BD-rate impact compared to the NNVC-7.1 anchor is reported to be −0.48% for RA and −0.17% for AI with the float model, and −0.44% for RA and −0.18% for AI with the integer model.
The bad weather events, such as haze, in maritime traffic dramatically reduce the visibility, which can seriously affect the ship navigation especially in areas with intensive port traffic. Meanwhile, unwanted signals...
详细信息
This survey paper examines the advancements and challenges in chatbot technology, focusing on deep neuralnetworks (DNNs) and their application in natural language processing (NLP). It discusses various chatbot models...
详细信息
ISBN:
(数字)9798331505790
ISBN:
(纸本)9798331505806
This survey paper examines the advancements and challenges in chatbot technology, focusing on deep neuralnetworks (DNNs) and their application in natural language processing (NLP). It discusses various chatbot models, including Elizabot, Alicebot, Mitsuku, and Cleverbot, highlighting their evolution from rule-based systems to sophisticated AI conversational agents. The study introduces a specialized chatbot for website integration, emphasizing the importance of swift, accurate, and personalized interactions to enhance customer engagement. Additionally, the paper explores the integration of Large Language Models (LLMs) such as Gemini, GPT and BERT, fine-tuning with deep learning techniques to improve chatbot performance, and their potential to revolutionize customer interactions and business growth.
暂无评论