Implicit Neural Representations (INRs) are powerful to parameterize continous signals in computervision. However, almost all INRs methods are limited to low-level tasks, e.g., image/video compression, super-resolutio...
详细信息
Safety management in tunnel construction is important to ensure the safety of construction personnel and equipment. this paper proposes a tunnel engineering risk early warning model based on computervision and convol...
详细信息
In this study, a method is proposed to represent the corresponding viewpoints and features in the form of graphs for observing the sentiment of the subscribers based on product review. the aim is to observe and empath...
详细信息
Human action recognition (HAR) is a crucial field in computervision with applications ranging from video surveillance to human-computer interaction. this study explores an efficient framework for HAR by leveraging hu...
详细信息
ISBN:
(纸本)9783031837951;9783031837968
Human action recognition (HAR) is a crucial field in computervision with applications ranging from video surveillance to human-computer interaction. this study explores an efficient framework for HAR by leveraging human keypoint extraction using Google's Mediapipe and Long Short-Term Memory (LSTM) networks. the Mediapipe framework provides accurate and real-time human keypoint extraction, significantly reducing the computational complexity associated with raw video data processing. these keypoints, representing skeletal movements, are utilised as input to LSTM networks, which capture the temporal dependencies vital for action classification. the proposed method is evaluated on two benchmark datasets: UCF101 and Kinetics 400. UCF101 contains 13,320 video clips across 101 action classes, while Kinetics 400 features 400 human action categories. the combination of Mediapipe for feature extraction and LSTM for temporal modelling achieves 92.40% and 86.8% accuracy on UCF101 and Kinetics400, respectively. the results demonstrate the effectiveness of keypoint-based approaches for HAR. this study highlights the potential of lightweight, real-time human action recognition frameworks suitable for resource-constrained environments, especially for edgeAI and edge Robotics.
Scene text recognition has a wide range of applications. Its deep learning algorithm model is mainly CNN and RNN hybrid neural network, which can effectively identify text information in natural scenes. However, compl...
详细信息
this research addresses the critical challenge of enhancing object recognition and real-Time response capabilities in autonomous vehicles (AVs) under varying simulated conditions, which is crucial for ensuring both na...
详细信息
the implementation of several agriculture-related issues was made simple by advancements in computervision technology. the detection of fruit diseases is one such issue. Using deep learning techniques, a lot of study...
详细信息
Vehicle based logistics are hinged on their ability to timely deliver goods, services, and people. the classical expression of "time is money" comes alive in the logistics industry yielding potentially huge ...
详细信息
ISBN:
(纸本)9781450398329
Vehicle based logistics are hinged on their ability to timely deliver goods, services, and people. the classical expression of "time is money" comes alive in the logistics industry yielding potentially huge financial and health consequences in case of missing deadlines. this is especially the case for time sensitive pharmaceuticals, delivery of perishable goods, delivery of people travelling, delivery of services in fault fixing/recovery sector. All these use cases motivate the need for an immutable, secure, and immortalized process of tracking time. To solve this challenge, this paper presents prototype-based research that integrates the 4th industrial revolution technologies of vision Internet of things (IoT), Artificial Intelligence (AI)-based Optical Character recognition (OCR) and blockchain. the developed prototype features a Raspberry-PI board embedding a camera, an Artificial Intelligence (AI) model to recognize plate letters from the image and a crypto wallet to sign the logging of plate number and time events on the NEAR blockchain, an emerging sharded, proof-of-stake, layer-one blockchain that is simple to use, secure and scalable. the effective operation of the developed prototype has been validated inside a campus parking and shows an accuracy of 80%. the benefits of transparency, security, and immutability of the blockchain combined withthe intelligence, data capture, and processing of IoT will enable to develop accountability solutions trusted by all different logistic stakeholders.
image segmentation is of great importance in many areas including artificial intelligence, imageprocessing, computervision, etc. the optimization of image segmentation can be converted to seek the best threshold val...
详细信息
Visual Question Answering (VQA) is an intricate and demanding task that integrates natural language processing (NLP) and computervision (CV), capturing the interest of researchers. the English language, renowned for ...
详细信息
暂无评论