Mobile robots can move around in the surrounding. They are used in military, industrial applications, for surveillance tasks, etc. These tasks involve multi-floor navigation through the staircase. For efficient and sa...
详细信息
In the contemporary age of advanced technology, image processing plays a vital role as an efficient and indispensable tool across industries, enterprises, monitoring systems, and various applications. This study focus...
详细信息
This paper addresses few-shot semantic segmentation (FSS) guided by text, where we classify unseen novel classes using image and text references as in-context examples, without the need for training. We enhance the qu...
详细信息
Event cameras are promising sensors for on-line and real-time vision tasks due to their high temporal resolution, low latency, and redundant static data elimination. Many vision algorithms use some form of spatial con...
详细信息
ISBN:
(数字)9781665453493
ISBN:
(纸本)9781665453493
Event cameras are promising sensors for on-line and real-time vision tasks due to their high temporal resolution, low latency, and redundant static data elimination. Many vision algorithms use some form of spatial convolution (i.e. spatial pattern detection) as a fundamental component. However, additional consideration must be taken for event cameras, as the visual signal is asynchronous and sparse. While elegant methods have been proposed for event-based convolutions, they are unsuitable for real scenarios due to their inefficient processing pipeline and subsequent low event-throughput. This paper presents an efficient implementation based on decoupling the event-based computations from the computationally heavy convolutions, increasing the maximum event processing rate by 15.92x to over 10 million events/second, while still maintaining the event-based paradigm of asynchronous input and output. Results on public datasets with modern 640 x 480 event-camera recordings show that the proposed implementation achieves real-time processing with minimal impact on the convolution result, while the prior state-of-the-art results in a latency of over 1 second.
In response to the significant risks and low efficiency associated with traditional manual operations on offshore wind turbine towers, a robotic rust removal and painting system tailored for offshore wind turbine towe...
详细信息
This paper presents foot trajectory planning for bipedal robots during the swing phase. This experiment aims to design an efficient and dynamically stable foot trajectory for a 10-DOF (Degree of freedom) bipedal robot...
详细信息
ISBN:
(纸本)9798400700460
This paper presents foot trajectory planning for bipedal robots during the swing phase. This experiment aims to design an efficient and dynamically stable foot trajectory for a 10-DOF (Degree of freedom) bipedal robot. The experiment considers the foot's quadratic, cubic, and quintic polynomial trajectories while climbing the inclined plane. Forward and Inverse kinematics is used to calculate the variation of joint angles during the swing phase. Dynamic Balance Margin DBM has been computed using Zero Moment Point ZMP principle while walking. The angle variation and the ZMP for the aforementioned polynomial trajectories are presented. The maximum DBM is achieved for quintic polynomial trajectory, leading to a considerable reduction in power consumption. Our paper presents a dynamically stable walking pattern of a bipedal robot. It also ensures repeatability, which has been validated using a stick diagram.
Optical Character Recognition (OCR) technologies are crucial for automated information extraction across various domains. However, the intricate layouts and diverse text properties often found on different products ca...
详细信息
ISBN:
(纸本)9798331516246;9798331516239
Optical Character Recognition (OCR) technologies are crucial for automated information extraction across various domains. However, the intricate layouts and diverse text properties often found on different products can complicate accurate data retrieval and categorization. This paper introduces Context vision OCR (CVOCR), a versatile framework designed to address the proposed challenges using advanced image processing and text analysis techniques. While CVOCR is applicable to any OCR-related application, this paper focuses on pharmaceutical items as a case study due to the stringent accuracy requirements and the complexity of medicine packaging. The CVOCR algorithm is developed based on the integration of the Fast Super-Resolution Convolutional Neural Network (FSRCNN) for enhanced image clarity, LayoutLMv2 for spatial layout understanding, Tesseract OCR for robust character recognition, and GPT-Neo for advanced contextual analysis. The strategic integration of these components form a cohesive system that significantly improves text detection and interpretation accuracy. We demonstrate the efficacy of the CVOCR system through testing on various pharmaceutical products, where it consistently outperforms Tesseract OCR.
Highly intelligent robot is the trend in the new round of development, and it is also the ultimate goal of robot navigation technology *** ability to learn is an important embodiment of robot *** navigation is the bas...
详细信息
Tactile exploration of surfaces is a key component of everyday life, allowing us to make complex inferences about our environments even when vision is occluded. The emergence of biomimetic neuromorphic hardware in rec...
详细信息
ISBN:
(纸本)9798350384581;9798350384574
Tactile exploration of surfaces is a key component of everyday life, allowing us to make complex inferences about our environments even when vision is occluded. The emergence of biomimetic neuromorphic hardware in recent years has furthered our ability to create biologically plausible sensing solutions. While these platforms continue to improve in regards to latency and power consumption, within recent literature on tactile texture classification there is an emphasis on accuracy at the expense of real-time processing. In order for these tactile sensing systems to find use outside of experimental laboratory environments, it is key to design systems capable of capturing and processing data in real-time. Within this paper we present a system for the real-time classification of texture using a neuromorphic tactile sensor, a spiking neural network and a novel decision making algorithm. Our real-time system achieves classification accuracies of 94% on a dataset of 11 natural textile textures. Furthermore our system is capable of identifying textures at human-level performance in as little as 84ms. Additionally, benchmarking our system across CPU, GPU and Loihi2 hardware platforms resulted in a 96% reduction in power consumption on the neuromorphic platform. This system out-performed previous work by the authors and the state of art, both in terms of accuracy and classification speed.
Machine learning is a trending topic in the area of computer vision, which makes the machine able to learn about it without being expressly programmed using various algorithms. When a model is designed to predict futu...
详细信息
暂无评论