Ocular mobility disorders such as strabismus affect millions of people. Patients' descriptions of their symptoms, such as what they see and how their vision has changed, are important for ophthalmologists to diagn...
详细信息
ISBN:
(纸本)9781665491907
Ocular mobility disorders such as strabismus affect millions of people. Patients' descriptions of their symptoms, such as what they see and how their vision has changed, are important for ophthalmologists to diagnose, monitor progression, and evaluate treatment effectiveness. However, such verbal depiction may be vague and subjective. A data-driven simulator that visualizes abnormal vision experienced by a strabismic patient can be helpful to objectively illustrate each individual's vision condition and thus to better understand and manage strabismus. To fulfill this technical void, this paper presents the first vision visualization robot that uses human eye movement data to simulate strabismic vision. We developed a robotic binocular eye platform, which is capable of displaying simulated visual scenes using its onboard cameras. Based on the hypothesis that a human's binocular vision fusion process can be mimicked as a homography transformation from one view to another view, we developed a pipeline to estimate the time-varying homography matrix, and generate the fused view of a human's binocular vision. The effectiveness of the proposed method is demonstrated through experiments with eye movement data from both healthy individuals and strabismic patients.
Image inpainting has made huge strides benefiting from the advantages of convolutional neural networks (CNNs) in understanding high-level semantics. Recently, some studies have applied transformers to the visual field...
详细信息
Robust object pose tracking plays an important role in robot manipulation, but it is still an open issue for quickly moving targets as motion blur and low frequency detection can reduce pose estimation accuracy even f...
详细信息
ISBN:
(纸本)9781665491907
Robust object pose tracking plays an important role in robot manipulation, but it is still an open issue for quickly moving targets as motion blur and low frequency detection can reduce pose estimation accuracy even for state-of-the-art RGB-D-based methods. An event-camera is a low-latency vision sensor that can act complementary to RGB-D. Specifically, its sub-millisecond temporal resolution can be exploited to correct for pose estimation inaccuracies due to low frequency RGB-D based detection. To do so, we propose a dual Kalman filter: the first filter estimates an object's velocity from the spatio-temporal patterns of "events", the second filter fuses the tracked object velocity with a low-frequency object pose estimated from a deep neural network using RGB-D data. The full system outputs high frequency, accurate object poses also for fast moving objects. The proposed method works towards low-power robotics by replacing high-cost GPU-based optical flow used in prior work with event-cameras that inherently extract the required signal without costly processing. The proposed algorithm achieves comparable or better performance when compared to two state-of-the-art 6-DoF object pose estimation algorithms and one hybrid event/RGB-D algorithm on benchmarks with simulated and real data. We discuss the benefits and trade-offs for using the event-camera and contribute algorithm, code, and datasets to the community. The code and datasets are available at https://***/event-driven-robotics/Hybrid-object-tracking-with-events-and-frames.
Reservoir dams serve as critical infrastructure. However, during their prolonged operation, cracks tend to appear on surfaces, particularly on slopes. The development of cracks poses risks to the stability of reservoi...
详细信息
Starting from the demand analysis, based on the intelligent six axis robot combined with ultrasonic flaw detection equipment, the advanced robot technology and ultrasonic acquisition, processing and identification tec...
详细信息
The proceedings contain 9 papers. The special focus in this conference is on Design and Architecture for signal and Image processing. The topics include: sEMG-Based Gesture Recognition with Spiking Neural Network...
ISBN:
(纸本)9783031628733
The proceedings contain 9 papers. The special focus in this conference is on Design and Architecture for signal and Image processing. The topics include: sEMG-Based Gesture Recognition with Spiking Neural Networks on Low-Power FPGA;A Highly Configurable Platform for Advanced PPG Analysis;preface;Standalone Nested Loop Acceleration on CGRAs for signalprocessing Applications;optimising Graph Representation for Hardware Implementation of Graph Convolutional Networks for Event-Based vision;Improving the Energy Efficiency of CNN Inference on FPGA Using Partial Reconfiguration;scratchy: A Class of Adaptable Architectures with Software-Managed Communication for Edge Streaming Applications.
Stereo vision is essential for many applications. Currently, the synchronization of the streams coming from two cameras is done using mostly hardware. A software-based synchronization method would reduce the cost, wei...
详细信息
The inconspicuousness of human gait characteristic in radar signal makes it hard to differentiate different identities. In this work, a Dual-stream Siamese vision Transformer with Mutual Attention is proposed to verif...
详细信息
Plankton recognition provides novel possibilities to study various environmental aspects and an interesting real-world context to develop domain adaptation (DA) methods. Different imaging instruments cause domain shif...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Plankton recognition provides novel possibilities to study various environmental aspects and an interesting real-world context to develop domain adaptation (DA) methods. Different imaging instruments cause domain shift between datasets hampering the development of general plankton recognition methods. A promising remedy for this is DA allowing to adapt a model trained on one instrument to other instruments. In this paper, we present a new DA dataset called DAPlankton which consists of phytoplankton images obtained with different instruments. Phytoplankton provides a challenging DA problem due to the fine-grained nature of the task and high class imbalance in real-world datasets. DAPlankton consists of two subsets. DAPlankton(LAB) contains images of cultured phytoplankton providing a balanced dataset with minimal label uncertainty. DAPlankton(SEA) consists of images collected from the Baltic Sea providing challenging real-world data with large intra-class variance and class imbalance. We further present a benchmark comparison of three widely used DA methods.
In modern computer vision tasks, Swin Transformer is an emerging deep learning model architecture that innovates on the basis of the traditional transformer architecture by the introduction of a mechanism of attention...
详细信息
暂无评论