Effective video frame interpolation hinges on the adept handling of motion in the input scene. Prior work acknowledges asynchronous event information for this, but often overlooks whether motion induces blur in the vi...
详细信息
The goal of this paper is to present a robotic workcell to automate several tasks of the cementing process in footwear manufacturing. Our cell's main applications are sole digitization of a wide variety of footwea...
ISBN:
(数字)9781728189567
ISBN:
(纸本)9781728189574
The goal of this paper is to present a robotic workcell to automate several tasks of the cementing process in footwear manufacturing. Our cell's main applications are sole digitization of a wide variety of footwear, glue dispensing and sole grasping from conveyor belts. This cell is made up of a manipulator arm endowed with a gripper, a conveyor belt and a 3D scanner. We have integrated all the elements into a ROS simulation environment facilitating control and communication among them, also providing flexibility to support future extensions. We propose a novel method to grasp soles of different shape, size and material, exploiting the particular characteristics of these objects. Our method relies on object contour extraction using concave hulls. We evaluate it on point clouds of 16 digitized real soles in three different scenarios: concave hull, k-NNs extension and PCA correction. While we have tested this workcell in a simulated environment, the presented system's performance is scheduled to be tested on a real setup at INESCOP facilities in the upcoming months.
Cross domain object detection is a realistic and challenging task in the wild. It suffers from performance degradation due to large shift of data distributions and lack of instance-level annotations in the target doma...
详细信息
This study assesses the outcomes of the NTIRE 2023 Challenge on Non-Homogeneous Dehazing, wherein novel techniques were proposed and evaluated on new image dataset called HD-NH-HAZE. The HD-NH-HAZE dataset contains 50...
详细信息
Detection of moving objects is an essential capability in dealing with dynamic environments. Most moving object detection algorithms have been designed for color images without depth. For robotic navigation where real...
详细信息
ISBN:
(数字)9781728173955
ISBN:
(纸本)9781728173962
Detection of moving objects is an essential capability in dealing with dynamic environments. Most moving object detection algorithms have been designed for color images without depth. For robotic navigation where real-time RGBD data is often readily available, utilization of the depth information would be beneficial for obstacle recognition. Here, we propose a simple moving object detection algorithm that uses RGB-D images. The proposed algorithm does not require estimating a background model. Instead, it uses an occlusion model which enables us to estimate the camera pose on a background confused with moving objects that dominate the scene. The proposed algorithm allows to separate the moving object detection and visual odometry (VO) so that an arbitrary robust VO method can be employed in a dynamic situation with a combination of moving object detection, whereas other VO algorithms for a dynamic environment are inseparable. In this paper, we use dense visual odometry (DVO) as a VO method with a bi-square regression weight. Experimental results show the segmentation accuracy and the performance improvement of DVO in the situations. We validate our algorithm in public datasets and our dataset which also publicly accessible.
Due to their high capacity in capturing 3D spatial information, 3D Fully Convolutional Neural Networks (3D FCNs), especially 3D U-Net, are prevalent for volumetric medical image segmentation. However, 3D convolutions ...
详细信息
ISBN:
(纸本)9781665412469;9781665429474
Due to their high capacity in capturing 3D spatial information, 3D Fully Convolutional Neural Networks (3D FCNs), especially 3D U-Net, are prevalent for volumetric medical image segmentation. However, 3D convolutions are much more computationally complex than 2D convolutions and thus, are more prone to overfitting. This paper proposes Collaborative Multi-View convolutions (CMV convs) that can keep the model complexity similar to those employing 2D convolutions while capturing the 3D spatial context like 3D convolutions. Specifically, CMV convs simultaneously extract information from three orthogonal views with three parameter-shared 2D convolutions. A Global-Guided Gating mechanism (3G) is further designed that selectively passes information from CMV convs to the next stage. Combined with 3G, a CMV conv become a G-CMV conv that constitutes a plug-and-play module, which can be easily integrated into various 3D CNNs for image segmentation. Extensive experiments utilizing BraTS18 dataset have been conducted. Our method achieves competitive results compared to state-of-the-art methods with over 10× fewer parameters than 3D-UNet.
Image quality assessment (IQA) is the key factor for the fast development of image restoration (IR) algorithms. The most recent IR methods based on Generative Adversarial Networks (GANs) have achieved significant impr...
详细信息
vision transformers (ViTs) have become the popular structures and outperformed convolutional neural networks (CNNs) on various vision tasks. However, such powerful transformers bring a huge computation burden, because...
详细信息
This work studies and defines the problem of providing extensive and opportunistic Edge AI-based area coverage in smart city application scenarios, by researching and determining the optimal configuration of sensing a...
详细信息
This work studies and defines the problem of providing extensive and opportunistic Edge AI-based area coverage in smart city application scenarios, by researching and determining the optimal configuration of sensing and computational resources for minimizing the environmental/technology footprint of the solution. A typical smart city computing continuum consists of statically installed multimodal sensing Internet-of-Things (IoT) nodes at various city locations, accompanied by interconnected computational Cloud/Edge/IoT nodes. This paper presents Optimal Trustworthy EdgeAI (OTE), an entirely novel research pipeline, that complements existing smart city infrastructure with intelligent drone Edge/IoT nodes (in the form of modularly equipped unmanned aerial vehicles), capable of autonomous repositioning according to individual/collective sensing and coverage criteria. Thereby, we envisage the emerging cutting-edge technologies of trustworthy sensing, perceiving, modelling technologies for predicting the behavior of moving targets (e.g., citizens/vehicles/objects), understanding natural phenomena (e.g., sea wave motion, urban flora/fauna, biodiversity) in order to anticipate events (people's bad habits, environmental changes), by exploiting novel continuous data processing services across the whole span of the enhanced Cloud-Edge-IoT computing continuum.
Image quality assessment (IQA) is the key factor for the fast development of image restoration (IR) algorithms. The most recent IR methods based on Generative Adversarial Networks (GANs) have achieved significant impr...
详细信息
暂无评论