We propose PyTorchGeoNodes, a differentiable module for reconstructing 3D objects from images using interpretable shape programs. In comparison to traditional CAD model retrieval methods, the use of shape programs for...
详细信息
Recently, morphing attack detection (MAD) solutions have achieved remarkable success with the aid of deep learning techniques. Despite the good performance achieved by binary label or binary pixel-wise supervised MAD ...
Recently, morphing attack detection (MAD) solutions have achieved remarkable success with the aid of deep learning techniques. Despite the good performance achieved by binary label or binary pixel-wise supervised MAD models, the robustness of such models drops when facing variations in morphing attacks. In this work, we propose a novel process that leverages facial depth information to build a robust and generalized MAD. The depth map, representing the 3D shape of the face in a 2D image, is more informative compared to binary and binary pixel-wise map labels. To validate the idea we synthetically generated 3D depth map ground truth. Furthermore, we introduce a novel MAD architecture designed to capture subtle information from the 3D depth data. In addition, we analyze the training loss formulation to further enhance the MAD performance. Driven by the need for developing MAD solutions while preserving the privacy of individuals for legal and ethical reasons, we conduct our experiments on privacy-friendly synthetic training data and authentic evaluation data. The experimental results on existing public datasets in SYN-MAD 22 competition demonstrate the effectiveness of our proposed solution in terms of both robustness and generalization.
In this work we tackle the problem of estimating the density fX of a random variable X by successive smoothing, such that the smoothed random variable Y fulfills the diffusion partial differential equation (∂t − ∆1)fY...
详细信息
Surgical phase recognition is an important aspect of surgical workflow analysis, as it allows an automatic analysis of the performance and efficiency of surgical procedures. A big challenge for training a neural netwo...
详细信息
The European Cultural Heritage Strategy for the 21st century has led to an increased demand for fast, efficient and faithful 3D digitization technologies for cultural heritage artefacts. Yet, unlike the digital acquis...
This paper presents an application of a mixture of Hidden Markov Models (HMMs) as a tool for verification of IoT fuel sensors. The IoT fuel sensors report the level of fuel in tanks of a petrol station, and are a key ...
详细信息
Pedestrians and cyclists suffer the most serious injuries in traffic accidents. Existing Pedestrian Protection Systems and Road Safety Systems rely on an ideal model of pedestrian behavior and do not consider that peo...
详细信息
ISBN:
(数字)9781665463829
ISBN:
(纸本)9781665463836
Pedestrians and cyclists suffer the most serious injuries in traffic accidents. Existing Pedestrian Protection Systems and Road Safety Systems rely on an ideal model of pedestrian behavior and do not consider that people tend to take shortcuts, appear at unexpected places or can be distracted on the road, for example, by using a smartphone or wearing headphones. Collecting and analyzing realistic road user behavior is a crucial component to improve pedestrian and cyclist safety. However, such real-world data is still missing. To address this, we propose a visual surveillance system with two perpendicular partially overlapping fields of view, combined with a fully automated deep learning-based pipeline to process and collect video observations, detect and extract road user trajectories in real-world coordinates and estimate human attributes, such as age, gender, smartphone usage, etc. We demonstrate our prototype by deploying it in two locations in a European city.
The resolution of optical imaging is limited by diffraction as well as detector noise. However, thermal imaging exhibits an additional unique phenomenon of ghosting which results in blurry and low-texture images. Here...
详细信息
Sequence-to-sequence models provide a feasible new approach for generative text summarization, but these models are not able to accurately reproduce factual details and subject information. To address the problem of u...
详细信息
Image fusion typically employs non-invertible neural networks to merge multiple source images into a single fused image. However, for clinical experts, solely relying on fused images may be insufficient for making dia...
详细信息
暂无评论