Despite advances in facial beauty prediction, how specific facial regions contribute to perceptions of attractiveness remains largely unexplored, highlighting a critical interpretability gap in this domain. This study...
详细信息
The aim of the work presented in this paper is to develop and evaluate an integrated lecture style evaluation methodology that provides, teachers instant feedback related to the quality of their lecturing style. The p...
详细信息
The Non-myth of the Noble Red is a tangible narrative that combines cardboard puppets with digital storytelling in an integrated physical storyworld. This narrative employs networked microcontrollers within the puppet...
详细信息
Software Quality Assurance (SQA) is a way to verify quality in the software. It is the set of activities which make certain processes, procedures as well as standards fit for the project and implemented appropriately....
详细信息
Neural fields are receiving increased attention as a geometric representation due to their ability to compactly store detailed and smooth shapes and easily undergo topological changes. Compared to classic geometry rep...
详细信息
Latent representations are extensively used for tasks like visualization, interpolation, or feature extraction in deep learning models. This paper demonstrates the importance of considering the inductive bias imposed ...
详细信息
Latent representations are extensively used for tasks like visualization, interpolation, or feature extraction in deep learning models. This paper demonstrates the importance of considering the inductive bias imposed by an equivariant model when using latent representations as neglecting these biases can lead to decreased performance in downstream tasks. We propose principles for choosing invariant projections of latent representations and show their effectiveness in two examples: A permutation equivariant variational autoencoder for molecular graph generation, where an invariant projection can be designed to maintain information without loss, and for a rotation-equivariant representation in image classification, where random invariant projections proves to retain a high degree of information. In both cases, the analysis of invariant latent representations proves superior to their equivariant counterparts. Finally, we illustrate that the phenomena documented here for equivariant neural networks have counterparts in standard neural networks where invariance is encouraged via augmentation. Copyright 2024 by the author(s)
Simulation studies require thorough validation to ensure model accuracy, reliability, and credibility. While validation typically focuses on the simulation model itself, additional artifacts also influence study outco...
详细信息
We introduce a novel end-to-end deeplearning solution for rapidly estimating a dense spherical depth map of an indoor *** input is a single equirectangular image registered with a sparse depth map,as provided by a var...
详细信息
We introduce a novel end-to-end deeplearning solution for rapidly estimating a dense spherical depth map of an indoor *** input is a single equirectangular image registered with a sparse depth map,as provided by a variety of common capture *** is inferred by an efficient and lightweight single-branch network,which employs a dynamic gating system to process together dense visual data and sparse geometric *** exploit the characteristics of typical man-made environments to efficiently compress multiresolution features and find short-and long-range relations among scene ***,we introduce a new augmentation strategy to make the model robust to different types of sparsity,including those generated by various structured light sensors and LiDAR *** experimental results demonstrate that our method provides interactive performance and outperforms stateof-the-art solutions in computational efficiency,adaptivity to variable depth sparsity patterns,and prediction accuracy for challenging indoor data,even when trained solely on synthetic data without any fine tuning.
This study introduces CLIP-Flow,a novel network for generating images from a given image or *** effectively utilize the rich semantics contained in both modalities,we designed a semantics-guided methodology for image-...
详细信息
This study introduces CLIP-Flow,a novel network for generating images from a given image or *** effectively utilize the rich semantics contained in both modalities,we designed a semantics-guided methodology for image-and text-to-image *** particular,we adopted Contrastive Language-Image Pretraining(CLIP)as an encoder to extract semantics and StyleGAN as a decoder to generate images from such ***,to bridge the embedding space of CLIP and latent space of StyleGAN,real NVP is employed and modified with activation normalization and invertible *** the images and text in CLIP share the same representation space,text prompts can be fed directly into CLIP-Flow to achieve text-to-image *** conducted extensive experiments on several datasets to validate the effectiveness of the proposed image-to-image synthesis *** addition,we tested on the public dataset Multi-Modal CelebA-HQ,for text-to-image *** validated that our approach can generate high-quality text-matching images,and is comparable with state-of-the-art methods,both qualitatively and quantitatively.
暂无评论