检索结果-内蒙古大学图书馆

Correspondence Distillation from NeRF-Based GAN

INTERNATIONAL JOURNAL OF computer VISION 2024年第3期132卷 611-631页

作者： Lan, Yushi Loy, Chen Change Dai, Bo Nanyang Technol Univ Lab S Singapore Singapore Shanghai AI Lab Shanghai Peoples R China

The neural radiance field (NeRF) has shown promising results in preserving the fine details of objects and scenes. However, unlike explicit shape representations e.g., mesh, it remains an open problem to build dense correspondences across different NeRFs of the same category, which is essential in many downstream tasks. The main difficulties of this problem lie in the implicit nature of NeRF and the lack of ground-truth correspondence annotations. In this paper, we show it is possible to bypass these challenges by leveraging the rich semantics and structural priors encapsulated in a pre-trained NeRF-based GAN. Specifically, we exploit such priors from three aspects, namely (1) a dual deformation field that takes latent codes as global structural indicators, (2) a learning objective that regards generator features as geometric-aware local descriptors, and (3) a source of infinite object-specific NeRF samples. Our experiments demonstrate that such priors lead to 3D dense correspondence that is accurate, smooth, and robust. We also show that established dense correspondence across NeRFs can effectively enable many NeRF-based downstream applications such as texture transfer.

关键词： Neural radiance field Dense correspondence Generative modeling Shape analysis computer vision computer graphics

来源：评论

学校读者我要写书评

暂无评论

Point'n Move: Interactive scene object manipulation on Gaussian splatting radiance fields

引用

IET IMAGE PROCESSING 2024年第12期18卷 3507-3517页

作者： Huang, Jiajun Yu, Hongchuan Zhang, Jianjun Nait-Charif, Hammadi Natl Ctr Comp Animat Poole BH12 5BB Dorset England

The authors propose Point'n Move, a method that achieves interactive scene object manipulation with exposed region inpainting. Interactivity here further comes from intuitive object selection and real-time editing. To achieve this, Gaussian Splatting Radiance Field is adopted as the scene representation and its explicit nature and speed advantage are fully leveraged. Its explicit representation formulation allows to devise a 2D prompt points to 3D masks dual-stage self-prompting segmentation algorithm, perform mask refinement and merging, minimize changes, and provide good initialization for scene inpainting and perform editing in real-time without per-editing training;all lead to superior quality and performance. The method was tested by editing both forward-facing and 360 scenes. The method is also compared against existing methods, showing superior quality despite being more capable and having a speed advantage. We propose Point'n Move, a method that achieves interactive scene object manipulation with exposed region inpainting. Interactivity here refers to intuitive object selection and real-time editing. This is achieved by devising a pipeline that fully exploits the explicit nature of our adopted scene representation. Our method achieves superior quality against existing object removal methods despite being more capable and having a speed advantage. image

关键词： computer animation computer graphics computer vision

来源：评论

学校读者我要写书评

暂无评论

Self-attention residual network-based spatial super-resolution synthesis for time-varying volumetric data

引用

IET IMAGE PROCESSING 2024年第6期18卷 1579-1597页

作者： Ma, Ji Ye, Yuhao Chen, Jinjin Zhejiang Univ Technol Sch Comp Sci & Technol Hangzhou Peoples R China Commun Univ Zhejiang Sch Design & Art Hangzhou Peoples R China Macau Univ Sci & Technol Fac Humanities & Arts Macau Peoples R China

In the field of scientific visualization, the upscaling of time-varying volume is meaningful. It can be used in in situ visualization to help scientists overcome the limitations of I/O speed and storage capacity when analysing and visualizing large-scale, time-varying simulation data. This paper proposes self-attention residual network-based spatial super-resolution (SARN-SSR), a spatial super-resolution model based on self-attention residual networks that can generate time-varying data with temporal coherence. SARN-SSR consists of two components: a generator and a discriminator. The generator takes the low-resolution volume sequences as the input and gives the corresponding high-resolution volume sequences as the output. The discriminator takes both synthesized and real high-resolution volume sequence as the input and gives a matrix to predict the realness as the output. To verify the validity of SARN-SSR, four sets of time-varying volume datasets are applied from scientific simulation. In addition, SARN-SSR is compared on these datasets, both qualitatively and quantitatively, with two deep learning-based techniques and one traditional technique. The experimental results show that by using this method, the closest time-varying data to the ground truth can be obtained. This paper proposes a novel self-attention residual network-based spatial super-resolution (SARN-SSR) framework for upscaling time-varying volume data in scientific visualization. It utilizes a generator and discriminator based on generative adversarial networks to generate high-resolution volume sequences. Comparative evaluations demonstrate that SARN-SSR outperforms state-of-the-art techniques in generating accurate time-varying volume datasets. image

关键词： computer graphics data visualisation image processing

来源：评论

学校读者我要写书评

暂无评论

Investigating Macroexpressions and Microexpressions in computer graphics Animated Faces

引用

PRESENCE-VIRTUAL AND AUGMENTED REALITY 2014年第2期23卷 191-208页

作者： Queiroz, Rossana B. Musse, Soraia R. Sadler, Norman I. Pontificia Univ Catolica Rio Grande do Sul BR-90619900 Porto Alegre RS Brazil Univ Penn Dept Comp & Informat Sci Philadelphia PA 19104 USA

Due to varied personal, social, or even cultural situations, people sometimes conceal or mask their true emotions. These suppressed emotions can be expressed in a very subtle way by brief movements called microexpressions. We investigate human subjects' perception of hidden emotions in virtual faces, inspired by recent psychological experiments. We created animations with virtual faces showing some facial expressions and inserted brief secondary expressions in some sequences, in order to try to convey a subtle second emotion in the character Our evaluation methodology consists of two sets of experiments, with three different sets of questions. The first experiment verifies that the accuracy and concordance of the participant's responses with synthetic faces matches the empirical results done with photos of real people in the paper by X.-b. Shen, Q. Wu, and X.-I. Fu, 2012, "Effects of the duration of expressions on the recognition of microexpressions," Journal of Zhejiang University Science 8, 13(3), 221-230. The second experiment verifies whether participants could perceive and identify primary and secondary emotions in virtual faces. The third experiment tries to evaluate the participant's perception of realism, deceit, and valence of the emotions. Our results show that most of the participants recognized the foreground (macro) emotion and most of the time they perceived the presence of the second (micro) emotion in the animations, although they did not identify it correctly in some samples. This experiment exposes the benefits of conveying microexpressions in computer graphics characters, as they may visually enhance a character's emotional depth through subliminal microexpression cues, and consequently increase the perceived social complexity and believability.

关键词： computer graphics HUMAN face recognition (computer science) REALISM computer-generated imagery EMPIRICAL research VIRTUAL reality

来源：评论

学校读者我要写书评

暂无评论

LFS-Aware Surface Reconstruction From Unoriented 3D Point Clouds

引用

IEEE TRANSACTIONS ON MULTIMEDIA 2024年 26卷 11415-11427页

作者： Fu, Rao Hormann, Kai Alliez, Pierre INRIA F-06902 Sohia Antipolis France Geometry Factory F-06560 Antibes France Univ Svizzera Italiana USI CH-6900 Lugano Switzerland

We present a novel approach for generating isotropic surface triangle meshes directly from unoriented 3D point clouds, with the mesh density adapting to the estimated local feature size (LFS). Popular reconstruction pipelines first reconstruct a dense mesh from the input point cloud and then apply remeshing to obtain an isotropic mesh. The sequential pipeline makes it hard to find a lower-density mesh while preserving more details. Instead, our approach reconstructs both an implicit function and an LFS-aware mesh sizing function directly from the input point cloud, which is then used to produce the final LFS-aware mesh without remeshing. We combine local curvature radius and shape diameter to estimate the LFS directly from the input point clouds. Additionally, we propose a new mesh solver to solve an implicit function whose zero level set delineates the surface without requiring normal orientation. The added value of our approach is generating isotropic meshes directly from 3D point clouds with an LFS-aware density, thus achieving a trade-off between geometric detail and mesh complexity. Our experiments also demonstrate the robustness of our method to noise, outliers, and missing data and can preserve sharp features for CAD point clouds.

关键词： Surface reconstruction Point cloud compression Three-dimensional displays Shape Estimation Surface fitting Scalability I.3.5 [Computing methodologies] computer graphics computational geometry and object modeling

来源：评论

学校读者我要写书评

暂无评论

FSKT-GE: Feature maps similarity knowledge transfer for low-resolution gaze estimation

引用

IET IMAGE PROCESSING 2024年第6期18卷 1642-1654页

作者： Yan, Chao Pan, Weiguo Dai, Songyin Xu, Bingxin Xu, Cheng Liu, Hongzhe Li, Xuewei Beijing Union Univ Beijing Key Lab Informat Serv Engn Beijing 100101 Peoples R China Beijing Union Univ Inst Brain & Cognit Sci Coll Robot Beijing Peoples R China

The limited of texture details information in low-resolution facial or eye images presents a challenge for gaze estimation. To address this, FSKT-GE (feature maps similarity knowledge transfer for low-resolution gaze estimation) is proposed, a gaze estimation framework consisting of both a high resolution (HR) network and low resolution (LR) network with the identical structure. Rather than mere feature imitation, this issue is addressed by assessing the cosine similarity of feature layers, emphasizing the distribution similarity between the HR and LR networks. This enables the LR network to acquire richer knowledge. This framework utilizes a combination loss function, incorporating cosine similarity measurement, soft loss based on probability distribution difference and gaze direction output, along with a hard loss from the LR network output layer. This approach on low-resolution datasets derived from Gaze360 and RT-Gene datasets is validated, demonstrating excellent performance in low-resolution gaze estimation. Evaluations on low-resolution images obtained through 2x, 4x, and 8x down-sampling are conducted on two datasets. On the Gaze360 dataset, the lowest mean angular errors of 10.97 degrees, 11.22 degrees, and 13.61 degrees were achieved, while on the RT-Gene dataset, the lowest mean angular errors of 6.73 degrees, 6.83 degrees, and 7.75 degrees were obtained. Here, a novel approach called feature map similarity-based knowledge transfer for low-resolution gaze estimation (FSKT-GE) is proposed. The motivation behind this work is to address the challenge of accurately estimating gaze direction for low-resolution facial images encountered in unconstrained outdoor environments. image

关键词： computer graphics computer vision convolutional neural nets

来源：评论

学校读者我要写书评

暂无评论

ElectroEncephalographics Making Waves in computer graphics Research

引用

IEEE computer graphics AND APPLICATIONS 2014年第6期34卷 46-56页

作者： Mustafa, Maryam Magnor, Marcus Tech Univ Carolo Wilhelmina Braunschweig Comp Graph Lab Dept Comp Sci Braunschweig Germany

Electroencephalography (EEG) is a novel modality for investigating perceptual graphics problems. Until recently, EEG has predominantly been used for clinical diagnosis, in psychology, and by the brain-computer-interface community. Researchers are extending it to help understand the perception of visual output from graphics applications and to create approaches based on direct neural feedback. Researchers have applied EEG to graphics to determine perceived image and video quality by detecting typical rendering artifacts, to evaluate visualization effectiveness by calculating the cognitive load, and to automatically optimize rendering parameters for images and videos on the basis of implicit neural feedback.

关键词： aesthetics Biomedical image processing computer graphics computer vision EEG Electrodes Electroencephalography graphics Image coding perception rendering Rendering (computer graphics) Video sequences Visualization

来源：评论

学校读者我要写书评

暂无评论

Improve Computing Efficiency and Motion Safety by Analyzing Environment With graphics

引用

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2024年第3期21卷 4613-4626页

作者： Zhang, Qianyi Wu, Shichao Jia, Yuhang Xu, Yuang Liu, Jingtai Nankai Univ Inst Robot & Automat Informat Syst Tianjin 300350 Peoples R China

Exploring topologically distinctive trajectories provides more options for robot motion planning. Since computing time grows greatly with environment complexity, improving exploration efficiency and picking the optimal trajectory in complex environments are critical issues. To this end, this paper proposes a Graphic-and Timed-Elastic-Band-based approach (GraphicTEB) with spatial completeness and high computing efficiency. The environment is analyzed utilizing computer graphics, where obstacles are extracted as nodes and their relationships are built as edges. Three contributions are presented. 1) By assembling directed detours formed by nodes and segmented paths formed by edges, a generalized path consisting of nodes and edges derives various normal paths efficiently. 2) By multiplying two vectors starting from the obstacle point closest to the waypoint and the boundary point farthest from the waypoint, an novel obstacle gradient is introduced to guide safer optimization. 3) By assigning edges with asymmetric Gaussian model, a trajectory evaluation strategy is designed to reflect the motion tendency and motion uncertainty of dynamic obstacles. Qualitative and quantitative simulations demonstrate that the proposed GraphicTEB achieves spatial completeness, higher scene pass rate, and fastest computing efficiency. Experiments are implemented in long corridor and broad room scenarios, where the robot goes through gaps safely, finds trajectories quickly, and passes pedestrians politely Note to Practitioners-The motivation stems from the fact that our daily cruising robot occasionally gets trapped in a corridor with piled obstacles or in a complex dynamic crowd due to the lack of a reliable trajectory. The solution is to search for more topologically distinctive trajectories and pick the optimal one. Considering that existing open-source approaches are either incomplete or highly time-consuming, a method for clustering and searching trajectories in the obstacle-occupied r

关键词： Trajectory Robots Planning Safety Dynamics Collision avoidance Robot sensing systems Motion planning computer graphics timed elastic band (TEB) homology class of trajectories

来源：评论

学校读者我要写书评

暂无评论

A Multi-aperture Coaxial Projector Balancing Shadow Suppression and Deblurring

引用

IEEE TRANSACTIONS ON VISUALIZATION AND computer graphics 2024年第11期30卷 7031-7041页

作者： Kusuyama, Hiroki Kageyama, Yuta Iwai, Daisuke Sato, Kosuke Osaka Univ Suita Osaka Japan

This paper proposes a projection system that optically removes the cast shadow in projection mapping. Specifically, we realize the large-aperture (LA) projection using a large-format Fresnel lens to suppress cast shadows by condensing the projection light from a wide viewing angle. However, the resolution and contrast of the projected results are significantly degraded by defocus blur, veiling glare, and stray light caused by the aberration of an LA Fresnel lens. To solve the technical problems, we employ two different approaches: optical and digital image processing methods. First, we introduce a residual projector with a typical aperture lens on the same optical axis as the LA projector, projecting the residual (i.e., high-frequency) components attenuated in the LA projection. These projectors play different roles in shadow suppression and blur compensation, both achieved by projecting simultaneously. Secondly, we optimize the pair of projection images that can balance the shadow suppression and deblurring performance of our projection system. We implemented a proof-of-concept prototype and validated the above-mentioned techniques through projection experiments and a user study.

关键词： Human-centered computing Human computer interaction (HCI) Interaction devices Displays and imagers Computing methodologies computer graphics graphics systems and interfaces Mixed / augmented reality

来源：评论

学校读者我要写书评

暂无评论

Local Geometric Indexing of High Resolution Data for Facial Reconstruction From Sparse Markers

引用

IEEE TRANSACTIONS ON VISUALIZATION AND computer graphics 2024年第8期30卷 5289-5298页

作者： Cong, Matthew Lan, Lana Fedkiw, Ronald Ind Light & Mag San Francisco CA 94129 USA Stanford Univ Dept Comp Sci Stanford CA 94305 USA

When considering sparse motion capture marker data, one typically struggles to balance its overfitting via a high dimensional blendshape system versus underfitting caused by smoothness constraints. With the current trend towards using more and more data, our aim is not to fit the motion capture markers with a parameterized (blendshape) model or to smoothly interpolate a surface through the marker positions, but rather to find an instance in the high resolution dataset that contains local geometry to fit each marker. Just as is true for typical machine learning applications, this approach benefits from a plethora of data, and thus we also consider augmenting the dataset via specially designed physical simulations that target the high resolution dataset such that the simulation output lies on the same so-called manifold as the data targeted.

关键词： Shape Faces Geometry Surface reconstruction Cameras Point cloud compression Deformation computer graphics image processing and computer vision interpolation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：