In the domain of computer graphics, achieving high visual quality in real-time rendering remains a formidable challenge due to the inherent time-quality tradeoff. Conventional real-time rendering engines sacrifice visual fidelity for interactive performance, while image generation using path-tracing techniques can be exceedingly time-consuming. In this article, we introduce RenderGAN, a deep learning-based solution designed to address this critical challenge in real-time rendering. RenderGAN uses G-Buffers and information from a real-time rendering engine as inputs to produce output images with exceptional visual fidelity. Its encoder-decoder architecture, trained using the Generative Adversarial Network (GAN) framework with perceptual loss, enhances image realism. To evaluate RenderGAN's effectiveness, we quantitatively compare the generated images with those of a path-tracing engine, obtaining a remarkable Universal Image Quality Index (UIQI) value of 0.898. RenderGAN's open source nature fosters collaboration, driving advancements in real-time computer graphics and rendering techniques. By bridging the gap between real-time and path-tracing rendering, RenderGAN opens new horizons for accelerated image generation, inspiring innovation and unlocking the full potential of real-time visual experiences. Project page: https://***/marcomameli1992/RenderNet
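The Universal Image Quality Index (UIQI) the abstract reports combines correlation, luminance distortion, and contrast distortion of two images into a single score in [-1, 1]. A minimal global sketch of the formula follows (in practice UIQI is computed over sliding windows and averaged; the pixel lists here are illustrative):

```python
def uiqi(x, y):
    """Universal Image Quality Index between two equal-length pixel lists."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    var_x = sum((v - mx) ** 2 for v in x) / (n - 1)
    var_y = sum((v - my) ** 2 for v in y) / (n - 1)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / (n - 1)
    # Q = (4 * cov * mx * my) / ((var_x + var_y) * (mx^2 + my^2))
    return (4 * cov * mx * my) / ((var_x + var_y) * (mx ** 2 + my ** 2))

# Identical images score the maximum value of 1.0.
print(uiqi([1, 2, 3, 4], [1, 2, 3, 4]))
```

A score of 0.898 against a path-traced reference therefore indicates close structural and luminance agreement.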
Multi-object tracking (MOT) is a fundamental problem in computer vision that involves tracing the trajectories of foreground targets throughout a video sequence while establishing correspondences for identical objects across frames. With the advancement of deep learning techniques, methods based on deep learning have significantly improved accuracy and efficiency in MOT. This paper reviews several recent deep learning-based MOT methods and categorises them into three main groups: detection-based, single-object tracking (SOT)-based, and segmentation-based methods, according to their core technologies. Additionally, this paper discusses the metrics and datasets used for evaluating MOT performance, the challenges faced in the field, and future directions for research.
Underwater imaging techniques have long been a focus of computer vision research. Underwater imaging frequently suffers from poor image quality and slow restoration speed, hindering human underwater exploration. To enhance quality and improve the real-time performance of underwater image restoration, this paper proposes a lightweight underwater color image restoration network based on multiscale depthwise separable convolution (MDSCN). First, the algorithm tackles the problems of difficult convergence and slow training by improving the AdamW optimizer. Then, we propose a multiscale depthwise separable convolution module operating on the RGB channels, which enables efficient extraction of image features based on underwater light propagation properties. The MDSCN effectively improves both the processing speed and the recovery quality of underwater images. Through experiments and analysis, our algorithm outperforms traditional image processing methods and recent deep learning approaches in terms of visual quality and objective evaluation metrics. Furthermore, our algorithm also outperforms existing deep learning methods in processing speed, demonstrating excellent generalizability and practicality. This research is highly informative for the field of underwater computer vision. The dataset, training weight files, and code are publicly available at https://***/raining-li/underwater-image-processing/tree/master.
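The lightweight design in the abstract rests on depthwise separable convolution: a per-channel k×k (depthwise) filter followed by a 1×1 (pointwise) convolution that mixes channels. The parameter counts below (illustrative only, not the paper's exact MDSCN module) show why this factorization is much cheaper than a standard convolution:

```python
def standard_conv_params(k, c_in, c_out):
    # A standard conv learns one k x k x c_in filter per output channel.
    return k * k * c_in * c_out

def depthwise_separable_params(k, c_in, c_out):
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1x1 conv mixing channels
    return depthwise + pointwise

# e.g. a 3x3 layer mapping 64 -> 128 channels:
std = standard_conv_params(3, 64, 128)
sep = depthwise_separable_params(3, 64, 128)
print(std, sep, round(std / sep, 1))  # roughly an 8x parameter reduction
```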
With the advancement of graphic engines, real-life structures can be digitized with more realistic representations than before. Virtual models obtained from LiDAR (Light Detection and Ranging) data in real-time applications can be inspected in graphic engines without rendering a point cloud. Well-known proprietary software is used to convert scans from LiDAR into triangle meshes that work best in graphics pipelines. However, proprietary software is usually expensive, hard to learn, and requires manual interaction. The proposed methodology generates virtual models from LiDAR with little manual interaction, employing open-source software in an automated workflow for generic conversion. The point cloud is registered for geo-reference, processed for building textured models, and implemented in Unreal Engine 5 for Virtual Reality deployment. Specific improvements were made for the selected study case of the Castro of Santa Trega. Visualization of the model is overall more realistic than rendering every point in a cloud. The average framerate improves by 229% when rendering optimized meshes compared to point clouds, leading to enriched visualization quality and reduced data size. A Virtual Reality (VR) experience was implemented with an average of 143 FPS, surpassing the standard 90 FPS recommended to avoid motion sickness.
Neural networks have become foundational in modern technology, driving advancements across diverse domains such as medicine, law enforcement, and information technology. By enabling algorithms to learn from data and perform tasks autonomously, they eliminate the need for explicit programming. A significant challenge in this field is replicating the uniquely human capacity for creativity: envisioning and realizing novel concepts and tangible creations. Generative Adversarial Networks (GANs), a leading approach in this effort, are especially notable for synthesizing realistic human facial images. Despite the success of GANs, comprehensive comparative studies of face-generating GAN methodologies are limited. This paper addresses this gap by analyzing the scope and capabilities of facial generation, detailing the principles of the original GAN framework, and reviewing prominent GAN variants specifically designed for facial synthesis. Through performance evaluations and fidelity analysis of generated images, this study contributes to a deeper understanding of the potential of GANs to advance creativity in artificial intelligence.
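The original GAN framework the abstract refers to is a two-player minimax game between a generator G and a discriminator D, trained over the value function

```latex
\min_G \max_D \; V(D, G) =
  \mathbb{E}_{x \sim p_{\mathrm{data}}(x)}\!\left[\log D(x)\right]
  + \mathbb{E}_{z \sim p_z(z)}\!\left[\log\!\left(1 - D(G(z))\right)\right]
```

where D learns to distinguish real faces x from generated samples G(z), and G learns to fool D; at the equilibrium of this game the generator's distribution matches the data distribution.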
Rendering 3D virtual scenarios has become a popular alternative for generating per-pixel-labeled image datasets, especially in fields like autonomous driving. The approach is valuable for training neural perception models, such as semantic segmentation models, particularly when data might be scarce, expensive, or difficult to collect. However, fundamental questions persist within the research community regarding the generation and processing of these synthetic images, particularly a better understanding of the key factors influencing the performance of deep learning models trained with such synthetic images. In response, we conducted a series of experiments to elucidate the impact that common aspects involved in the generation of rendered synthetic images may have on the performance of neural semantic segmentation tasks. Our study used a recent autonomous driving synthetic dataset as our main testbed, allowing us to investigate the effect of different approaches when modeling their geometric, material, and lighting details. We also studied the impact of rendering noise, typically produced by path-tracing algorithms, as well as the impact of using different color transformations and tonemapping algorithms.
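One of the color transformations studied above is tonemapping, which compresses the high-dynamic-range radiance produced by a renderer into a displayable range. As one plausible example (the classic global Reinhard operator, not necessarily among the paper's exact choices):

```python
def reinhard_tonemap(hdr):
    """Global Reinhard operator L_out = L / (1 + L).

    Compresses highlights smoothly toward 1 while leaving
    dark values nearly unchanged.
    """
    return [l / (1.0 + l) for l in hdr]

print(reinhard_tonemap([0.0, 1.0, 4.0]))  # dark, mid, and bright radiance values
```

Such operators change pixel statistics, which is why the choice of tonemapping can influence models trained on the resulting synthetic images.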
Removing shadows in images is often a necessary pre-processing task for improving the performance of computer vision applications. Deep learning shadow removal approaches require a large-scale dataset that is challenging to gather. To address the issue of limited shadow data, we present a new and cost-effective method of synthetically generating shadows using 3D virtual primitives as occluders. We simulate the shadow generation process in a virtual environment where foreground objects are composed of mapped textures from the Places-365 dataset. We argue that complex shadow regions can be approximated by mixing primitives, analogous to how 3D models in computer graphics can be represented as triangle meshes. We use the proposed synthetic shadow removal dataset, DLSUSynthPlaces-100K, to train a feature-attention-based shadow removal network without an explicit domain adaptation or style transfer strategy. The results of this study show that the trained network achieves competitive results with state-of-the-art shadow removal networks that were trained purely on typical shadow removal (SR) datasets such as ISTD or SRD. Using a synthetic shadow dataset with only triangular prisms and spheres as occluders produces the best results. The synthetic shadow removal dataset can therefore be a viable alternative for future deep learning shadow removal methods. The source code and dataset can be accessed at this link: https://***/SynthShadowRemoval/.
Keyframes are a standard representation for kinematic motion specification. Recent learned motion-inbetweening methods use keyframes as a way to control generative motion models, and are trained to generate life-like motion that matches the exact poses and timings of input keyframes. However, the quality of generated motion may degrade if the timing of these constraints is not perfectly consistent with the desired motion. Unfortunately, correctly specifying keyframe timings is a tedious and challenging task in practice. Our goal is to create a system that synthesizes high-quality motion from keyframes, even if keyframes are imprecisely timed. We present a method that allows constraints to be retimed as part of the generation process. Specifically, we introduce a novel model architecture that explicitly outputs a time-warping function to correct mistimed keyframes and spatial residuals that add pose details. We demonstrate how our method can automatically turn approximately timed keyframe constraints into diverse, realistic motions with plausible timing and detailed submovements.
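A time-warping function like the one the model above outputs must be strictly monotonic so that frame order is preserved while keyframe timings shift. A common parameterization (an assumption for illustration, not necessarily the paper's exact formulation) maps unconstrained per-frame network outputs to positive step sizes and accumulates them:

```python
import math

def monotonic_time_warp(raw_outputs, duration):
    """Map unconstrained values to strictly increasing times in [0, duration]."""
    steps = [math.exp(v) for v in raw_outputs]  # exp guarantees positive steps
    total = sum(steps)
    times, t = [0.0], 0.0
    for s in steps:
        t += s / total * duration  # normalize so the warp ends at `duration`
        times.append(t)
    return times

warp = monotonic_time_warp([0.0, 1.0, -0.5, 0.2], duration=1.0)
print(warp)  # strictly increasing, starting at 0.0 and ending at 1.0
```

Because the warp is monotonic by construction, retiming can never reorder or collapse keyframes, however the network perturbs its raw outputs.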
A highly integrated Earth-observing satellite can possess several maneuverable payloads to perform different missions simultaneously, which brings some challenges to the method of task scheduling. This paper addresses the selection and scheduling problem of an agile satellite with several independently maneuverable optical payloads. Some differences compared to the traditional scheduling problem of agile satellites are presented and considered in a constrained optimization model. A two-stage method is proposed to accomplish the scheduling of the satellite and payloads in different stages. Clusters are generated from preprocessed tasks by a clique partition algorithm, and their centers are used to calculate the pointing direction of the satellite in the first stage. A multiobjective local search algorithm is introduced to schedule tasks in each selected cluster in the second stage. Considering the time-dependent property of the transition time, the problem of determining the start observation time is transformed into linear programming in a proposed insertion operator that guarantees the feasibility of generated solutions. Two types of instances are created and tested to demonstrate the effectiveness of the proposed method, and some analyses are conducted based on the experimental results.
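The first stage above groups tasks into clusters by partitioning a compatibility graph into cliques, where edges link tasks close enough to share one satellite pointing. A simple greedy partition (an illustrative stand-in, not the paper's exact algorithm) makes the idea concrete:

```python
def greedy_clique_partition(nodes, compatible):
    """Partition nodes into cliques; compatible(a, b) tests for an edge."""
    cliques = []
    for n in nodes:
        for clique in cliques:
            # n may join a clique only if adjacent to every current member.
            if all(compatible(n, m) for m in clique):
                clique.append(n)
                break
        else:
            cliques.append([n])  # no fit: start a new clique
    return cliques

# Hypothetical tasks, compatible when their pointing angles are close:
angles = {"t1": 0.0, "t2": 1.0, "t3": 9.0, "t4": 10.0}
near = lambda a, b: abs(angles[a] - angles[b]) < 2.0
print(greedy_clique_partition(list(angles), near))  # [['t1', 't2'], ['t3', 't4']]
```

Each resulting clique's center then fixes one satellite pointing direction, leaving the per-cluster observation times to the second-stage local search and its linear-programming insertion operator.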
In this special issue of IEEE Transactions on Visualization and Computer Graphics (TVCG), we are pleased to present the top papers from the 32nd IEEE Conference on Virtual Reality and 3D User Interfaces (IEEE VR 2025), held March 8–12, 2025, in Saint-Malo, France.