检索结果-内蒙古大学图书馆

Deep Learning for Visual Speech Analysis: A Survey

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024年第9期46卷 6001-6022页

作者： Sheng, Changchong Kuang, Gangyao Bai, Liang Hou, Chenping Guo, Yulan Xu, Xin Pietikainen, Matti Liu, Li Natl Univ Def Technol NUDT Coll Elect Sci & Technol Changsha 410073 Hunan Peoples R China Naval Univ Engn Natl Key Lab Electromagnet Energy Wuhan 430030 Hubei Peoples R China Natl Univ Def Technol NUDT Coll Syst Engn Changsha 410073 Hunan Peoples R China Natl Univ Def Technol NUDT Coll Liberal Arts & Sci Changsha 410073 Hunan Peoples R China Natl Univ Def Technol NUDT Coll Intelligence Sci & Technol Changsha 410073 Hunan Peoples R China Oulu Univ Ctr Machine Vis & Signal Anal Oulu 90570 Finland

Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment. As a powerful AI strategy, deep learning techniques have extensively promoted the development of visual speech learning. Over the past five years, numerous deep learning based methods have been proposed to address various problems in this area, especially automatic visual speech recognition and generation. To push forward future research on visual speech, this paper will present a comprehensive review of recent progress in deep learning methods on visual speech analysis. We cover different aspects of visual speech, including fundamental problems, challenges, benchmark datasets, a taxonomy of existing methods, and state-of-the-art performance. Besides, we also identify gaps in current research and discuss inspiring future research directions.

关键词： Visualization Deep learning Surveys Task analysis Feature extraction Speech analysis Lips visual speech lip reading speech perception computer vision computer graphics

来源：评论

学校读者我要写书评

暂无评论

Structural Color in Amber-Entombed Wasp: A Detailed Study Using NS-FDTD Simulations

引用

IEEE ACCESS 2024年 12卷 57163-57171页

作者： Hou, Zhuo Cai, Dongsheng Dong, Ran Univ Tsukuba Grad Sch Syst Informat & Engn Tsukuba Ibaraki 3058577 Japan Univ Tsukuba Inst Syst & Informat Engn Tsukuba Ibaraki 3058577 Japan Chukyo Univ Sch Engn Nagoya Aichi 4700393 Japan

The multilayer reflectors of insect epidermis can produce unique structural color through interactions with light. Many fossilized insects, like amber-entombed wasps, present structural colors. However, how this multilayer structure and structural colors are preserved during the fossilization process is still being determined. We use a transfer matrix method (TMM) and a Non-Standard Finite Difference Time Domain (NS-FDTD) simulations to analyze the effects of both expected compressions and expansions of the epidermis layer thickness during fossilization on its structural colors. We estimate the variations of epidermis layer thickness due to the fossilization by measuring their color distances. Surprisingly, we find that the structural coloration of the multilayer reflectors, ranging from blue to green, emitted by many insects remained unchanged from about +5% expansion to -12% compression of their thickness. These findings suggest that, first, insects might have kept their original colors during the fossilization process. Second, the appearance of these structural colors in insects might not just be by chance, but could also be a result of specific evolutionary choices.

关键词： FDTD computer graphics simulation paleontology structural color insects fossil optics photonics

来源：评论

学校读者我要写书评

暂无评论

SAwareSSGI: Surrounding-Aware Screen-Space Global Illumination Using Generative Adversarial Networks

引用

IEEE ACCESS 2024年 12卷 139946-139961页

作者： Noor, Jannatun Mahmud, Abrar Rahman, Moh. Absar Sifar, Alimus Mostafa, Fateen Yusuf Tasnova, Lamia Chellappan, Sriram BRAC Univ Sch Data & Sci Comp Sustainabil & Social Good C2SG Res Grp Dhaka 1212 Bangladesh BRAC Univ Dept Comp Sci & Engn Dhaka 1212 Bangladesh Univ S Florida Dept Comp Sci Tampa 33620 FL USA

Global Illumination (GI) is a technique that is employed in computer graphics to enhance realism. Various methods have been used to achieve this using computer-generated imagery. The most precise method involves conventional ray tracing, which yields highly realistic results but is computationally intensive and unsuitable for real-time applications. Alternatively, faster algorithms utilize post-processing on rasterization, making them more suitable for real-time scenarios. However, these algorithms are also resource-intensive and may produce inaccurate lighting due to limited information on screen-space features. our proposal involves utilizing a Generative Adversarial Network (GAN) approach to achieve real-time GI effects, following the methodology of conventional screen-space GI techniques. We take surrounding graphical information into account by going beyond screen-space and producing consistent GI effects that are comparatively closer to their physically correct ray-tracing counterpart. Moreover, our model provides a better quality of generated output than the other recent model which utilized a similar approach by scoring 0.90811 in SSIM, 0.00093 in MSE, and 30.30576 dB in PSNR on our developed dataset.

关键词： Lighting Ray tracing Real-time systems Three-dimensional displays graphics Rendering (computer graphics) Generative adversarial networks computer graphics Neural networks global illumination GAN neural networks

来源：评论

学校读者我要写书评

暂无评论

Initial Pole Axis and Spin Direction Estimation of Asteroids Using Infrared Imagery

引用

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS 2024年第6期47卷 1055-1071页

作者： Kuppa, Koundinya Mcmahon, Jay W. Dietrich, Ann B. Univ Colorado Colorado Ctr Astrodynam Res Aerosp Engn Sci 3775 Discovery Dr Boulder CO 80303 USA Charles Stark Draper Lab Inc GN&C 555 Technol Sq Cambridge MA 02139 USA

Knowing the pole axis of an asteroid is vital to autonomous asteroid exploration efforts. Ground-based initial pole estimation methods are time and data intensive and produce estimates with large uncertainties. These errors have a significant impact on proximity navigation, shape modeling, and scientific data for small body missions. In this paper, a new method of obtaining this information from onboard spacecraft imagery is presented. The proposed method estimates the pole from onboard infrared imagery using the camera-asteroid geometry. This method does not require a prior and is designed to work in a vast majority approach trajectories due to the use of infrared images. The method is applied to simulated infrared images of asteroids 101955 Bennu and 25143 Itokawa as well as real infrared images of asteroid 162173 Ryugu from the Hayabusa2 mission. The average pole errors using this method on Bennu and Itokawa images are approximately 2 and 6 deg, respectively. The pole estimate error on the Ryugu images is approximately 8 deg. The algorithm is shown to be sensitive to the percentage of spin period imaged and the spacing between the images.

关键词： Kuiper Belt Thermal Modeling and Analysis computer graphics Image Processing Planetary Science and Exploration Space Missions Navigation Algorithm spin pole estimation

来源：评论

学校读者我要写书评

暂无评论

Temporal Coherence-Based Distributed Ray Tracing of Massive Scenes

引用

IEEE TRANSACTIONS ON VISUALIZATION AND computer graphics 2024年第2期30卷 1489-1501页

作者： Xu, Xiang Wang, Lu Perard-Gayot, Arsene Membarth, Richard Li, Cuiyu Yang, Chenglei Slusallek, Philipp Shandong Univ Finance & Econ Shandong Key Lab Blockchain Finance Jinan 250101 Shandong Peoples R China Shandong Univ Sch Software Jinan 250101 Shandong Peoples R China Weta Digital Wellington 6243 New Zealand TH Ingolstadt THI Res Inst AImot Bavaria D-85049 Ingolstadt Germany Saarland Informat Campus German Res Ctr Artificial Intelligence DFKI D-66123 Saarbrucken Saarland Germany Adv Comp East China Subctr Suzhou 215300 Jiangsu Peoples R China

Distributed ray tracing algorithms are widely used when rendering massive scenes, where data utilization and load balancing are the keys to improving performance. One essential observation is that rays are temporally coherent, which indicates that temporal information can be used to improve computational efficiency. In this paper, we use temporal coherence to optimize the performance of distributed ray tracing. First, we propose a temporal coherence-based scheduling algorithm to guide the task/data assignment and scheduling. Then, we propose a virtual portal structure to predict the radiance of rays based on the previous frame, and send the rays with low radiance to a precomputed simplified model for further tracing, which can dramatically reduce the traversal complexity and the overhead of network data transmission. The approach was validated on scenes of sizes up to 355 GB. Our algorithm can achieve a speedup of up to 81% compared to previous algorithms, with a very small mean squared error.

关键词： Rendering (computer graphics) Ray tracing Portals Heuristic algorithms Dynamic scheduling Task analysis Distributed databases computer graphics distributed graphics ray tracing

来源：评论

学校读者我要写书评

暂无评论

Robust edge-preserving image smoothing based on complementary weighting scheme

引用

SIGNAL IMAGE AND VIDEO PROCESSING 2024年第8-9期18卷 5663-5675页

作者： Yang, Yang Xia, Minghui Wang, Xinyu Zeng, Lanling Zhan, Yongzhao Jiangsu Univ Dept Comp Sci Xuefu Rd 301 Zhenjiang 212013 Jiangsu Peoples R China

Edge-aware image smoothing refers to the removal of details with edges preserved. It is an essential topic in the field of image processing and computer graphics. In this paper, in order to achieve better edge preservation than the existing models, we propose a robust edge-preserving image filtering method based on a complementary weighting scheme. Both isotropic and anisotropic weights are involved in our model to adapt the fidelity and the regularization terms. To efficiently solve the proposed model, we introduce an effective algorithm based on additive half quadratic minimization, alternating direction of multipliers, and Fourier domain optimization strategies. We experimentally validate the proposed filter on several low-level vision tasks. Both quantitative and qualitative experimental results show significant superiority of our proposed filter compared to existing techniques. Furthermore, the filter exhibits high efficiency and is able to process 720P color images (over 10 fps) in real-time on an NVIDIA RTX 3070. Therefore, it is practical for real-world applications.

关键词： Edge-preserving Complementary weighting Image processing computer graphics

来源：评论

学校读者我要写书评

暂无评论

An automatic detection method for cervical liquid-based cells based on improved Yolov5s network

引用

IET IMAGE PROCESSING 2024年第14期18卷 4695-4703页

作者： Shen, Xudong Wu, Zhihua Tao, Yebo Wu, Xianglian Chen, Linfei China Jiliang Univ Hangzhou 310018 Zhejiang Peoples R China Jiaxing Vocat & Tech Coll Jiaxing Zhejiang Peoples R China Xiamen Univ Technol Xiamen Fujian Peoples R China Jiaxing Jingzhu Biotechnol Co Ltd Jiaxing Zhejiang Peoples R China

To address the significant challenges of high false positive and false negative rates in existing algorithms for detecting cervical fluid-based cells, an enhanced Yolov5s network is introduced. This paper details a novel approach that dynamically adjusts the weights of channels and the spatial attention in modules, substantially improving feature extraction from small objects and boosting the detection capabilities of the network. Furthermore, Mixup data augmentation technology is incorporated to counter the issue of imbalanced data categories in the custom dataset. The Complete Intersection over Union loss function is also employed to refine coordinate localization accuracy during training. Tested on the proprietary cervical cytology dataset, the modified Yolov5s achieves a mean Average Precision of 92.1%, surpassing the previous state-of-the-art by 5.6%. This enhancement substantiates the efficacy of the proposed model. Code and models are accessible at .

关键词： computer graphics computer vision

来源：评论

学校读者我要写书评

暂无评论

Evaluating Image-Based Interactive 3D Modeling Tools

引用

IEEE ACCESS 2024年 12卷 104138-104152页

作者： Siddique, Arslan Cignoni, Paolo Corsini, Massimiliano Banterle, Francesco CNR ISTI Visual Comp Lab I-56124 Pisa Italy Univ Pisa Dept Comp Sci I-56127 Pisa Italy

Structure from Motion (SfM) is a computer vision technique used to reconstruct three-dimensional (3D) structures from a series of two-dimensional (2D) images or video frames. However, SfM tools struggle with transparent objects, reflective surfaces, and low-resolution frames. In such situations, image-based interactive 3D modeling software packages are employed to model 3D objects and measure dimensions. Our contributions to this work are twofold. First, we have introduced new tools to improve 3D modeling software packages;such tools are aimed at easing the workload for users. Second, we have conducted a comprehensive user study to evaluate the efficacy of popular 3d modeling software packages. The task is to measure certain dimensions for which ground truth measurements are already known. A relative error is calculated for every measurement. The evaluation of each software tool is done through survey form, event logs, and measurement relative error. The results of this user study clearly show that our approach to 3D modeling using multiple images has a lower relative error and produces higher quality 3D models than other software packages. In addition, it shows our new tools reduce the required time for completing a task.

关键词： Three-dimensional displays Solid modeling Surface reconstruction Image reconstruction Computational modeling Cameras Task analysis User-assisted 3D reconstruction interactive 3D modeling computer graphics image based-3D reconstruction structure-from-motion

来源：评论

学校读者我要写书评

暂无评论

A framework for phenotyping rubber trees under intense wind stress using laser scanning and digital twin technology

引用

AGRICULTURAL AND FOREST METEOROLOGY 2025年 361卷

作者： Yun, Ting Eichhorn, Markus P. Jin, Shichao Yuan, Xinyue Fang, Wenjie Lu, Xin Wang, Xiangjun Zhang, Huaiqing Nanjing Forestry Univ Coinnovat Ctr Sustainable Forestry Southern China Nanjing 210037 Peoples R China Nanjing Forestry Univ Coll Informat Sci & Technol Nanjing 210037 Peoples R China Univ Coll Cork Sch Biol Earth & Environm Sci Distillery Fields North Mall Cork T23 N73K Ireland Nanjing Agr Univ Acad Adv Interdisciplinary Studies Plant Phen Res Ctr Nanjing 210095 Peoples R China Chinese Acad Trop Agr Sci Rubber Res Inst Minist Agr Danzhou Invest & Expt Stn Trop Crops Danzhou Peoples R China Chinese Acad Forestry Res Inst Resource Informat Tech Beijing 100091 Peoples R China

Rubber trees in coastal habitats are exposed to a high degree of wind stress. An algorithm-hardware synergetic methodology was developed for investigating and predicting rubber tree phenotyping excited by strong winds. The framework includes (1) a custom-designed industrial fan that recreates a variable airflow field at wind speeds of 15, 30 and 45 m/s coupled with a terrestrial laser scanner and bundled motion sensors to acquire point clouds and vibration data;(2) a graphic model that approximates tree canopies based on foliage clumps with phenotypic traits that are derived from point clouds captured while trees are subjected to aerodynamic drag;and (3) the wind characteristic parameters of forest canopies were calculated by a developed forest-specialized k-epsilon turbulence model combining the constructed tree models and grid-scale subdivision of the wind fluid field. (4) A digital twin model that incorporates detailed tree phenotypic traits and considers plant mechanical characteristics was established, depicting the related wind-induced actions of target trees under various wind influences. The results show that tree crowns with spreading forms are prone to yield larger pendulum amplitudes than compact crowns, but trees directly exposed to wind exhibit greater crown volume reductions than trees in sheltered areas. Within tree canopies, a one-fold increase in inlet wind speed intensified crown compression (approximately 17 % decrease in crown volume), generated 2.1-fold pressure gradients and increased turbulence kinetic energy by approximately 60 %. Moreover, the entire scenario of the adaptation of experimental trees to wind perturbations was visually restored using digital twin techniques, serving as an integral behaviour dataset for further data-driven decision-making. In summary, this paper presents a comprehensive methodology that can decipher the phenotypic manifestations of trees' reactions to wind hazards, with potential applications in phenotyping or e

关键词： Rubber tree phenotyping Wind stress Aerodynamic model computer graphics Terrestrial laser scanning Digital twin

来源：评论

学校读者我要写书评

暂无评论

Foreword to the special section on computer graphics in Brazil: A selection of papers from SIBGRAPI 2012

引用

computerS & graphics-UK 2014年第Feb.期38卷 A1-A2页

作者： Dal Sasso Freitas, Carla Scopigno, Roberto Inst Informat UFRGS BR-91501970 Porto Alegre RS Brazil CNR ISTI Pisa Italy

This special section contains three papers, which are extended contributions of original works presented at the 25th SIBGRAPI -Conference on graphics, Patterns and Images. Started in 1988, SIBGRAPI has been the main scientific event in computer graphics, Image Processing and related areas in Brazil. In 2012, celebrating the 25th anniversary of SIBGRAPI, it was held in the historical city of Ouro Preto, Minas Gerais. Each year, best papers are selected and authors are invited to submit extended versions to high quality journals. After a rigorous peer-reviewing process, the three contributions published in this section were selected among four invited submissions to be published in computers & graphics.

关键词： computer graphics historical city Brazil paper graphic images images

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：