We propose an advanced new system that automatically generates dances according to the user's favorite music and dance style and then presents 3DCG animations of the generated dances on a naked-eye 3D display so t...
详细信息
ISBN:
(纸本)9798400704635
We propose an advanced new system that automatically generates dances according to the user's favorite music and dance style and then presents 3DCG animations of the generated dances on a naked-eye 3D display so that the user can have the experience of actually dancing with the displayed dancer. For automatic dance generation, we developed a new generation model based on the transformer-based diffusionmodel that generates a dance conditioned on any music audio and dance style of the user's choice. We implemented this generation model in a GUI system that automatically generates dance animations according to the users' specifications. The CG animations are then presented on a naked-eye 3D display system we recently proposed.
Generating dance sequences that synchronize with music while maintaining naturalness and realism is a challenging task. Existing methods often suffer from "freezing" phenomena or abrupt transitions. In this ...
详细信息
Automatic layout generation models can generate numerous design layouts in a few seconds, which significantly reduces the amount of repetitive work for designers. However, most of these models consider the layout gene...
详细信息
ISBN:
(纸本)9798400701085
Automatic layout generation models can generate numerous design layouts in a few seconds, which significantly reduces the amount of repetitive work for designers. However, most of these models consider the layout generation task as arranging layout elements with different attributes on a blank canvas, thus struggle to handle the case when an image is used as the layout background. Additionally, existing layout generation models often fail to incorporate explicit aesthetic principles such as alignment and non-overlap, and neglect implicit aesthetic principles which are hard to model. To address these issues, this paper proposes a two-stage content-aware layout generation framework for poster layout generation. Our framework consists of an aesthetics-conditioned layout generation module and a layout ranking module. The diffusionmodel based layout generation module utilizes an aesthetics-guided layout denoising process to sample layout proposals that meet explicit aesthetic constraints. The Auto-Encoder based layout ranking module then measures the distance between those proposals and real designs to determine the layout that best meets implicit aesthetic principles. Quantitative and qualitative experiments demonstrate that our method outperforms state-of-the-art content-aware layout generation models.
Accurate segmentation of orbital bones in facial computed tomography (CT) images is essential for craniomaxillofacial surgery planning and the creation of bone implants. However, it has challenging issues that thin bo...
详细信息
ISBN:
(纸本)9781510685925;9781510685932
Accurate segmentation of orbital bones in facial computed tomography (CT) images is essential for craniomaxillofacial surgery planning and the creation of bone implants. However, it has challenging issues that thin bones of orbital medial wall and floor are difficult to segment due to their ambiguous boundaries and low contrast with surrounding soft tissues. Furthermore, this issue leads to inter-observer variability in manual annotation masks. In this paper, we propose a novel segmentation framework based on a conditional diffusion model with consensus-driven correction. The framework consists of three main components: conditional diffusion model-based segmentation, consensus-driven accumulation map generation, and context-aware consensus correction. The conditional diffusion model leverages diverse annotation masks to generate multiple plausible segmentation results, addressing the inter-observer variability associated with manual annotations. These results are aggregated into a consensus-driven accumulation map, which captures the agreement among possible segmentations, offering a robust alternative to simple averaging. Finally, the segmentation is refined through context-aware consensus correction, which integrates consensus information with CT image features, considering spatial and intensity-based characteristics. Experimental results show the effectiveness of the proposed method, achieving Dice Similarity Coefficients (DSCs) of 84.38% and 90.37% and precisions of 88% and 92.28% for the medial wall and floor, respectively. Compared to CNN-based methods, the proposed framework improves precision by up to 4.74% and 4.49%, significantly reducing false positives while preserving the continuity of thin structures.
Anomaly detection and identification methods for wind turbine based on deep learning have become a current research hotspot due to their superior performance in feature extraction. However, the existed methods have li...
详细信息
Anomaly detection and identification methods for wind turbine based on deep learning have become a current research hotspot due to their superior performance in feature extraction. However, the existed methods have limitations in fusion mechanisms of different physical features, and the logical relationships between parameters are difficult to interpret. To address these issues, an innovative Physical Information Dynamic Fusion (PIDF) mechanism and a Dynamic Fusion conditionaldiffusion (DFCD) model are proposed, along with a new operational state evaluation indicator derived from Shapley (SHAP) value analysis, for anomaly detection and fault identification in wind turbines. First, this proposed DFCD model based on PIDF mechanism enables the deep fusion of multiple physical parameters and overcomes the limitations of traditional models, which often struggle to effectively handle diverse information sources. Next, a quantitative approach based on SHAP values is proposed to analyze the relationship between condition parameters and the target parameter for evaluating the wind turbine's operating status. Finally, a new evaluation indicator for operational state of wind turbines is proposed based on the logical relationship. This indicator provides an intuitive and easily comprehensible way to assess the system behavior learned by the model. Through the analysis of datasets from two real wind farms, this method is capable of effectively identifying the anomaly state and fault locations, which enhances the operational efficiency of wind turbine. This work provides a new scientific tool for technology transfer, which will contribute to intelligent condition monitoring and information management in advanced engineering.
Geological CO2 storage is expected to play a pivotal role in achieving climate-neutrality targets by 2050. Accurate prediction of long-term CO2 storage performance relies on inverse modeling procedures that precisely ...
详细信息
Geological CO2 storage is expected to play a pivotal role in achieving climate-neutrality targets by 2050. Accurate prediction of long-term CO2 storage performance relies on inverse modeling procedures that precisely characterize spatially varying geological properties using practically available observed data. Traditional inversion methods necessitate extensive forward simulations to iteratively calibrate uncertain geological parameters, which can impose a significant computational burden. In this work, an end-to-end generative inversion framework based on the conditional diffusion model is proposed for efficiently characterizing heterogeneous geological properties and accelerating the inversion process. By employing an improved U-net to learn the conditional denoising diffusion process, the proposed framework enables the direct generation of high-dimensional property fields that closely match the observed data. Additionally, the probabilistic nature inherent in the diffusion approach allows for producing an ensemble of plausible geological realizations, facilitating effective quantification of parametric and predictive uncertainties. The performance of the proposed framework is validated by estimating stochastic permeability fields for both two-dimensional and three-dimensional carbon storage models. Comprehensive comparisons with the conditional generative adversarial network-based method demonstrate that the proposed framework yields more accurate inversion results and better quantifies the uncertainty in the predicted flow responses. This work offers a promising tool for subsurface inverse modeling and uncertainty quantification, potentially paving the way for broader adoption and exploration of generative diffusionmodels in the realm of energy system management.
Infrared imaging systems are widely used across industries. However, their output images often exhibit striped noise due to the nonuniform response of the detection system, which significantly affects image quality an...
详细信息
Infrared imaging systems are widely used across industries. However, their output images often exhibit striped noise due to the nonuniform response of the detection system, which significantly affects image quality and visual fidelity. To address challenges such as incomplete stripe removal, potential loss of image details and textures, and the generation of artificial artifacts during destriping, we propose a novel stripe removal method based on a knowledge-embedded diffusionmodel (KEDM). This approach effectively integrates the spatial distribution characteristics of stripe noise with an innovative, data-driven diffusion network model, creating a hybrid knowledge and data-driven framework for stripe correction. The core components of KEDM are the latent diffusionmodel (LDM) architecture and the directional wavelet convolution module (DWCM). Specifically, LDM leverages a pretrained variational autoencoder (VAE) to transform the input image into latent feature space for efficient diffusion propagation, reducing computational complexity while preserving image restoration quality. Meanwhile, DWCM uses wavelet convolution operations to construct prior loss functions for stripe noise, precisely guiding the diffusion reconstruction process to achieve a clean, stripe-free image. Empirical evaluations on several benchmark datasets demonstrate that the proposed KEDM outperforms other state-of-the-art destriping algorithms in terms of visual quality and quantitative metrics, validating its excellent performance.
暂无评论