To improve the visual expression of 3D animation, a set of optimization methods using computer graphics technology was proposed. A depth camera and an image processor suitable for video transmission were selected...
ISBN: 9798350326970 (print)
Digital Twins (DTs) in the Metaverse face many challenges, such as the lack of optimized AI models that allow interaction between the user and the virtual environment. In this paper, we propose an optimized model for human language processing based on Convolutional Neural Networks (CNNs), and we present an input processing strategy to meet the real-time requirements of smart applications that integrate DTs oriented to speech-based functionalities for user interaction and the Metaverse. In our solution, CNNs are applied to process and classify the human voice, while structured data and Mel-frequency cepstral coefficients (MFCCs) are used to train the neural networks and run inference with the models. The MFCC algorithm is also used to extract the characteristics that distinguish each generated audio file and to reduce the complexity of the neural network model in order to obtain better performance. Starting from an approach to the problem available in the literature, we have optimized a specific CNN model for Natural Language Processing (NLP) in order to improve its results. The proposed model has demonstrated excellent performance and can be used as a basis for the implementation of software that allows DTs to interact with voice commands issued by a user.
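The MFCC step mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no parameters, so the frame length (256 samples), sample rate (8000 Hz), number of mel filters (20), and number of coefficients (13) are all assumptions, and the function names are hypothetical. The sketch uses only the Python standard library and a naive DFT for readability; the CNN classifier that would consume these features is not shown.

```python
import cmath
import math


def hz_to_mel(freq_hz):
    """Convert a frequency in Hz to the mel scale (common 2595*log10 form)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    """Hamming-windowed power spectrum of one frame, via a naive O(n^2) DFT."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(windowed))
        spectrum.append(abs(acc) ** 2 / n)
    return spectrum


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose centers are evenly spaced on the mel scale."""
    n_bins = n_fft // 2 + 1
    mel_max = hz_to_mel(sample_rate / 2.0)
    mel_points = [mel_max * i / (n_filters + 1) for i in range(n_filters + 2)]
    bin_points = [min(n_bins - 1,
                      int(math.floor((n_fft + 1) * mel_to_hz(m) / sample_rate)))
                  for m in mel_points]
    bank = []
    for j in range(1, n_filters + 1):
        left, center, right = bin_points[j - 1], bin_points[j], bin_points[j + 1]
        filt = [0.0] * n_bins
        for b in range(left, center):      # rising edge of the triangle
            filt[b] = (b - left) / (center - left)
        for b in range(center, right):     # falling edge of the triangle
            filt[b] = (right - b) / (right - center)
        bank.append(filt)
    return bank


def dct2(values, n_coeffs):
    """Unnormalized DCT-II, keeping the first n_coeffs coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n)
                for i, v in enumerate(values))
            for k in range(n_coeffs)]


def mfcc_frame(frame, sample_rate, n_filters=20, n_coeffs=13):
    """MFCCs of one frame: power spectrum -> mel energies -> log -> DCT."""
    spectrum = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    energies = [sum(w * p for w, p in zip(filt, spectrum)) for filt in bank]
    log_energies = [math.log(e + 1e-10) for e in energies]  # epsilon avoids log(0)
    return dct2(log_energies, n_coeffs)


# Usage: features for one frame of a synthetic 440 Hz tone (assumed test signal).
tone = [math.sin(2.0 * math.pi * 440.0 * i / 8000) for i in range(256)]
features = mfcc_frame(tone, sample_rate=8000)
print(len(features))
```

Stacking the per-frame coefficient vectors over time yields a 2D array, which is the kind of compact input a CNN classifier can take in place of the raw waveform. In practice an FFT-based audio library would replace the naive DFT used here.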
Before export, fruit should be classified to improve quality, meet customer requirements and increase product value. This article proposes a method to identify defects on the surface of tomato skin using image process...
Vector quantization (VQ) methods have been used in a wide range of applications for speech, image, and video data. While classic VQ methods often use expectation maximization, in this paper, we investigate the use of ...
Image matching is an attractive area for researchers. This field has been propelled recently by the advancement of imaging devices with multi-spectral capabilities, compute power, and the evolution of the deep lea...
Foundation AI models have emerged as powerful pre-trained models on a large scale, capable of seamlessly handling diverse tasks across multiple domains with minimal or no fine-tuning. These models, exemplified by the ...
Corn plays an important role in many fields, but the level of intelligent detection for moldy corn is low. This article proposes a method for identifying moldy corn kernels based on machine vision. First, the image is...
Few-shot image classification aims to learn a model that can adapt to unseen classes with few labeled data. This challenging problem requires overcoming the distribution shift of features due to differences between t...
Due to the rapid rise in the identification of digital materials, automatic image classification has emerged as the most difficult topic of computer vision. In comparison to human vision, automatic visual understandin...
Blind image inpainting aims to recover the content of a corrupted image when the mask indicating the corrupted regions is not available at inference time. Inspired by the observation that most existing methods for inpainting su...