The proceedings contain 31 papers. The special focus in this conference is on Intelligent and 3D Technologies. The topics include: 3D Animation Design and Production Based on Intelligent Algorithm and virtual Reality;...
ISBN:
(纸本)9789819970100
The proceedings contain 31 papers. The special focus in this conference is on Intelligent and 3D Technologies. The topics include: 3D Animation Design and Production Based on Intelligent Algorithm and virtual Reality;4DFA: Four-Dimensional Full-Anatomy Reconstruction of Individualized Digital Human Models Based on Motion videos;research on Fault Prediction Technology of Air Compressor Based on Wavelet Neural Network;The Development of Intelligent vR Systems Based on Deep Learning;research and Implementation of Multi Fusion data Model Construction Technology for Distribution Network Digital Twins;research and Implementation of Rules Extraction Technology for Digital Twin Objects in Distribution Network Based on Semantic Understanding;analysis of Urban Landscape Plants’ Configuration Based on virtual Reality;3D Animation Simulation Based on Computer virtual Simulation Technology;lightweight Human Pose Estimation Based on Self-Attention Mechanism;new Approach for Soil Moisture Prediction Based on Multiple Influencing Factors;facial Photo-Guided Head Anatomy Modeling Based on Deep Learning and 2D/3D Shape Prior Model Registration;Design of Water Ecological Cleaning Robot Based on Raspberry PI and OpenCvvisual Recognition;fragrant Pear Target Identification and Positioning Based on Deep Learning;Online Mini-Game Popularity and Feedback data Prediction Based on Time Series and BP Neural Network Prediction;mileage Pile Detection for vehicle-Borne video;exploration of Future Temperature analysis Based on ARIMA Time Series Model and GA-BP Neural Network Prediction Model;research on Feature Extraction Based on Time Series Images;research on Human Eyesight Tracking Algorithm Based on Monocular vision;Automatic Calibration Method for High Resolution LiDAR and Fisheye Camera.
This paper presents SaSLaW, a spontaneous dialogue speech corpus containing synchronous recordings of what speakers speak, listen to, and watch. Humans consider the diverse environmental factors and then control the f...
详细信息
This paper presents SaSLaW, a spontaneous dialogue speech corpus containing synchronous recordings of what speakers speak, listen to, and watch. Humans consider the diverse environmental factors and then control the features of their utterances in face-to-face voice communications. Spoken dialogue systems capable of this adaptation to these audio environments enable natural and seamless communications. SaSLaW was developed to model human-speech adjustment for audio environments via first-person audio-visual perceptions in spontaneous dialogues. We propose the construction methodology of SaSLaW and display the analysis result of the corpus. We additionally conducted an experiment to develop text-to-speech models using SaSLaW and evaluate their performance of adaptations to audio environments. The results indicate that models incorporating hearing-audio data output more plausible speech tailored to diverse audio environments than the vanilla text-to-speech model.
The proceedings contain 848 papers. The topics discussed include: how well can a long sequence model model long sequences? comparing architectural inductive biases on long-context abilities;sequential fusion of text-c...
ISBN:
(纸本)9798891761971
The proceedings contain 848 papers. The topics discussed include: how well can a long sequence model model long sequences? comparing architectural inductive biases on long-context abilities;sequential fusion of text-close and text-far representations for multimodal sentiment analysis;PoemBERT: a dynamic masking content and ratio based semantic language model for chinese poem generation;STAND-Guard: a small task-adaptive content moderation model;query-LIFE: query-aware language image fusion embedding for e-commerce relevance;improving tool retrieval by leveraging large language models for query generation;RED-CT: a systems design methodology for using LLM-labeled data to train and deploy edge linguistic classifiers;beyond visual understanding introducing PARROT-360v for vision language model benchmarking;and AI-Press: a multi-agent news generating and feedback simulation system powered by large language models.
In modern surveillance, activities have increasingly become dependent on the continuous observation offered by CCTv systems. Still, with massive amounts of video data generated in a minute, sifting through this inform...
详细信息
The proceedings contain 848 papers. The topics discussed include: how well can a long sequence model model long sequences? comparing architectural inductive biases on long-context abilities;sequential fusion of text-c...
ISBN:
(纸本)9798891761988
The proceedings contain 848 papers. The topics discussed include: how well can a long sequence model model long sequences? comparing architectural inductive biases on long-context abilities;sequential fusion of text-close and text-far representations for multimodal sentiment analysis;PoemBERT: a dynamic masking content and ratio based semantic language model for chinese poem generation;STAND-Guard: a small task-adaptive content moderation model;query-LIFE: query-aware language image fusion embedding for e-commerce relevance;improving tool retrieval by leveraging large language models for query generation;RED-CT: a systems design methodology for using LLM-labeled data to train and deploy edge linguistic classifiers;beyond visual understanding introducing PARROT-360v for vision language model benchmarking;and AI-Press: a multi-agent news generating and feedback simulation system powered by large language models.
The potential of the ecosystems is a pillar and a guiding framework of environmental research to pay attention to the health and vitality of the ecosystems in the context of key concepts and paradigms in aiming to ach...
详细信息
violence detection has garnered significant attention as there's a growing demand for automated methods to identify violent actions. This surge in interest stems from the utilization of surveillance cameras in div...
详细信息
Edge-assisted visual Simultaneous Localization and Mapping (v-SLAM) systems offload complex analysis modules from computationally constrained mobile devices to cloud servers at the network edge. As visualdata must tr...
详细信息
The proceedings contain 33 papers. The special focus in this conference is on Machine Learning, Advances in Computing, Renewable Energy and Communication. The topics include: Enhancing Power Quality in Grid-Tied Solar...
ISBN:
(纸本)9789819752300
The proceedings contain 33 papers. The special focus in this conference is on Machine Learning, Advances in Computing, Renewable Energy and Communication. The topics include: Enhancing Power Quality in Grid-Tied Solar Photovoltaic Systems;RF-TSvM: Random Forest-Based Transductive Support vector Machine for Classification and Prediction of Cancer Patterns;resource-Efficient Image Retrieval: A Study of Local Patterns versus Deep Learning Models;machine Translation of Chinese–Hindi Simple Sentences Using Moses;Innovative Approaches to Reduce Carbon Footprint and Air Pollution: The Role of AI, ML, Cloud Computing, and IoT;investigating Sensor Technology and Benefits of Intelligent Transport Systems;student Attendance System by Quick Responsive Code;analysis of Brain Tumor Detection Using Machine Learning;a Review on Multiple Face Detection Techniques and Challenges;blockchain-Enabled Secure Identity verification in Agri-Food Supply Chain;Machine Learning Approach for Diagnosis of Schizophrenia Using EEG Signals;Comparative analysis of Web APIS: RESTful and GraphQL;AI-Based vision Screening Tool for Keratoconus;synergizing Artistry and Technology by Unveiling the Integration of Matte Painting Techniques in Crafting Precise and Immersive visual Effects Backgrounds;Heart Disease Prediction: A Comprehensive exploration of Optimal Predictive AI;dynamic Animation Scaling: Design and Development of Adaptive Character Animations for varying Sizes;framework for Assessment of Greywater-Assisted Composting Using IoT-Based Sensors;mood-Based Movie Recommendation System Using Sentiment analysis;blockchain-Enabled Consensus Mechanisms for data Integrity and Security in Edge and Cloud Computing Environments;AI-Based Question Paper analysis and Generator with Authentication;pomegranate Leaf Fruit Disease Prediction Using Machine Learning;price Prediction Using Machine Learning Approaches.
One of the primary reasons for decreased crop yields is the presence of plant diseases, which also leads to substantial financial losses for farmers and the entire agricultural industry. It is possible to lessen agric...
详细信息
暂无评论