Users are becoming accustomed to uploading text and pictures on social networks to express their feelings or ideas. As a result, multimodal sentiment analysis has drawn more attention as a study area in recent years. ...
详细信息
In the design of traffic monitoring solutions for optimizing the urban mobility infrastructure, acoustic vehicle counting models have received attention due to their cost effectiveness and energy efficiency. Although ...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
In the design of traffic monitoring solutions for optimizing the urban mobility infrastructure, acoustic vehicle counting models have received attention due to their cost effectiveness and energy efficiency. Although deep learning has proven effective for visual traffic monitoring, its use has not been thoroughly investigated in the audio domain, likely due to real-world data scarcity. In this work, we propose a novel approach to acoustic vehicle counting by developing: i) a traffic noise simulation framework to synthesize realistic vehicle pass-by events;ii) a strategy to mix synthetic and real data to train a deep-learning model for traffic counting. The proposed system is capable of simultaneously counting cars and commercial vehicles driving on a two-lane road, and identifying their direction of travel under moderate traffic density conditions. With only 24 hours of labeled real-world traffic noise, we are able to improve counting accuracy on real-world data from 63% to 88% for cars and from 86% to 94% for commercial vehicles.
Learning analytics, blending education theory, psychology, statistics, and computer science, utilizes data about learners and their environments to enhance education. Artificial Intelligence advances this field by per...
详细信息
ISBN:
(纸本)9789819754946;9789819754953
Learning analytics, blending education theory, psychology, statistics, and computer science, utilizes data about learners and their environments to enhance education. Artificial Intelligence advances this field by personalizing learning and providing predictive insights. However, the opaque 'black box' nature of AI decision-making poses challenges to trust and understanding within educational settings. This paper presents a novel visual analytics method to predict whether a student is at risk of failing a course. The proposed method is based on a dendritic neuron model (DNM), which not only performs excellently in prediction, but also provides an intuitive visual presentation of the importance of learning behaviors. It is worth emphasizing that the proposed DNM has a better performance than recurrent neural network (RNN), long short term memory network (LSTM), gated recurrent unit (GRU), bidirectional long short term memory network (BiLSTM) and bidirectional gated recurrent unit (BiGRU). The powerful prediction performance can assist instructors in identifying students at risk of failing and performing early interventions. The importance analysis of learning behaviors can guide students in the development of learning plans.
In order to explore the research frontiers and hot topics of blended learning, the research on blended learning in China during the past two decades from 2004 to 2023 was visualized and analyzed with the help of CiteS...
详细信息
The intersection of computer vision and machine learning has emerged as a promising avenue for advancing historical research, facilitating a more profound exploration of our past. However, the application of machine l...
详细信息
ISBN:
(纸本)9783031706417;9783031706424
The intersection of computer vision and machine learning has emerged as a promising avenue for advancing historical research, facilitating a more profound exploration of our past. However, the application of machine learning approaches in historical palaeography is often met with criticism due to their perceived "black box" nature. In response to this challenge, we introduce NeuroPapyri, an innovative deep learning-based model specifically designed for the analysis of images containing ancient Greek papyri. To address concerns related to transparency and interpretability, the model incorporates an attention mechanism. This attention mechanism not only enhances the model's performance but also provides a visual representation of the image regions that significantly contribute to the decision-making process. Specifically calibrated for processing images of papyrus documents with lines of handwritten text, the model utilizes individual attention maps to inform the presence or absence of specific characters in the input image. This paper presents the NeuroPapyri model, including its architecture and training methodology. Results from the evaluation demonstrate NeuroPapyri's efficacy in document retrieval, showcasing its potential to advance the analysis of historical manuscripts.
In this work, we explore the space of emotional reactions induced by real-world images. For this, we first introduce a large-scale dataset that contains both categorical emotional reactions and free-form textual expla...
详细信息
ISBN:
(纸本)9798350301298
In this work, we explore the space of emotional reactions induced by real-world images. For this, we first introduce a large-scale dataset that contains both categorical emotional reactions and free-form textual explanations for 85,007 publicly available images, analyzed by 6,283 annotators who were asked to indicate and explain how and why they felt when observing a particular image, with a total of 526,749 responses. Although emotional reactions are subjective and sensitive to context (personal mood, social status, past experiences) - we show that there is significant common ground to capture emotional responses with a large support in the subject population. In light of this observation, we ask the following questions: i) Can we develop neural networks that provide plausible affective responses to real-world visualdata explained with language? ii) Can we steer such methods towards producing explanations with varying degrees of pragmatic language, justifying different emotional reactions by grounding them in the visual stimulus? Finally, iii) How to evaluate the performance of such methods for this novel task? In this work, we take the first steps in addressing all of these questions, paving the way for more human-centric and emotionally-aware image analysis systems. Our code and data are publicly available at https://affective- ***.
The proceedings contain 82 papers. The topics discussed include: LLM-powered multimodal insight summarization for UX testing;on multimodal emotion recognition for human-chatbot interaction in the wild;integrating mult...
ISBN:
(纸本)9798400704628
The proceedings contain 82 papers. The topics discussed include: LLM-powered multimodal insight summarization for UX testing;on multimodal emotion recognition for human-chatbot interaction in the wild;integrating multimodal affective signals for stress detection from audio-visualdata;feeling textiles through AI: an exploration into multimodal language models and human perception alignment;decoding contact: automatic estimation of contact signatures in parent-infant free play interactions;ScentHaptics: augmenting the haptic experiences of digital mid-air textiles with scent;online multimodal end-of-turn prediction for three-party conversations;detecting deception in natural environments using incremental transfer learning;and exploring interlocutor gaze interactions in conversations based on functional spectrum analysis.
This paper presents VIVA, a novel interactive tool for visually exploring long videos and searching for specific moments. Previous work on video dataexploration and analytics often assumes that manually-created, rich...
详细信息
ISBN:
(纸本)9798400701078
This paper presents VIVA, a novel interactive tool for visually exploring long videos and searching for specific moments. Previous work on video dataexploration and analytics often assumes that manually-created, rich annotations are available. However, such metadata may not be easily obtained. We design an interactively machine learning workflow for users to rapidly create annotations along a timeline. Combined with VIVA's focus+context visualization that effectively displays frame snapshots in the context of a video stream, VIVA enables users to explore and analyze long video clips by incrementally make sense of them. We present usage scenarios that demonstrate how users would use VIVA for video-related tasks.
We introduce FitYou, an interactive dashboard for health datavisualizations from wearable devices. It supports the exploration of health trends and health attribute correlations. Wearable devices are small enough to ...
详细信息
Geothermal Play Fairway analysis (GPFA) is an exploration process adopted to geothermal, that integrates data of critical risk elements inherent to that specific geothermal play type. The key function of GPFA is to re...
详细信息
ISBN:
(纸本)0934412308
Geothermal Play Fairway analysis (GPFA) is an exploration process adopted to geothermal, that integrates data of critical risk elements inherent to that specific geothermal play type. The key function of GPFA is to reduce risk and increase focus for improving exploration success rates. GPFA begins at the regional/basin scale, and progressively focuses in on the play scale. It then examines the critical risk element data to highlight which play areas have the highest likelihood of success (prospects). The outputs from the GPFA process are Common Risk Segment (CRS) & Composite Common Risk Segment (CCRS) Maps. CRS maps define areas that contain the same general Probability of Success (PoS) for each individual risk element based on the input data. Operator analyzed/ determined cutoff values or classes are then applied to each map with color assignments indicating high (red), medium (yellow) and low (green) risk areas for each element under consideration. Each individual CRS map is then composited into a single CCRS map. Publicly available data on hundreds of thousands of boreholes in Texas and the Gulf Coast demonstrate excellent potential for geothermal electricity generation from either current or abandoned oil and gas wells. Near-surface geothermal resources, at depths of 3 km (9,842 ft) or less, are generally less than 150°C (302°F) in Texas. Economically feasible electricity generation is possible with available subsurface temperature conditions within reasonable depths-generally greater than 120°C (248°F) within 4 km (13,123 ft) - given the prolific oil and gas well drilling. Extensive data exists to depths as much as 8 km (26,246 ft), indicating temperatures in excess of 300°C (572°F). When initiating an investigation into a new geothermal exploration area, a "high-level first pass" should be conducted for the heat resource before initiating the full GPFA process or detailed feasibility assessment. This is an important first step prior to conducting a detailed st
暂无评论