In the realm of cloud computing, safeguarding data confidentiality is paramount. This paper introduces a novel approach to data security in cloud platforms, merging fuzzy logic and fractal encryption techniques. Fuzzy...
详细信息
With the increasing number of Chinese students in Thailand, the demand for insurance services from these students has gradually shown diversity and personalized characteristics. However, the current research on the in...
详细信息
visual Question Answering (vQA) lies at the crossroads of computer vision, natural language processing, and deep learning, captivating researchers across various AI domains. This dynamic field involves processing an i...
详细信息
This work in the vaca Muerta Formation examines three assets along 420 km (260 miles) from southeast to northwest in the oil window in completely different geological settings. These settings include a southern block ...
详细信息
This work in the vaca Muerta Formation examines three assets along 420 km (260 miles) from southeast to northwest in the oil window in completely different geological settings. These settings include a southern block in development phase, an exploration area in the fold and thrust belt, and a third asset where mudstones facies are intruded by magmatic sills. This manuscript contributes with a complete evaluation process from exploration to pilot stages and when conditions are appropriate to a development phase of unconventional reservoir from a multidisciplinary team standpoint. This work incorporates available data, evaluation parameter cut-offs based on well performance in the vaca Muerta Formation, and considerations for unconventional play analysis. Copyright 2023, Unconventional Resources Technology conference (URTeC).
This research paper presents an approach to data-driven visual analytics of human mobility data using Kernel Density Estimation visualized through heatmaps, highlighting the need for exploration of forecasting methods...
详细信息
Future crewed missions beyond low earth orbit will greatly rely on the support of robotic assistance platforms to perform inspection and manipulation of critical assets. This includes crew habitats, landing sites or a...
详细信息
ISBN:
(纸本)9781665490320
Future crewed missions beyond low earth orbit will greatly rely on the support of robotic assistance platforms to perform inspection and manipulation of critical assets. This includes crew habitats, landing sites or assets for life support and operation. Maintenance and manipulation of a crewed site in extraterrestrial environments is a complex task and the system will have to face different challenges during operation. While most may be solved autonomously, in certain occasions human intervention will be required. The telerobotic demonstration mission, Surface Avatar, led by the German Aerospace Center (DLR), with partner European Space Agency (ESA), investigates different approaches offering astronauts on board the International Space Station (ISS) control of ground robots in representative scenarios, e.g. a Martian landing and exploration Site. In this work we present a feasibility study on how to integrate auditory information into the mentioned application. We will discuss methods for obtaining audio information and localizing audio sources in the environment, as well as fusing auditory and visual information to perform state estimation based on the gathered data. We demonstrate our work in different experiments to show the effectiveness of utilizing audio information, the results of spectral analysis of our mission assets, and how this information could help future astronauts to argue about the current mission situation.
Emotion recognition represents a critical facet of human-centric artificial intelligence systems. This paper delves into the forefront of emotion detection by leveraging cutting-edge deep learning models across three ...
详细信息
With the significant development in deep learning within the domains of computer vision and natural language processing, the research involving the multimodal aspects of visual Question Answering (vQA) has also reache...
详细信息
ISBN:
(纸本)9798400708473
With the significant development in deep learning within the domains of computer vision and natural language processing, the research involving the multimodal aspects of visual Question Answering (vQA) has also reached a pivotal turning point in recent years. Throughout prior investigations, scholars have consistently emphasized feature extraction from images and text. Numerous models have been applied in this context, ranging from the initial breakthroughs of Recurrent Neural Network (RNN) and Convolutional Neural Network (CNN), to the momentary prominence of Dynamic Memory Networks, and subsequently, the rise of transformers in recent times. Nonetheless, it is imperative to recognize that beyond the ambit of feature extraction models, the fusion of bi-modal features assumes pivotal significance. This paper builds upon SOAT model from the previous work, serving as the baseline, and meticulously scrutinizes its performance across distinct fusion methodologies. various notable fusion strategies, such as MUTAN and BLOCK, are considered. Notably, the most adept model achieves an impressive 65.74% accuracy on the vQA v2 dataset, outperforming established benchmarks. This outcome robustly substantiates the premise that fusion techniques exert tangible influence over the ultimate research outcomes.
Gait analysis is important for guide dog organizations, as ideal guide dogs have a smooth and efficient gait, where they can also easily shift between and maintain various gaits. Gait quality and natural traveling spe...
详细信息
ISBN:
(纸本)9798400716560
Gait analysis is important for guide dog organizations, as ideal guide dogs have a smooth and efficient gait, where they can also easily shift between and maintain various gaits. Gait quality and natural traveling speed are two of the multiple factors important in matching a guide dog to its visually impaired handler. Gait evaluation typically includes subjective visual observation of the dog or objective assessments obtained from special-designed equipment. Guide dog organizations need a method to easily collect and analyze objective data of gait information. In this work, we explored how various machine learning models could learn and analyze gait patterns from inertial measurements data that were collected during two different data collection experiments using a wearable sensor device. We also evaluated how well each machine learning model could generalize behavior patterns from various dogs under different environments. Additionally, we compared how sensor placement locations could affect gait prediction performance by attaching the sensor device to the dog's neck and back area respectively. The tested machine learning models were able to classify different gaits in the range of 42% to 91% in terms of accuracy, and predict various gait parameters with an error rate ranging from 14% to 29% depending on the setup. Furthermore, we also observed that using behavior data collected from the neck region contains more movement information than the back area. By performing a cross-dataset generalization test on the machine learning models, we found that even with performance drop, the models were able to learn gait-specific behavior patterns that are generalizable for different dogs. Although the results were preliminary, the proposed gait analysisexploration still showed promising potential for studying behavior patterns of candidate guide dogs.
Safely navigating street intersections is a complex challenge for blind and low-vision individuals, as it requires a nuanced understanding of the surrounding context - a task heavily reliant on visual cues. Traditiona...
详细信息
ISBN:
(纸本)9798350361087;9798350361070
Safely navigating street intersections is a complex challenge for blind and low-vision individuals, as it requires a nuanced understanding of the surrounding context - a task heavily reliant on visual cues. Traditional methods for assisting in this decision-making process often fall short, lacking the ability to provide a comprehensive scene analysis and safety level. This paper introduces an innovative approach that leverages vision-language models (vLMs) to interpret complex street crossing scenes, offering a potential advancement over conventional traffic signal recognition techniques. By generating a safety score and scene description in natural language, our method supports safe decision-making for blind and low-vision individuals. We collected crosswalk intersection data that contains multiview egocentric images captured by a quadruped robot and annotated the images with corresponding safety scores based on our predefined safety score categorization. Grounded on the visual knowledge, extracted from images and text prompts, we evaluate a vLM for safety score prediction and scene description. Our findings highlight the reasoning and safety score prediction capabilities of the vLM, activated by various prompts, as a pathway to developing a trustworthy system, crucial for applications requiring reliable decision-making support.
暂无评论