This paper addresses the challenge of improving user interaction with autonomous vehicles by integrating voice command recognition with real-time visual feedback. The study employs Google's speech recognition Appl...
详细信息
ISBN:
(数字)9798331532956
ISBN:
(纸本)9798331532963
This paper addresses the challenge of improving user interaction with autonomous vehicles by integrating voice command recognition with real-time visual feedback. The study employs Google's speech recognition Application Programming Interface to transcribe spoken commands and the turtle graphics library to simulate the corresponding vehicle actions. The system is designed to accurately recognize and process voice commands, translating them into visual simulations that mirror the intended vehicle movements. The research method involved testing the system with various commands to assess its performance and effectiveness. The findings suggest that this integrated approach has significant potential for enhancing the usability and accessibility of autonomous vehicle systems.
Image fusion continues to be essential across various domains, such as computer vision, remote sensing, medical imaging, and military applications, from a technical standpoint. Given the rapid advancements in technolo...
Image fusion continues to be essential across various domains, such as computer vision, remote sensing, medical imaging, and military applications, from a technical standpoint. Given the rapid advancements in technology, The significance of image fusion in research is steadily increasing on a daily basis from a technical perspective. In various fields, image fusion plays a crucial role by integrating information from multiple images or imaging modalities into a unified composite image. The technique of wavelet-based image fusion is employed to merge multiple images or sources of images at varying scales and resolutions. Wavelet-based image fusion is a technique utilized to amalgamate multiple images or image sources at different scales and resolutions. It leverages the mathematical tool called wavelet transform to achieve this. The application of wavelet-based image fusion is prevalent in diverse fields like remote sensing, medical imaging, computer vision, and surveillance. This technique is vital in these domains as it enables the integration of information from multiple sources, facilitating decision-making, analysis, and visualization processes. Wavelet-based image fusion, while a powerful technique, has its limitations. The method can introduce artefacts like ringing and blurring effects in the fused output images. The proposed method removes the blurring effects using Wavelet fusion of the Weighted High-Boost Filters (HBF) and Weighted Clipped Limited Adaptive Histogram Equalization (CLAHE). The Results of experimentation on the Medical Images Dataset and IR and Visible Images Dataset using performance metrics alias entropy, NIQE, and BRISQUE have proven the proposed method to be better.
The proceedings contain 18 papers. The special focus in this conference is on computer Vision, imaging and computergraphics Theory and Applications. The topics include: Flash, storm, and mistral: Hardware-friendly an...
ISBN:
(纸本)9783030267551
The proceedings contain 18 papers. The special focus in this conference is on computer Vision, imaging and computergraphics Theory and Applications. The topics include: Flash, storm, and mistral: Hardware-friendly and high quality tone mapping;a simple and exact algorithm to solve linear problems with 1-based regularizers;spatial and spectral calibration of a multispectral-augmented endoscopic prototype;random forests based image colorization;a survey on databases of facial macro-expression and micro-expression;real-time head pose estimation by tracking and detection of keypoints and facial landmarks;contact-less, optical heart rate determination in the field ambient assisted living;effective facial expression recognition through multimodal imaging for traumatic brain injured patient’s rehabilitation;3D articulated model retrieval using depth image input;haptic and touchless user input methods for simple 3D interaction tasks: Interaction performance and user experience for people with and without impairments;recommendations from a study of a multimodal positive computing system for public speaking;utilisation of linguistic and paralinguistic features for academic presentation summarisation;An ROI visual-analytical approach for exploring uncertainty in reservoir models;a descriptive attribute-based framework for annotations in data visualization;tabularVis: An interactive relationship visualization tool supported by optimization and search algorithms;visual computing methods for assessing the well-being of older people.
Mathematical billiards assume a table of a certain shape and dynamical rules for handling collisions. Some trajectories exhibit distinguished patterns. Detecting such trajectories manually for a given billiard is cumb...
详细信息
ISBN:
(纸本)9789897584022
Mathematical billiards assume a table of a certain shape and dynamical rules for handling collisions. Some trajectories exhibit distinguished patterns. Detecting such trajectories manually for a given billiard is cumbersome, especially, when assuming an ensemble of billiards with different parameter settings. We propose a visual analysis approach for simulation ensembles of billiard dynamics based on phase-space visualizations and multi-dimensional scaling. We apply our methods to the well-studied approach of dynamical billiards for validation and to the novel approach of symplectic billiards for new observations.
Dental periapical lesions help doctors understand your oral health needs and must be found accurately to deliver proper treatment. The research introduces a new way to find dental periapical lesions by combining Retin...
详细信息
ISBN:
(数字)9798331544607
ISBN:
(纸本)9798331544614
Dental periapical lesions help doctors understand your oral health needs and must be found accurately to deliver proper treatment. The research introduces a new way to find dental periapical lesions by combining Retinex image processing with YOLOv8 object detection. The Retinex method lets the model detect periapical lesions more accurately because it enhances image details while removing picture noise. Test outcomes show enhanced lesion detection that performs well with precise accuracy in tough image scenarios. The research demonstrates how advanced image processing helps medical imaging deep learning systems work better. The Objective of this work is to improve visibility of periapical lesions by Retinex based enhancement and fast detection through YOLOv8.
This study concerns the use of complex imaging methods like integrating ResNet, Generative Adversarial Networks (GAN), and U- Net with a single set of variables. Improving medical image analysis for lung-related disea...
详细信息
Ocular diseases remain a substantial public health concern, necessitating the development of precise and efficient diagnostic methodologies. Notably, despite advances in medical imaging and artificial intelligence, a ...
详细信息
ISBN:
(数字)9798331519094
ISBN:
(纸本)9798331519100
Ocular diseases remain a substantial public health concern, necessitating the development of precise and efficient diagnostic methodologies. Notably, despite advances in medical imaging and artificial intelligence, a critical research gap persists in achieving accurate and dependable detection of major eye conditions. In response to this pressing need, our research presents an innovative approach for the identification of five major eye diseases, employing a Vision Transformer (ViT) model in tandem with the GradCam visualization technique. Our method not only bridges this existing research gap but also demonstrates remarkable performance, surpassing the capabilities of existing systems. With an exceptional accuracy of 90.3%, a precision rate of 90.0%, a recall rate of 90.0%, and an F1-score of 90.0%, our proposed method sets a new benchmark for the accurate and reliable diagnosis of ocular conditions. To ensure the robustness and consistency of our approach, we conducted rigorous 5-fold cross-validation, achieving an average accuracy of 89.64%. This research represents a significant advancement in the field of eye disease detection, promising to yield improved clinical outcomes and patient care.
Relocation of haptic feedback from the fingertips to the wrist has been considered as a way to enable haptic interaction with mixed reality virtual environments while leaving the fingers free for other tasks. We prese...
详细信息
ISBN:
(数字)9781665479271
ISBN:
(纸本)9781665479271
Relocation of haptic feedback from the fingertips to the wrist has been considered as a way to enable haptic interaction with mixed reality virtual environments while leaving the fingers free for other tasks. We present a pair of wrist-worn tactile haptic devices and a virtual environment to study how various mappings between fingers and tactors affect task performance. The haptic feedback rendered to the wrist reflects the interaction forces occurring between a virtual object and virtual avatars controlled by the index finger and thumb. We performed a user study comparing four different finger-totactor haptic feedback mappings and one no-feedback condition as a control. We evaluated users' ability to perform a simple pick-and-place task via the metrics of task completion time, path length of the fingers and virtual cube, and magnitudes of normal and shear forces at the fingertips. We found that multiple mappings were effective, and there was a greater impact when visual cues were limited. We discuss the limitations of our approach and describe next steps toward multi-degreeof-freedom haptic rendering for wrist-worn devices to improve task performance in virtual environments.
Landscape design, an integral aspect of enhancing natural beauty and greenery, is governed by several considerations when it comes to its spatial distribution. Given its significance, there's a growing need to res...
详细信息
ISBN:
(数字)9798350369212
ISBN:
(纸本)9798350380002
Landscape design, an integral aspect of enhancing natural beauty and greenery, is governed by several considerations when it comes to its spatial distribution. Given its significance, there's a growing need to research methods that effectively simulate and assess the rationality of such designs. These designs draw from the principles of landscape architecture, aligning with broader planning objectives and leveraging a range of scientific, technological, and artistic tools to shape and preserve outdoor spaces. Meanwhile, virtual reality stands out as a transformative concept and technology, enabling humans to perceive and remodel their surroundings. It essentially bridges the gap between humans and the objects they seek to understand or alter for specific purposes. Using this technology as a foundation, our research delves into the 3D simulation of landscape designs. We examine the flow of landscape features, construct a simulation model using these features, and present the evaluation outcomes through 3D visualization. Comparative tests reveal that this 3D simulation-based evaluation offers both accuracy and swiftness, aligning well with user requirements.
暂无评论