SenStick-Eye is a novel sensing platform that captures behavioral insights from small everyday objects, such as cutlery, toothbrushes, and pens, by integrating a G-sensor (accelerometer) and a monocular RGB camera (48...
详细信息
ISBN:
(数字)9798331521165
ISBN:
(纸本)9798331521172
SenStick-Eye is a novel sensing platform that captures behavioral insights from small everyday objects, such as cutlery, toothbrushes, and pens, by integrating a G-sensor (accelerometer) and a monocular RGB camera (480×480 pixels). This compact system enables continuous monitoring of human-object interactions, facilitating object-centric behavioral analysis. By collecting both motion and visual data, SenStick-Eye reveals patterns and triggers in habitual behaviors that often go unnoticed. The platform is designed for applications in just-in-time interventions, habit formation, and behavior change technologies, offering new opportunities for AIoT-based systems to support healthier habits.
When trying to avoid collision in a scene with other persons, one makes an initial guess of a location where the conflict might happen before choosing paths. Then the estimated conflict spot serves as a deciding facto...
详细信息
ISBN:
(数字)9798331514846
ISBN:
(纸本)9798331525637
When trying to avoid collision in a scene with other persons, one makes an initial guess of a location where the conflict might happen before choosing paths. Then the estimated conflict spot serves as a deciding factor in choosing a safe trajectory to avoid the intruder. However, the predicted collision point is subject to one’s estimate of the other person’s future movement. The prediction is linked to guess made with a degree of uncertainty. In this paper, we summarize on how we use AR visualizations to symbolize different levels of uncertainty of predicted collision spots to study path choice influences
We propose Metabook, a system to automatically generate interactive AR storybooks to improve children’s reading interest. Metabook introduces a story-to-3D-book generation scheme and a 3D avatar that combines multipl...
详细信息
ISBN:
(数字)9798331514846
ISBN:
(纸本)9798331525637
We propose Metabook, a system to automatically generate interactive AR storybooks to improve children’s reading interest. Metabook introduces a story-to-3D-book generation scheme and a 3D avatar that combines multiple AI models as a reading companion. Our user study shows that Metabook can significantly increase children’s interest in reading. Teachers acknowledged Metabook’s effectiveness in enhancing reading enthusiasm by connecting verbal and visual thinking, expressing high expectations for its future potential in education.
This position paper introduces the concept of a real-time auditory feedback system designed to promote physical activity and improve well-being through visual skill development in ball sports. The system uses event ca...
详细信息
ISBN:
(数字)9798331514846
ISBN:
(纸本)9798331525637
This position paper introduces the concept of a real-time auditory feedback system designed to promote physical activity and improve well-being through visual skill development in ball sports. The system uses event cameras to translate ball trajectories into intuitive sound cues and complements visual perception with high-temporal-resolution auditory feedback, enabling precise timing and enhanced engagement during practice. Unlike conventional setups requiring wearables or complex hardware, this approach ensures seamless interaction while preserving natural movement. This multimodal approach can potentially offer a scalable and immersive solution for various sports and real-world environments.
Depth perception in AR describes the ability to perceive depth from AR-generated objects. This functionality enables an emergent capability: x-ray vision, or visualizing objects past an occluding surface. To evaluate ...
详细信息
ISBN:
(数字)9798331514846
ISBN:
(纸本)9798331525637
Depth perception in AR describes the ability to perceive depth from AR-generated objects. This functionality enables an emergent capability: x-ray vision, or visualizing objects past an occluding surface. To evaluate x-ray vision’s feasibility, we propose experiments that depict a virtual object beyond a solid wall. In one condition this display is mitigated with a semi-transparent virtual window, while in another condition no such effect is presented. Depth estimates will be measured using triangulation by walking. Our motivation is to understand discrepancies between perceived distances of virtual and real-world objects so that AR systems can accurately display objects in the real-world.
Embodied conversational agents (ECAs) capable of nonverbal behaviors have been developed to address the limitations of voice-only assistants. Research has explored their use in augmented reality (AR), suggesting they ...
详细信息
ISBN:
(数字)9798331514846
ISBN:
(纸本)9798331525637
Embodied conversational agents (ECAs) capable of nonverbal behaviors have been developed to address the limitations of voice-only assistants. Research has explored their use in augmented reality (AR), suggesting they may soon interact with us more naturally in physical spaces. However, the question of how they should enter the user’s space when summoned remains under-explored. In this paper, we focused on the plausibility of ECAs’ entering action into the user’s field of view in AR. We analyzed its impact on users’ perceived social presence and functionality of the agent. Our results indicated that the plausibility of the action significantly affected social presence and had a marginal effect on perceived functionality.
We present ‘Revelio’, a real-world screen-camera communication system leveraging temporal flicker fusion in the OKLAB color space. Using spatially-adaptive flickering and encoding information in pixel region shapes,...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
We present ‘Revelio’, a real-world screen-camera communication system leveraging temporal flicker fusion in the OKLAB color space. Using spatially-adaptive flickering and encoding information in pixel region shapes, Revelio achieves visually imperceptible data embedding while remaining robust against noise, asynchronicity, and distortions in screen-camera channels, ensuring reliable decoding by standard smartphone cameras. The decoder, driven by a two-stage neural network, uses a weighted differential accumulator for precise frame detection and symbol recognition. Initial experiments demonstrate Revelio’s effectiveness in interactive television, offering an unobtrusive method for meta-information transmission.
The abundance of digital data available in today’s environment can improve quality of life and offer helpful insights. In the e-commerce space, customer evaluations and comments about a product may truly be utilized ...
详细信息
ISBN:
(数字)9798331531935
ISBN:
(纸本)9798331531942
The abundance of digital data available in today’s environment can improve quality of life and offer helpful insights. In the e-commerce space, customer evaluations and comments about a product may truly be utilized to gain valuable information for the seller, which can then be applied to the product’s enhancement. This study seeks to collect comments on the product’s surrounding features as well as the polarity of the sentiments expressed in product reviews. By doing this, the vendor will be able to comprehend the consumer market, consider ideas, and adjust their product properly. The model implementation consists of three parts, aspect term extraction, opinion term extraction, and sentiment score calculation. An unsupervised rule-based model has been implemented to extract aspect and opinion terms, whereas a BERT model to obtain the sentiment score.
Visual prosthesis that applies electrical stimulation to restore functional vision has a promising prospect. However, visual percepts generated by current visual prosthesis are low resolution with unruly color and res...
详细信息
ISBN:
(数字)9798331529482
ISBN:
(纸本)9798331529499
Visual prosthesis that applies electrical stimulation to restore functional vision has a promising prospect. However, visual percepts generated by current visual prosthesis are low resolution with unruly color and restricted grayscale. This severely restricts the ability of prosthetic implant to complete visual tasks in daily scenes. At present, some studies use existing image processing techniques to improve the perceptions of objects in prosthetic vision. However, most of these studies detect the static objects and optimize the visual percepts in general dynamic scenes. This greatly limits the application of visual prosthesis in high dynamic scenes. In this study, a novel moving object detection model is proposed to automatically extract the moving objects in high dynamic scene. In this model, the optical flow, color and luminance are applied to form different dimensional feature space, and construct the moving object map according to the visual saliency in different feature channel. The proposed method can uniformly highlight the moving object and keep good boundaries in high dynamic scene. The simulation results indicate that the proposed moving object model can effectively detect the moving object.
A major challenge in Fine-Grained Visual Classification (FGVC) is distinguishing various categories with high inter-class similarity by learning the feature that differentiates the details. Conventional cross-entropy ...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
A major challenge in Fine-Grained Visual Classification (FGVC) is distinguishing various categories with high inter-class similarity by learning the feature that differentiates the details. Conventional cross-entropy trained Convolutional Neural Network (CNN) fails this challenge as they may suffer from producing inter-class invariant features in FGVC. In this work, we innovatively propose to regularize the training of CNN by enforcing the uniqueness of the features of each category from an information-theoretic perspective. To achieve this goal, we formulate a minimax loss based on a game-theoretic framework, where a Nash equilibrium is proved to be consistent with this regularization objective. Besides, to avoid getting a solution that produces redundant features, we present a Feature Redundancy Loss (FRL) based on the normalized inner product between each selected feature map pair to complement the proposed minimax loss. The proposed method is versatile, as it can be utilized as a regularizer for features in the mid-level or the penultimate layer, and can be combined with any architectures. Extensive experimental results on several influential benchmarks along with visualization show that our method obtains significant improvement over the baseline model without extra cost and achieves state-of-the-art results.
暂无评论