ISBN: (print) 9798400708725
This paper presents 3description, an experimental human-AI collaborative approach for intuitive 3D modeling. 3description aims to address accessibility and usability challenges in traditional 3D modeling by enabling non-professional individuals to co-create 3D models using verbal and gesture descriptions. Through a combination of qualitative research, product analysis, and user testing, 3description integrates AI technologies such as Natural Language Processing and Computer Vision, powered by OpenAI and MediaPipe. Recognizing that the web has wide cross-platform capabilities, 3description is web-based, allowing users to describe the desired model and subsequently adjust its components using verbal and gestural inputs. In the era of AI and emerging media, 3description not only contributes to a more inclusive and user-friendly design process, empowering more people to participate in the construction of the future 3D world, but also strives to increase human engagement in co-creation with AI, thereby avoiding undue surrender to technology and preserving human creativity (Figure 1).
A sound technique forms the fundamental basis for many sports, particularly Martial Arts, as it often distinguishes between successful hits and being hit. However, the process of improving one's technique is highl...
Molecular representation learning (MRL) has gained tremendous attention due to its critical role in learning from limited supervised data for applications like drug design. In most MRL methods, molecules are treated a...
We present VR PreM+, an innovative VR system designed to enhance web exploration beyond traditional computer screens. Unlike static 2D displays, VR PreM+ leverages 3D environments to create an immersive pre-learning e...
ISBN: (print) 9781450390958
The WebXR Device API allows the creation of web browser-based eXtended Reality (XR) applications, i.e. Virtual Reality (VR) and Augmented Reality (AR) applications, by providing access to input and output capabilities from AR and VR devices. WebXR applications can be experienced through the WebXR-supported browsers of standalone and PC-connected VR headsets, AR headsets, mobile devices with or without headsets, and personal computers or desktops. WebXR has been growing in popularity since its introduction due to the several benefits it promises, such as allowing creators to utilise WebGL's rich development ecosystem; create cross-platform and future-proof XR applications; and create applications that can be experienced in VR or AR with minimal code changes. However, research has not been conducted to understand the experiences of WebXR creators. To address this gap, we conducted a qualitative study that involved interviews with 11 WebXR creators with diverse backgrounds, aimed at understanding their experiences, and in this paper, we present 8 key challenges reported by these creators.
The pluralistic face completion system is developed as a web application that generates multiple face images for a face covered by a face mask. The web application consists of five modules where it deals w...
ISBN: (digital) 9798331508913
ISBN: (print) 9798331508920
The majority of today's web traffic consists of high-resolution videos for streaming, conferencing, and surveillance. Transmitting raw video over the web demands high bandwidth, creating a pressing need for effective compression. In this paper, we focus on compressing videos captured by moving agents, such as drones or vehicles, used in IoT-based surveillance systems. Our method exploits 3D scene geometry and camera poses to reduce redundancy while preserving quality. Unlike approaches that rely on Structure-from-Motion (SfM), we employ an efficient Simultaneous Localization & Mapping (SLAM) technique for camera localization that can run on typical IoT devices with limited computational power. We analyze the application of SLAM for video compression and tackle the practical challenges inherent to SLAM systems. Simulation results demonstrate significant compression gains and a notable reduction in computational complexity compared to state-of-the-art SfM methods. This makes the proposed approach highly suitable for video streaming in modern web communications and IoT applications.
Authors: Narasimhayya, B.E.; Vinay Kumar, S.B.
Department of Computer Science and Engineering, Bangalore, India
Department of Electronics and Communication Engineering, Faculty of Engineering and Technology, Bangalore, India
In today's agribusiness, using robotics, machine learning, and the web of things has become standard practice. The agricultural business is being forced to embrace such new methods in order to address issues ...
ISBN: (digital) 9798331508913
ISBN: (print) 9798331508920
Accurate IQ detection through non-invasive methods, such as brain MRI analysis, can play a critical role in the early diagnosis and prevention of neurological disorders. This study proposes a hybrid approach that leverages the feature extraction power of Convolutional Neural Networks (CNNs) and the robust classification capabilities of the XGBoost algorithm to enhance the accuracy of IQ prediction models. CNNs effectively process MRI images by extracting complex visual features, while XGBoost optimizes classification performance through ensemble learning techniques. The integration of these methods results in a more precise and reliable IQ detection system. Furthermore, the model's architecture and data processing techniques offer potential for adaptation in web-based diagnostic platforms and large-scale, distributed medical research applications. This study presents an innovative approach by combining convolutional neural networks (VGG16 and ResNet-50) with the XGBoost algorithm to classify intelligence levels from brain MRI images. Unlike previous studies that used CNNs solely for feature extraction, this research employs XGBoost for final classification, achieving 83% accuracy with the VGG16+XGBoost model, a notable improvement over the 73% reported in the most similar prior study. The use of 3D NIfTI data and sagittal slices for enhanced data analysis is also a key innovation of this research.