Programming models like OpenMP offer expressive interfaces to program graphics processing units (GPUs) via directive-based offload. By default, these models copy data to or from the device without overlapping computat...
详细信息
ISBN:
(纸本)9798400705977
Programming models like OpenMP offer expressive interfaces to program graphics processing units (GPUs) via directive-based offload. By default, these models copy data to or from the device without overlapping computation, thus impacting performance. Rather than leave the onerous task of manually pipelining and tuning data communication and computation to the end user, we propose an OpenMP extension that supports block-level pipelining and, in turn, present our block-level pipelining (BLP) approach that overlaps data communication and computation in a single kernel. BLP uses persistent thread blocks with cooperative thread groups to process sub-tasks on different streaming multiprocessors and uses GPU flag arrays to enforce task dependencies without CPU involvement. To demonstrate the efficacy of BLP, we evaluate its performance using multiple benchmarks on NVIDIA V100 GPUs. Our experimental results show that BLP achieves 95% to 114% of the performance of hand-tuned kernel-level pipelining. In addition, using BLP with buffer mapping can reduce memory usage to support GPU memory oversubscription. We also show that BLP can reduce memory usage by 75% to 86% for data sets that exceed GPU memory while providing significantly better performance than CUDA Unified Memory (UM) with prefetching optimizations.
The Metaverse, a synergistic blend of physical and virtual realities, is rapidly evolving as a hub for digital twins and digital avatars, offering transformative potential in domains ranging from smart manufacturing t...
详细信息
ISBN:
(纸本)9798350381771;9798350381764
The Metaverse, a synergistic blend of physical and virtual realities, is rapidly evolving as a hub for digital twins and digital avatars, offering transformative potential in domains ranging from smart manufacturing to kinematic examination. This paper introduces an AI-empowered framework for the synthesis of Metaverse avatars through a monocular camera-based system. Our approach integrates a shared control system and advanced multistage filtering to refine sensor data, substantially enhancing pose precision and avatar realism. Experimental results demonstrate our framework's superiority in reducing data jitter and improving network transmission. By harnessing the power of AI and network optimization, our system ensures a cost-effective and accurate solution that enhances user interactivity and presence within the Metaverse. The implications of this work extend beyond immediate interaction benefits, setting a precedent for future immersive and inclusive Metaverse experiences. Future work will focus on expanding the framework's capacity for larger user interactions and incorporating richer features such as nuanced facial and gesture recognition to deliver fully expressive avatars, thereby elevating the Metaverse experience.
The scientific and technical aspects of gun barrel movement for effective attack and defense are of ultimate significance. The precise control of the gun barrel is of strategic importance during targeting, especially ...
详细信息
The scientific and technical aspects of gun barrel movement for effective attack and defense are of ultimate significance. The precise control of the gun barrel is of strategic importance during targeting, especially in changing environmental conditions. Fuzzy logic control offers a powerful alternative to classical and manual solutions to deal with the complexity and uncertainties of dynamic systems, thus increasing targeting accuracy and precision. This approach provides adaptability and intuitive parameterization while simplifying system design and implementation. On the other hand, in fuzzy control management, accurate modeling and visualization of gun barrel movement is essential to achieve efficient aiming and performance. Many papers and realizations can be found on the (automatic) fuzzy control of cannon barrels. In this paper, the authors also suggest an implementation of a fuzzy-controlled cannon barrel. The novelty of this approach is the application of new defuzzification methods, resulting in an accurate solution for the problem. The article starts with a review of the theory and literature on the control of cannon barrels. It is followed by a comparison of different implementations, including simulation tests on the accuracy, and a discussion of some practical issues.
An accurate model for optical domain equalization is proposed and verified with experiments. The modeled spectra fit well with measured curves, and prediction accuracy is better than 0.2 dB if used for filtering penal...
详细信息
This paper presents an operations-research-based improvement framework for a waste collection system in the capital city of India, Delhi. The existing manual system divides the urban region into ‘wards’, each with i...
详细信息
Urban challenges like air pollution, traffic congestion, and road quality demand precise and timely detection for effective city monitoring. Traditional stationary sensors, costly with limited coverage, have prompted ...
详细信息
ISBN:
(纸本)9798350363999;9798350364002
Urban challenges like air pollution, traffic congestion, and road quality demand precise and timely detection for effective city monitoring. Traditional stationary sensors, costly with limited coverage, have prompted the emergence of mobile sensing (e.g., vehicular urban sensing), which utilizes ubiquitous mobile devices (e.g., sensors on vehicles) for sensing tasks. While commercial vehicles (e.g., taxis) are favored for urban sensing due to easy data collection, biased spatiotemporal distributions often result in uneven sensing coverage. Despite efforts to maximize task completion and incentivize participation, fairness in coverage distribution remains neglected. To address this issue, we propose a hybrid approach integrating Dedicated Sensing Vehicles (DSVs) and commercial vehicles to optimize coverage while accounting for fairness constraints. Routing these DSVs for fairness-aware spatiotemporal coverage poses unique challenges, including achieving high coverage with limited DSVs, prioritizing underserved areas, and ensuring cost efficiency for financial sustainability. To overcome these challenges, in this paper, we design FairSense, a system, which combines the long-term reward optimization capabilities of deep reinforcement learning (DRL) with submodular maximization. FairSense integrates a submodular reward function with spatio-temporal fairness constraints into a DRL algorithm to prioritize long-term gain and fairness, enabling informed path planning for DSVs, thereby enhancing efficiency and fairness in sensing coverage. We evaluate our FairSense with real-world large-scale vehicle GPS data from two cities (i.e., Shenzhen and Beijing), and extensive experiments show our FairSense can effectively improve sensing coverage efficiency and also improve sensing fairness.
In the area of Pattern Recognition, specifically in the field of Handwriting Recognition, will be presented modeling a system of training and recognition of handwritten characters and figures by means an Abstract stat...
详细信息
This study explores the potential of system-on-Chip (SoC) IoT technology in revolutionizing smart poultry manufacturing processes. It discusses the applications and advancements of SoC IoT solutions in enhancing autom...
详细信息
As a popular way of representing holographic video or volumetric video, point cloud video can provide users with a highly immersive viewing experience of 6 degrees of freedom (6DoF) and is expected to become the mains...
详细信息
The proceedings contain 80 papers. The topics discussed include: randomized protocols for resilient peer-to-peer networks;evaluating the performance of 5G NR in indoor environments: an experimental study;an approach t...
The proceedings contain 80 papers. The topics discussed include: randomized protocols for resilient peer-to-peer networks;evaluating the performance of 5G NR in indoor environments: an experimental study;an approach towards reducing training time of the input doubling method via clustering for middle-sized data analysis;analysis of legislative framework governing biometric data;designing a testbed for privacy-preserving voice-based social networks;classical machine learning and large models for text-based emotion recognition;one hop routing optimization approach using machine learning;toward an enhanced stock market forecasting with machine learning and deep learning models;smart agriculture system based on IoT reference architecture and service choreography;and privacy-preserving crime reporting and rewarding in the presence of active insider and outsider adversaries.
暂无评论