Collaborative Inference is a prospective paradigm for accelerating Deep Neural Network (DNN) inference by harnessing the computational resources of multiple devices. However, in highly lossy network environments, such...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Collaborative Inference is a prospective paradigm for accelerating Deep Neural Network (DNN) inference by harnessing the computational resources of multiple devices. However, in highly lossy network environments, such as those encountered in wireless communication systems, the transmission loss of intermediate feature maps between devices can result in significant degradation of co-inference accuracy. In this paper, we first conduct a comprehensive investigation into the impact of intermediate feature map loss in real-world wireless scenarios and provide an in-depth analysis of loss patterns under UDP transmission. Motivated by these observations, we introduce Robust Co-inference Framework (RCIF), a novel framework that employs a hierarchical mask strategy to selectively drop activations at two different scales of feature maps. This approach enhances the robustness of DNN co-inference in the presence of network losses. Our evaluation on a variety of datasets and network architectures demonstrates that RCIF significantly enhances the accuracy and robustness of distributed DNN co-inference under highly lossy network conditions. Specifically, our results show that RCIF can achieve up to a 659% increase in accuracy compared to the original model under particularly poor network conditions.
Internet of Things (IoT), with its capability to connect anything, anywhere, anytime, bringing in revolutionary changes in data collection, processing and utilization. IoT networks generate a huge amount of data, whic...
详细信息
In the dynamic landscape of modern communication, the demand for innovative virtual video conferencing solutions is ever-increasing. Our work presents an innovative approach to building a virtual video conferencing sy...
详细信息
ISBN:
(纸本)9798350391893;9798350391886
In the dynamic landscape of modern communication, the demand for innovative virtual video conferencing solutions is ever-increasing. Our work presents an innovative approach to building a virtual video conferencing system that can be used by remote users with the help of a web page. Our system allows remote participants, joining via a web browser, to freely navigate and view the virtual environment from any angle, enhancing spatial awareness and engagement. Additionally, our system grants participants the freedom to view the environment independently, even if the host restricts certain views which is one of the main drawbacks of the current video conferencing systems. Unlike current platforms, our solution also allows users to choose their appearance location within the virtual space and this feature is missing in the current systems. Furthermore, our system is highly customizable, enabling the integration of features such as recording specific portions of the screen, which are not available in existing video conferencing tools. This flexibility ensures a more immersive, interactive, and personalized meeting experience, significantly advancing the capabilities of remote collaboration technologies. Our work highlights the results of research carried out to create a virtual conference setting in the Unity environment and establish successful real-time communication between the webpage and the Unity environment. In this virtual setting, monitors act as participants, and participants can choose on which monitor they want to appear. Participants join this virtual meeting setup from a webpage, which consists of two windows: the first window shows the participants themselves, and the second window displays the virtual meeting setup. Participants can observe this environment from any perspective they want, navigating using a keyboard and a mouse. Since we are implementing everything from scratch, we have full control over every feature and functionality with Agora video SDK
Multi-chiplet architecture can provide a high-performance solution for new tasks such as deep learning models. In order to fully utilize chiplets and accelerate the execution of deep learning models, we present a deep...
详细信息
The rapid development of Internet of Things (IoT) technology has driven the intelligent transformation of various industries, including manufacturing, agriculture, and healthcare, etc., significantly enhancing their m...
详细信息
In recent years, the demand for high-performance computing in various disciplines of ICTs has affected the dependency on single core processes due to its decline in efficiency. Whereas, this requirement of advance in ...
详细信息
作者:
Hong, PengHe, ShupingFang, XiaohanAnhui University
Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education School of Electrical Engineering and Automation Hefei230601 China
With the development of distributed energy systems, pricing different energy sources in microgrids has become a significant challenge. To solve this problem, this paper proposes an real-time pricing method through mar...
详细信息
Continuous performance monitoring is critical for maintaining optimal performance of High-Performance computing resources. This is especially important for technological test bed systems, in which software updates occ...
详细信息
We present NADA, a Network Attached Deep learning Accelerator. It provides a flexible hardware/software framework for training deep neural networks on ethernet-based FPGA clusters. The NADA hardware framework instanti...
详细信息
ISBN:
(纸本)9783031661457;9783031661464
We present NADA, a Network Attached Deep learning Accelerator. It provides a flexible hardware/software framework for training deep neural networks on ethernet-based FPGA clusters. The NADA hardware framework instantiates a dedicated entity for each layer in a model. Features and gradients flow through these entities in a tightly pipelined manner. From a compact description of a model and target cluster, the NADA software framework generates specific configuration bitstreams for each particular FPGA in the cluster. We demonstrate the scalability and flexibility of our approach by mapping an example CNN onto a cluster consisting of three up to nine Intel Arria 10 FPGAs. To verify NADAs effectiveness for commonly used networks, we train MobileNetV2 on a six-node cluster. We address the inherent incompatibility of the tightly pipelined layer parallel approach with batch normalization by using online normalization instead.
Active distribution networks (ADNs) serve as a linchpin in contemporary power systems, tasked with accommodating the rising penetration of renewable energy sources while ensuring efficient and reliable operation. This...
详细信息
暂无评论