We present DIO, a generic tool for observing inefficient and erroneous I/O interactions between applications and in-kernel storage systems that lead to performance, dependability, and correctness issues. DIO facilitat...
详细信息
ISBN:
(纸本)9798350325430
We present DIO, a generic tool for observing inefficient and erroneous I/O interactions between applications and in-kernel storage systems that lead to performance, dependability, and correctness issues. DIO facilitates the analysis and enables near real-time visualization of complex I/O patterns for data-intensive applications generating millions of storage requests. This is achieved by non-intrusively intercepting system calls, enriching collected data with relevant context, and providing timely analysis and visualization for traced events. We demonstrate its usefulness by analyzing two production-level applications. Results show that DIO enables diagnosing resource contention in multi-threaded I/O that leads to high tail latency and erroneous file accesses that cause data loss.
An embedded flexible shape cable has been first proposed and the verification of real-time shape measurement has been achieved. It is produced by sequentially preparing shape sensing units, shape sensing optical cable...
详细信息
This paper describes the approach for performance estimation of data queries in time series databases, implemented for the remote monitoring and control of injection plastic moulding systems. Experimental setup, struc...
详细信息
Steganography is the method that convert data into bits to make it suitable for hiding. While embedding data the factors such as stego image quality and distortion which cannot be observed by the human eyes. The aim o...
详细信息
Employing high-performance neural network models is challenging for resource-constrained devices as the models require strong computing power and large memory space. One-shot neural architecture search (NAS) helps to ...
详细信息
ISBN:
(数字)9798350387957
ISBN:
(纸本)9798350387964
Employing high-performance neural network models is challenging for resource-constrained devices as the models require strong computing power and large memory space. One-shot neural architecture search (NAS) helps to find a suitable neural architecture more efficiently by using a single one-shot network that shares weights with sub-networks. However, the existing one-shot NAS methods pay little attention to the sparsity achievable from the one-shot network. Therefore, there still may be room to reduce the resource requirement of the network. This work presents a new one-shot sparse NAS method, which tries to find an optimal sparsity for the network using soft channel masking during the architecture search. The experimental results show that the proposed method can find a more sparse architecture with little accuracy drop.
Due to the low temperature and temperature variety of high altitude environment, it is very difficult to identify dynamic parameters and achieve an accurate state-of-charge (SOC) estimation for Li-polymer batteries us...
详细信息
Modern software systems in every application domain are increasingly built as distributed systems. Business applications are structured as cooperating microservices, IoT devices communicate with cloud-based services o...
详细信息
ISBN:
(纸本)9798350366266;9798350366259
Modern software systems in every application domain are increasingly built as distributed systems. Business applications are structured as cooperating microservices, IoT devices communicate with cloud-based services over a network, and Web sites store data in globally dispersed data centers to support fast access in to localities in which their users reside. Behind all these systems lurk distributed computing infrastructures that architects and engineers must exploit to satisfy application service level agreements. To be successful, it is essential that architects understand the inherent complexity of distributed systems. In this half day tutorial, I'll guide the attendees through the fundamental characteristics that distributed systems exhibit. Each characteristic will be related to the software architecture quality attributes that they directly impact. The topics covered include communications reliability and latencies, message delivery semantics, state management, idempotence, data safety, consistency, time, distributed consensus, cascading failures and failover and recovery. I'll introduce each concept using an example distributed system and multiple 'props' to illustrate concepts. Once I've explained a concept using the example, I'll move on to show how the concept manifests itself in a software system and its effects on quality attributes requirements and inherent trade-offs. The tutorial will be suitable for graduate students, engineers and architects who have no or minimal exposure to distributed systems concepts. The presentation format will be suitable for a mix of both in person and remote participants. It will combine interactive sessions with short technical explanations and examples to illustrate each distributed systems concept.
Dynamic Voltage and Frequency Scaling (DVFS) is an effective means to reduce the energy dissipation of digital designs. While on most commodity FPGAs, memory and processor have separately controlled voltages, the prog...
详细信息
ISBN:
(纸本)9798331530082;9798331530075
Dynamic Voltage and Frequency Scaling (DVFS) is an effective means to reduce the energy dissipation of digital designs. While on most commodity FPGAs, memory and processor have separately controlled voltages, the programmable logic section relies on a single voltage rail and thus imposes the same voltage for all hardware accelerators that operate concurrently. Finding time slots eligible for voltage scaling gets difficult in virtualized systems, where the FPGA is shared by tasks executed in multiple guest operating systems. The situation gets even more complicated, when error-tolerant tasks are considered that allow the voltage to be reduced below its nominal value, which could provoke a certain rate of faulty hardware accelerator runs. As a solution, we propose a strategy that synchronizes concurrently executed periodic hardware tasks under consideration of their reliability as well as their real-time requirements so that the supply voltage is controlled accordingly. The proposed strategy can be combined with further mechanisms for saving energy. Our run-time module performs clock gating and adjusts the voltage to the requirements of aperiodic tasks. For fault-tolerant tasks, we monitor the error rate using Algorithm Based Fault Tolerance (ABFT) that can detect and characterize errors with an accuracy close to 100%. Compared to a strategy that scales voltage without synchronizing hardware tasks, we achieve in the best case a power saving by 29.4% and an average saving by 7%.
In the study, a real-time pill identification with prescription confirmation within medical packaging using deep learning on embedded system is proposed. In this system, multiple pills within a medical packaging for i...
详细信息
Advancements in quantum computing underscore the critical need for sophisticated qubit readout techniques to accurately discern quantum states. This abstract presents our research intended for optimizing readout pulse...
详细信息
ISBN:
(纸本)9798331541378
Advancements in quantum computing underscore the critical need for sophisticated qubit readout techniques to accurately discern quantum states. This abstract presents our research intended for optimizing readout pulse fidelity for 2D and 3D Quantum Processing Units (QPUs), the latter coupled with Superconducting Radio Frequency (SRF) cavities. Focusing specifically on the application of the Least Mean Squares (LMS) adaptive filtering algorithm, we explore its integration into the FPGA-based control systems to enhance the accuracy and efficiency of qubit state detection by improving Signal-to-Noise Ratio (SNR). Implementing the LMS algorithm on the Zynq UltraScale+ RFSoC Gen 3 devices (RFSoC 4x2 FPGA and ZCU216 FPGA) using the Quantum Instrumentation Control Kit (QICK) open-source platform, we aim to dynamically test and adjust the filtering parameters in real-time to characterize and adapt to the noise profile presented in quantum computing readout signals. Our preliminary results demonstrate the LMS filter's capability to maintain high readout accuracy while efficiently managing FPGA resources. These findings are expected to contribute to developing more reliable and scalable quantum computing architectures, highlighting the pivotal role of adaptive signal processing in quantum technology advancements.
暂无评论