Implicit Neural Representation (INR) is an innovative approach for representing complex shapes or objects without explicitly defining their geometry or surface structure. Instead, INR represents objects as continuous ...
详细信息
ISBN:
(纸本)9798350322255
Implicit Neural Representation (INR) is an innovative approach for representing complex shapes or objects without explicitly defining their geometry or surface structure. Instead, INR represents objects as continuous functions. Previous research has demonstrated the effectiveness of using neural networks as INR for image compression, showcasing comparable performance to traditional methods such as JPEG. However, INR holds potential for various applications beyond image compression. This paper introduces Rapid-INR, a novel approach that utilizes INR for encoding and compressing images, thereby accelerating neural network training in computer vision tasks. Our methodology involves storing the whole dataset directly in INR format on a GPU, mitigating the significant data communication overhead between the CPU and GPU during training. Additionally, the decoding process from INR to RGB format is highly parallelized and executed on-the-fly. To further enhance compression, we propose iterative and dynamic pruning, as well as layer-wise quantization, building upon previous work. We evaluate our framework on the image classification task, utilizing the ResNet-18 backbone network and three commonly used datasets with varying image sizes. Rapid-INR reduces memory consumption to only 5% of the original dataset size and achieves a maximum 6x speedup over the PyTorch training pipeline, as well as a maximum 1.2x speedup over the DALI training pipeline, with only a marginal decrease in accuracy. Importantly, Rapid-INR can be readily applied to other computer vision tasks and backbone networks with reasonable engineering efforts. Our implementation code is publicly available at https://***/sharc-lab/Rapid- INR.
Image compression is a topic of significant interest as it reduces file sizes in stored data. In this paper, we propose a model that achieves multiple levels of compression, thereby minimizing the storage space requir...
详细信息
Understanding the biomechanical impact of metastatic involvement in the spine is critical for patient care and treatment planning. However, limited access to human specimens and ethical concerns hinder research in thi...
详细信息
We propose a new representation of the offsets of the Lempel-Ziv (LZ) factorization based on the co-lexicographic order of the text's prefixes. The selected offsets tend to approach the k-th order empirical entrop...
详细信息
ISBN:
(纸本)9781665478939
We propose a new representation of the offsets of the Lempel-Ziv (LZ) factorization based on the co-lexicographic order of the text's prefixes. The selected offsets tend to approach the k-th order empirical entropy. Our evaluations show that this choice is superior to the rightmost and bit-optimal LZ parsings on datasets with small high-order entropy.
IoMT systems are evolving at a higher rate than other IoT applications. As health is of major concern, more health monitoring sensors and devices are incorporated which helps in the well-being of the people at their c...
详细信息
Information compression is a vital approach for optimization and optimization of the scale of a digital record without affecting its content material. This paper offers an entropy-based analysis of two distinct levels...
详细信息
Recently, there has been a tremendous demand for high-efficiency face video communications, coinciding with the popularization of the digital human character in numerous applications. This paper demonstrates a new com...
详细信息
ISBN:
(纸本)9781665478939
Recently, there has been a tremendous demand for high-efficiency face video communications, coinciding with the popularization of the digital human character in numerous applications. This paper demonstrates a new communication paradigm of 3D human digital characters in ultra low-bit-rate application scenarios. The paradigm is grounded on the mild assumption of the consistency and persistence of human ap-pearance, such that only the compact features that determine the pose and expression of the 3D character need to be transmitted. The proposed is also expected to benefit virtual-physical world interaction in Metaverse.
Modern real-time applications widely embed compute intense neural algorithms at their core. Current solutions to support such algorithms either deploy highly-optimized Deep Neural Networks at mobile devices or offload...
详细信息
ISBN:
(纸本)9798350326031;9798350326048
Modern real-time applications widely embed compute intense neural algorithms at their core. Current solutions to support such algorithms either deploy highly-optimized Deep Neural Networks at mobile devices or offload the execution of possibly larger higher-performance neural models to edge servers. While the former solution typically maps to higher energy consumption and lower performance, the latter necessitates the low-latency wireless transfer of high volumes of data. Time-varying variables describing the state of these systems, such as connection quality and system load, determine the optimality of the different computing configurations in terms of energy consumption, task performance, and latency. Herein, we propose Furcifer, a framework capable of dynamically adapting the cloud continuum computing configuration in response to the perceived state of the system. Our container-based approach incorporates low-complexity predictors that generalize well across operating environments. In addition, we develop a highly optimized split Deep Neural Network model, which achieves in-model supervised compression and enhances task offloading. Experimental results for object detection across diverse conditions, environments, and wireless technologies, show Furcifer's remarkable outcomes, including a 2x energy reduction, 30% higher mean Average Precision score than pure local computing, and a notable three-fold increase in frame per second rate compared to static offloading.
Nowadays, microphone array plays an important role in speech signal processing. This paper proposes a new parametric coding scheme to further improve the coding efficiency of linear microphone array. In the coding sch...
详细信息
ISBN:
(纸本)9781665478939
Nowadays, microphone array plays an important role in speech signal processing. This paper proposes a new parametric coding scheme to further improve the coding efficiency of linear microphone array. In the coding scheme, Inter-channel Level Difference (ICLD), Inter-channel Time Difference (ICTD) and Inter-channel Coherence (ICC) are used as spatial parameters. As shown in Figure 1, at the encoder, a down-mixed signal is obtained by averaging all signals time-aligned with the reference channel. Only the spatial parameters between the first channel and the last channel are extracted. At the decoder, the speech signal of each channel is reconstructed by the down-mixed signal received, spatial parameters and their interpolation.
Continuous Glucose Monitors are minimally-invasive portable sensors that are revolutionizing the management of Type 1 Diabetes (T1D). A common issue encountered in their daily use is related to the presence of pressur...
详细信息
ISBN:
(纸本)9798350338416
Continuous Glucose Monitors are minimally-invasive portable sensors that are revolutionizing the management of Type 1 Diabetes (T1D). A common issue encountered in their daily use is related to the presence of pressure-induced sensor attenuations (PISAs), temporary faults of the devices, resulting in false low blood glucose readings that can impact and compromise the reliability of CGMs. In this work, we explore the application of matched filters (MFs), a powerful pattern recognition technique, for the retrospective identification of PISAs failures. A MF is designed for the detection of a signal with a specific shape, associated with the occurrence of a PISA episode. The proposed algorithm is tested in-silico on a dataset generated with a state-of-art T1D patient simulator. MFs achieve a recall of 0.75 with about 1 false alarm every 5 days, outperforming other state-of-art algorithms proposed for the same purpose, including one based on a Random Forest classifier (RF). Moreover, when embedded as additional feature within a RF it improves the performance by granting a recall of 0.83 and 1 false alarm raised in 10 days. The encouraging outcomes in the simulated scenario pave the way for future investigations involving real-world data, as well as potential enhancements in detecting different types of sensors' failures.
暂无评论