A stochastic binary-ternary (SBT) quantization approach is introduced for communication efficient federated computation;form of collaborative computing where locally trained models are exchanged between institutes. Co...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
A stochastic binary-ternary (SBT) quantization approach is introduced for communication efficient federated computation;form of collaborative computing where locally trained models are exchanged between institutes. Communication of deep neural network models could be highly inefficient due to their large size. This motivates model compression in which quantization is an important step. Two well-known quantization algorithms are binary and ternary quantization. The first leads into good compression, sacrificing accuracy. The second provides good accuracy with less compression. To better benefit from trade-off between accuracy and compression, we propose an algorithm to stochastically switch between binary and ternary quantization. By combining with uniform quantization, we further extend the proposed algorithm to a hierarchical method which results in even better compression without sacrificing the accuracy. We tested the proposed algorithm using Neural network Compression Test Model (NCTM) provided by MPEG community. Our results demonstrate that the hierarchical variant of the proposed algorithm outperforms other quantization algorithms in term of compression, while maintaining the accuracy competitive to that provided by other methods.
Because visual information is increasingly being sent in digital image format, identifying noisy data becomes a prevalent difficulty in many research and application sectors. To reduce noise while maintaining image in...
详细信息
Recent advances in image and video creation, especially AI-based image synthesis, have led to the production of numerous visual scenes that exhibit a high level of abstractness and diversity. Consequently, Visual Stor...
详细信息
ISBN:
(纸本)9798891760615
Recent advances in image and video creation, especially AI-based image synthesis, have led to the production of numerous visual scenes that exhibit a high level of abstractness and diversity. Consequently, Visual Storytelling (VST), a task that involves generating meaningful and coherent narratives from a collection of images, has become even more challenging and is increasingly desired beyond real-world imagery. While existing VST techniques, which typically use autoregressive decoders, have made significant progress, they suffer from low inference speed and are not well-suited for synthetic scenes. To this end, we propose a novel diffusion-based system DIFFUVST, which models the generation of a series of visual descriptions as a single conditional denoising process. The stochastic and non-autoregressive nature of DIFFUVST at inference time allows it to generate highly diverse narratives more efficiently. In addition, DIFFUVST features a unique design with bi-directional text history guidance and multimodal adapter modules, which effectively improve inter-sentence coherence and image-to-text fidelity. Extensive experiments on the story generation task covering four fictional visual-story datasets demonstrate the superiority of DIFFUVST over traditional autoregressive models in terms of both text quality and inference speed.
Practical and ethical dataset collection remains a challenge blocking many empirical methods in natural language processing, resulting in a lack of benchmarks or data on which to test hypotheses. We propose a solution...
详细信息
Region-based active contours give interesting results when applied for objects detection with poor contrasting edges and noise, but for medical image segmentation, where objects of interest are often present with inho...
详细信息
stochastic computing is a low-cost non-standard computer architecture that processes pseudo-random bitstreams. Its effectiveness, and that of other probabilistic methods, requires maintaining desired levels of correla...
详细信息
Regularization is one of the most fundamental topics in optimization, statistics and machine learning. To get sparsity in estimating a parameter u ∈ Rd, an q penalty term, uq, is usually added to the objective functi...
详细信息
This study explores the application of computer vision technology to improve the efficiency of maintaining cleanliness and order in high-traffic commercial areas. Given the growing importance of automation in managing...
详细信息
Current research on multi-label image classification mainly focuses on exploring the correlation between labels to improve the classification accuracy of multi-label images. However, in the existing methods, the label...
详细信息
Opinion-unaware no-reference (OU-NR) methods for image quality assessment (IQA) are of great interest since they can predict visual quality independent of a reference image and knowledge of human quality opinions. Mod...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
Opinion-unaware no-reference (OU-NR) methods for image quality assessment (IQA) are of great interest since they can predict visual quality independent of a reference image and knowledge of human quality opinions. Models of image naturalness trained on a corpus of pristine images have shown potential for developing OU-NR methods. However, the extracted features may not match the preferences of the human visual system (HVS). This paper aims to utilize the features of convolutional neural networks to achieve a richer representation of the naturalness space. In addition, the IQA processing steps from training to quality measurement are revisited and the naturalness model is improved by incorporating HVS-inspired criteria. Experimental results show the higher performance and generalizability of the naturalness model - constructed using HVS-aligned deep features - under different distortion types and image contents. The source code of the quality index is available at https://***/saeedmp/dni.
暂无评论