In the state-of-the-art saliency detection methods based on contrast priors, little attention is paid on the region smoothness constraints. The paper proposes a two-stage saliency detection method in which a smoothnes...
详细信息
ISBN:
(纸本)9783662456460;9783662456453
In the state-of-the-art saliency detection methods based on contrast priors, little attention is paid on the region smoothness constraints. The paper proposes a two-stage saliency detection method in which a smoothness prior is explicitly involved in a continuous Conditional Random Field (CRF). In stage one, we construct a continuous CRF based on the sparse codes of perceptual features on all locations, and minimize the energy of CRF to obtain discrimination maps. In stage two, we train a discriminative machine and learn the saliency maps from discrimination maps, aiming to take the human attention priors into consideration. Our experiments on MSRA-1000 show that the new method is effective against the state-of-the-art methods.
Colorization based coding is a technique which compresses a color image using the colorization method. The main issue in colorization based coding is to extract a good RP(representative pixel) set from the original co...
详细信息
ISBN:
(纸本)9783319117584;9783319117577
Colorization based coding is a technique which compresses a color image using the colorization method. The main issue in colorization based coding is to extract a good RP(representative pixel) set from the original color image from which the colored image can be reconstructed in the decoder to a sufficient level. In this paper, we propose an iterative sparse coding method for the extraction of the RP set. Observations show that the proposed method computes simultaneously the locally optimal RP set and the locally optimal Levin's colorization matrix. Furthermore, experimental results show that the proposed method provides better color image reconstruction and compression rate than conventional colorization based coding methods.
Neuromorphic computing has recently gained significant attention as a promising approach for developing energy-efficient, massively parallel computing systems inspired by the spiking behavior of the human brain and na...
详细信息
Neuromorphic computing has recently gained significant attention as a promising approach for developing energy-efficient, massively parallel computing systems inspired by the spiking behavior of the human brain and natively mapping spiking neural networks (SNNs). Effective training algorithms for SNNs are imperative for increased adoption of neuromorphic platforms;however, SNN training continues to lag behind advances in other classes of ANN. In this paper, we reduce this gap by proposing an innovative encoder-decoder technique that leverages sparse coding and the locally competitive algorithm (LCA) to provide an algorithm specifically designed for neuromorphic platforms. Using our proposed Dataset-Scalable Exemplar LCA-Decoder we reduce the computational demands and memory requirements associated with training SNNs using error backpropagation methods on increasingly larger training sets. We offer a solution that can be scalably applied to datasets of any size. Our results show the highest reported top-1 test accuracy using SNNs on the ImageNet and CIFAR100 datasets, surpassing previous benchmarks. Specifically, we achieved a record top-1 accuracy of 80.75% on ImageNet (ILSVRC2012 validation set) and 79.32% on CIFAR100 using SNNs.
Impulse components in vibration signals are important indicators of machinery health states. sparse coding (SC) is regarded as an efficient impulse feature extraction method, but it cannot extract the weak impulse fea...
详细信息
ISBN:
(纸本)9781479979585
Impulse components in vibration signals are important indicators of machinery health states. sparse coding (SC) is regarded as an efficient impulse feature extraction method, but it cannot extract the weak impulse features in vibration signals with heavy background noises. In this paper, a fusion sparse coding (FSC) method is proposed to extract impulse components effectively. Firstly, several sparse coding algorithms are executed in parallel independently as participating algorithms. Then, fusion scheme of different sparse coding algorithms is presented to improve the accuracy of sparse signal reconstruction. Lastly, the proposed method is used to process aircraft engine rotor vibration signals compared with other feature extraction approaches. Experiment result shows FSC method can extract impulse features accurately from heavy noisy vibration signal, and it provides great significance for machinery weak fault detection and diagnosis.
In this work we focus on the problem of estimating time-varying sparse signals from a sequence of under-sampled observations. We formulate this problem as estimating hidden states in a dynamic model and exploit the un...
详细信息
ISBN:
(纸本)9781479928934
In this work we focus on the problem of estimating time-varying sparse signals from a sequence of under-sampled observations. We formulate this problem as estimating hidden states in a dynamic model and exploit the underlying temporal structure to find a more accurate solution, particularly when the information in the observations is at scarce. We propose an optimization procedure based on smoothing proximal gradient method to estimate these hidden states. We show that the proposed model is efficient and more robust to the noise in the system.
This paper presents a comprehensive framework designed for the efficient processing and analysis of large-scale data through the integration of deep learning technologies. The framework leverages the capabilities of d...
详细信息
ISBN:
(纸本)9798400718212
This paper presents a comprehensive framework designed for the efficient processing and analysis of large-scale data through the integration of deep learning technologies. The framework leverages the capabilities of deep learning to enhance the accuracy and speed of data analysis, addressing the challenges posed by the sheer volume and complexity of modern datasets. It incorporates advanced deep learning models, including convolutional neural networks and sparse coding, to extract and analyze features from large datasets effectively. The framework is built upon a robust architecture that integrates with existing big data technologies like Hadoop, Spark, and TensorFlow, facilitating scalable and distributed data processing. Experimental results demonstrate significant improvements in performance analysis and prediction accuracy across various application domains, highlighting the framework's effectiveness in harnessing deep learning for large-scale data analysis. This study contributes to the field by proposing a scalable, efficient solution for data-driven decision-making and opens new avenues for research in deep learning applications for big data.
The emergence of newer hardware and limited accessibility to that hardware are driving the need to characterize application performance on commonly used architectures to gain valuable insights. While abundant performa...
详细信息
ISBN:
(纸本)9798350376975;9798350376968
The emergence of newer hardware and limited accessibility to that hardware are driving the need to characterize application performance on commonly used architectures to gain valuable insights. While abundant performance events create opportunities for detailed performance characterization, they also make the process overwhelming. This paper takes a systematic approach to address the problem of performance characterization by taking advantage of such opportunities. Specifically, this paper 1) characterizes the hardware usage behaviors of commonly used optimizers in the context of Recurrent Neural Network (RNN) architectures, 2) connects observations to identify problems and opportunities for optimization in those optimizers, and 3) creates a large performance dataset. Comparing the hardware resource usage behaviors of Gradient Descent (GS), AdaGrad (AdaGrad), and ADAM (ADAM) optimizers for three commonly used recurrent neural network-based deep learning architectures running on AMD MI100 GPUs, this paper identifies non-uniform memory access as one of the most frequent issues and suggests strategies for optimizing such problems.
The Vector Quantized Variational AutoEncoder (VQ-VAE) has shown great potential in image generation, especially the methods with hierarchical features. However, the lack of decoupling of structural information between...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
The Vector Quantized Variational AutoEncoder (VQ-VAE) has shown great potential in image generation, especially the methods with hierarchical features. However, the lack of decoupling of structural information between hierarchical features leads to semantic inconsistencies and redundant structural features, resulting in incompatible outputs. In this study, we propose the Adaptively Hierarchical Quantization Variational AutoEncoder (AHQ-VAE) to generate high-fidelity images with a unified structure. To ensure the semantic consistency of continuous space, we employ the Spatially Consistent Semantic Embedding (SCSE) module to align the hierarchical features, while decoupling global structural information and local details. To ensure the consistency of discrete space, we introduce the Adaptive Bottom Quantizer (ABQ) to generate the quantized bottom codes consistent with quantized top codes, so that the local details can adapt to the global semantics. Extensive experiments demonstrate our approach can generate high-quality images with a unified structure.
Fault diagnosis of aero-engine inter-shaft bearing under variable operating conditions poses a significant challenge in the industry. Existing sparse classification methods with shallow architectures suffer from insuf...
详细信息
ISBN:
(纸本)9798350374483;9798350374476
Fault diagnosis of aero-engine inter-shaft bearing under variable operating conditions poses a significant challenge in the industry. Existing sparse classification methods with shallow architectures suffer from insufficient fault feature extraction and interference removal capabilities with limited training samples, resulting in low diagnostic accuracies. To address this issue, this study introduces an approach termed deep sparse representation classification (DSRC). DSRC seamlessly integrates multiple layers for dictionary learning and sparse coding. In the initial phase, the dictionary learning layer is employed to acquire the Fisher discriminative sparse representation information, while the sparse coding layer is utilized to eliminate interfering components and simultaneously enhance sparsity. The incorporation of a weight matrix, guided by a high-energy atom selection strategy, links the upward and downward processes of dictionary learning and sparse coding. Subsequently, the frequency-weighted energy operator kurtosis-based feature vectors are extracted from the reconstructed signals of the newly acquired dictionary and coding coefficients. Ultimately, these discriminative feature vectors are directly input into a straightforward classifier for intelligent fault diagnosis. DSRC is applied to an aero-engine inter-shaft bearing fault data under multiple speeds. Results demonstrate that it can effectively realize discriminative fault feature extraction and high-precision automatic fault identification.
Compressed sensing (CS) exploiting inherent sparsity prior of signals has been proven effective for sparse-view computed tomography (CT) image reconstruction from undersampled projection data. However, most CS-based C...
详细信息
ISBN:
(纸本)9798350349405;9798350349399
Compressed sensing (CS) exploiting inherent sparsity prior of signals has been proven effective for sparse-view computed tomography (CT) image reconstruction from undersampled projection data. However, most CS-based CT studies focused on formulating different sparsity regularizers, e.g., total variation (TV) minimization, and neglect design of an incoherent sensing matrix - a key factor of CS performance. The sensing matrix formed by an incomplete set of Radon projections in CT typically exhibits large coherence. In this paper, we propose a novel method for optimizing the sensing matrix via preconditioning to improve CS-CT reconstruction. A well-conditioned preconditioner is designed to optimally reduce the coherence of the sensing matrix and thus improving the CS systems. The desired preconditioner is obtained by solving a nonconvex optimization problem via gradient descent method. The preconditioned systems solved by TV-based sparse recovery algorithms can provide better reconstruction accuracy with fewer measurements even in noisy settings. Evaluated on brain and COVID-19 chest CT datasets, the proposed method when used for preconditioning of Radon sensing matrix reconstructed images with substantially higher quality with faster speed than baselines without preconditioning.
暂无评论