Distributed video coding (DVC) is a novel video coding paradigm. One approach to DVC is Wyner-Ziv distributed video coding. The accuracy of the correlation noise model can influence the performance of the video coder ...
详细信息
Distributed video coding (DVC) is a novel video coding paradigm. One approach to DVC is Wyner-Ziv distributed video coding. The accuracy of the correlation noise model can influence the performance of the video coder directly. In order to enhance the accuracy of the distribution model, EM algorithm based mixture Laplace-uniform distribution model and basic Laplace-uniform distribution model for DCT alternating current coefficients are established. Then the model is selected adaptively using fuzzy inference. Experimental results suggest that the proposed mixture correlation noise model can describe the heavy tail and sudden change of the noise accurately at high rate and make significant improvement on the coding efficiency compared with the DISCOVER's noise model. Meanwhile, fuzzy inference based adaptive noise model selection method can reduce the operation complexity to some extent, while not influencing rate-distortion performance.
For learning-based super-resolution reconstruction, the selection and training of dictionary play an important role in improving image reconstruction quality. A super-resolution algorithm based on two dictionary-pairs...
详细信息
The development of intelligent methods capable of predicting protein-ligand binding sites has become a popular research field. Recently, deep learning based methods have been proposed as a promising solution for this ...
详细信息
The development of intelligent methods capable of predicting protein-ligand binding sites has become a popular research field. Recently, deep learning based methods have been proposed as a promising solution for this task. However, some limitations still exist. For example, the network structure is not optimized for predicting protein binding pockets, which limits the model's capabilities. To address the aforementioned challenges, a novel method called CATransUnetLPB is proposed, in which a new network structure named CATransUnet is designed. The proposed CATransUnet combines CNN and Transformer models to accurately segment binding pocket regions from protein 3D structures. It outperforms existing representative methods on three test sets, demonstrating the effectiveness of optimizing the deep network model for detecting protein ligand binding pockets. Furthermore, we conduct thorough analysis on applying data augmentation to protein data structure and confirm that such technique can enhance the model's generalization ability, thereby ensuring good performance on new protein structures. Moreover, experiments show that the predicted binding pockets from our model can complement the results obtained from other methods. This suggests that integrating our method with existing approaches could further improve the prediction of protein-ligand binding pockets.
In this study, a low complexity frame-rate up conversion method using compressed domain information for H.264 decoder is proposed. In the proposed scheme, the motion vectors (MVs) are estimated using constant accele...
详细信息
In this study, a low complexity frame-rate up conversion method using compressed domain information for H.264 decoder is proposed. In the proposed scheme, the motion vectors (MVs) are estimated using constant acceleration motion model, and the MVs regarded as no credibility are corrected, and the interpolation method is applied on the basis of the macroblock (MB) coded types. Applied to the H.264 decoder, the proposed method provides high quality interpolation frames and an obvious decrease of the block artifacts.
To solve the super-resolution reconstruction problem for single-frame image, an algorithm based on sparse representation and nonlocal regularization is proposed. By training the joint dictionaries, this algorithm look...
详细信息
Scanpath prediction for omnidirectional images aims to effectively simulate the human visual perception mechanism to generate dynamic realistic fixation trajectories. However, the majority of scanpath prediction metho...
详细信息
Scanpath prediction for omnidirectional images aims to effectively simulate the human visual perception mechanism to generate dynamic realistic fixation trajectories. However, the majority of scanpath prediction methods for omnidirectional images are still in their infancy as they fail to accurately capture the time-dependency of viewing behavior and suffer from sub-optimal performance along with limited generalization capability. A desirable solution should achieve a better trade-off between prediction performance and generalization ability. To this end, we propose a novel dual-temporal modulation scanpath prediction (ScanDTM) model for omnidirectional images. Such a model is designed to effectively capture long-range time-dependencies between various fixation regions across both internal and external time dimensions, thereby generating more realistic scanpaths. In particular, we design a Dual Graph Convolutional Network (Dual-GCN) module comprising a semantic-level GCN and an image-level GCN. This module servers as a robust visual encoder that captures spatial relationships among various object regions within an image and fully utilizes similar images as complementary information to capture similarity relations across relevant images. Notably, the proposed Dual-GCN focuses on modeling temporal correlations from both local and global perspectives within the internal time dimension. Furthermore, drawing inspiration from the promising generalization capabilities of diffusion models across various generative tasks, we introduce a novel diffusion-guided saliency module. This module formulates the prediction issue as a conditional generative process for the saliency map, utilizing extracted semantic-level and image-level visual features as conditions. With the well-designed diffusion-guided saliency module, our proposed ScanDTM model acting as an external temporal modulator, we can progressively refine the generated scanpath from the noisy map. We conduct extensive expe
Conventional single-chip digital cameras use color filter arrays(CFA) to sample different spectral components. image demosaicing is a problem of interpolating these data to complete red, green, and blue values for eac...
详细信息
ISBN:
(纸本)9781467321969
Conventional single-chip digital cameras use color filter arrays(CFA) to sample different spectral components. image demosaicing is a problem of interpolating these data to complete red, green, and blue values for each image pixel, to produce an RGB image. Many color demosaicing(CDM) methods assume that the high local spatial redundancy exists among the color samples. Such an assumption, however, may be fail for images with high color saturation and sharp color transitions. This paper presents an adaptive demosaicing algorithm by exploiting both the non-local similarity and the local correlation(NLS-LC) in the color filter array image. First, the most flattest nonlocal image patches are searched in the searching window centered on the estimated pixel. Second, the patch, which is the most similar to the current patch, is selected among the most smoothest nonlocal patches. Third, according to the similar degree and the local correlation degree, the obtained nonlocal image patch and the current patch are adaptively chosen to estimate the missing color samples. Experimental results indicate that the proposed method exhibits superior performance over many state-of-the-art color interpolation methods.
Two-dimensional (2-D) array sets with good 2-D correlation properties have received considerable attention in wireless communication systems. This paper focuses on 2-D Z-complementary array code sets (ZCACSs), which h...
详细信息
Action recognition is an important topic in computer vision and most current work focuses on view-dependent representations. In this paper, we develop a novel free viewpoint action recognition based on Self-similarity...
详细信息
ISBN:
(纸本)9781467321969
Action recognition is an important topic in computer vision and most current work focuses on view-dependent representations. In this paper, we develop a novel free viewpoint action recognition based on Self-similarity matrix (SSM), which tends to be stable across views. We choose Local Self-similarity (LSS) descriptor as our low-level feature, then SSM is calculated by computing the similarity between any pair of frame features. Each video sequence is represented using a diagonal descriptor vector extracted from the SSM. Support Vector Machines (SVM) is employed for classification. The encouraging experimental results on the public IXMAS multi-view data set demonstrate effectiveness of the proposed method.
In Wireless Mesh Network,multimedia applications require the network to guarantee Quality-of-Service(QoS). A new QoS-aware routing protocol based on DSR named QDSR(QoS-DSR) is further proposed. QDSR guarantees the QoS...
详细信息
In Wireless Mesh Network,multimedia applications require the network to guarantee Quality-of-Service(QoS). A new QoS-aware routing protocol based on DSR named QDSR(QoS-DSR) is further proposed. QDSR guarantees the QoS of application, such as bandwidth and delay, defines routing cost function according to the number of hops of path and buffer and chooses the best path based on routing cost. Simulation results show that, compared with DSR, QDSR greatly improves the throughput, reduces the average end to end delay, improves the efficiency, satisfies the QoS of application better and has stronger applicability and expansibility.
暂无评论