Search Results - Inner Mongolia University Library

2010 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, BMSB 2010

Authors: Yan, Qing; Xu, Yi; Yang, Xiaokang; Traversoni, Leonardo. Affiliations: Institute of Image Communication and Information Processing, Shanghai Key Lab. of Digital Media Processing and Transmission, Shanghai Jiao Tong University, Shanghai, China; Ciencias Básicas e Ingeniería, Mexico, Mexico

ISBN: (print) 9781424444625

Robust foreground detection is a fundamental precursor of many video processing applications. Although various approaches have been proposed, many factors still make detection very challenging: 1) dynamic backgrounds with gradual brightness changes, camera movement, and large amounts of noise; 2) sharp illumination changes caused by shadows, lights switching on and off, and so on; 3) the real-time requirements of practical systems. To overcome these problems, this paper proposes a new approach. It builds on the background model of the conventional Gaussian Mixture Model (GMM) and incorporates temporal-spatial consistency validation to find genuine foreground seeds, so that foreground segments can be reliably acquired using a region-growing method. Experiments demonstrate that the approach outperforms the conventional GMM approach in detection accuracy, adaptability to sudden illumination changes, and computation time.

Keywords: Gaussian distribution
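
The seed-then-grow idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: a single per-pixel Gaussian stands in for a full GMM, and the function names, the `k` and `min_neighbors` parameters, and the exact consistency checks are all assumptions made for illustration.

```python
from collections import deque


def candidate_mask(frame, mean, var, k=2.5):
    """Flag pixels more than k standard deviations away from a single
    per-pixel Gaussian background model (assumed stand-in for a GMM)."""
    return [[abs(p - m) > k * (v ** 0.5) for p, m, v in zip(fr, mr, vr)]
            for fr, mr, vr in zip(frame, mean, var)]


def find_seeds(curr, prev, min_neighbors=3):
    """Keep candidates that were also candidates in the previous frame
    (temporal check) and have enough candidate 8-neighbours (spatial check)."""
    h, w = len(curr), len(curr[0])
    seeds = []
    for y in range(h):
        for x in range(w):
            if not (curr[y][x] and prev[y][x]):
                continue
            n = sum(curr[j][i]
                    for j in range(max(0, y - 1), min(h, y + 2))
                    for i in range(max(0, x - 1), min(w, x + 2))
                    if (j, i) != (y, x))
            if n >= min_neighbors:
                seeds.append((y, x))
    return seeds


def grow_regions(curr, seeds):
    """Breadth-first region growing from the seeds over 4-connected
    candidate pixels; unreachable candidates are treated as noise."""
    h, w = len(curr), len(curr[0])
    fg = [[False] * w for _ in range(h)]
    for y, x in seeds:
        fg[y][x] = True
    queue = deque(seeds)
    while queue:
        y, x = queue.popleft()
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            j, i = y + dy, x + dx
            if 0 <= j < h and 0 <= i < w and curr[j][i] and not fg[j][i]:
                fg[j][i] = True
                queue.append((j, i))
    return fg
```

In this sketch an isolated candidate pixel never becomes foreground, because it fails the seed checks and is not connected to any seed.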

SVC bitstream extraction based on the importance of MGS slice

2010 2nd International Conference on Industrial and Information Systems, IIS 2010

Authors: Liu, Wei; Guo, Sen; Xu, Chen; Jihong, Zhang. Affiliations: Institute of Information Technology, Shenzhen Key Laboratory of Visual Media Processing and Transmission, Shenzhen Institute of Information Technology, Shen Zhen, China; College of Information Engineering, Shen Zhen University, Shen Zhen, China

ISBN: (print) 9781424482177

Medium Grain Scalable (MGS) coding is widely used as the quality scalable video coding (SVC) method. In this paper, we study a bitstream extractor for MGS-based bitstreams. In MGS, the coded data corresponding to a quantization step size can be fragmented into at most 15 sublayers, which are called MGS slices. The contribution of each slice to the video quality is evaluated by our proposed algorithm, and based on the importance of the MGS slices, an optimal extraction is proposed to obtain the best video quality. Compared to the conventional SVC method, the proposed method can both expand the range of supported bitrates and improve quality. © 2010 IEEE.

Keywords: Scalable video coding
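
Importance-driven extraction as described in the abstract above can be sketched as a greedy selection under a byte budget. The abstract does not say how slice importance is computed, so the `gain` field below is a hypothetical score supplied by the caller, and the rule that a frame's slices must be kept in `quality_id` order is an assumption about sub-layer dependency rather than a statement of the authors' algorithm.

```python
import heapq
from dataclasses import dataclass


@dataclass
class Slice:
    frame: int        # frame index
    quality_id: int   # sub-layer index within the frame
    size: int         # size in bytes, must be positive
    gain: float       # hypothetical importance score


def extract(slices, budget):
    """Greedily keep slices with the best gain per byte, taking each
    frame's slices in quality_id order, until the budget is exhausted."""
    per_frame = {}
    for s in slices:
        per_frame.setdefault(s.frame, []).append(s)
    for lst in per_frame.values():
        lst.sort(key=lambda s: s.quality_id)

    # Max-heap on gain/size, holding only the next eligible slice per frame.
    heap = [(-lst[0].gain / lst[0].size, f, 0) for f, lst in per_frame.items()]
    heapq.heapify(heap)

    kept, used = [], 0
    while heap:
        _, f, idx = heapq.heappop(heap)
        s = per_frame[f][idx]
        if used + s.size > budget:
            continue  # does not fit; later slices of this frame are dropped too
        kept.append(s)
        used += s.size
        if idx + 1 < len(per_frame[f]):
            nxt = per_frame[f][idx + 1]
            heapq.heappush(heap, (-nxt.gain / nxt.size, f, idx + 1))
    return kept, used
```

Ranking by gain per byte rather than raw gain is one common heuristic for budgeted selection; other rankings would fit the same skeleton.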

Error control for MGS-based SVC bitstream

2010 International Conference on Multimedia Technology, ICMT 2010

Authors: Chen, Xu; Zhang, Jihong; Liu, Wei; Liang, Yongsheng. Affiliations: College of Information Engineering, Shen Zhen University, Shen Zhen, China; Institute of Information Technology, Shenzhen Key Laboratory of Visual Media Processing and Transmission, Shenzhen Institute of Information Technology, Shen Zhen, China

ISBN: (print) 9781424478743

Error control is an important aspect of video coding and transmission. In this paper, we study error control for MGS-based bitstreams. Our method not only duplicates the key data of an MGS slice to improve the quality of the decoded video, but also constructs a compensated MGS slice to maintain bitstream conformance. Experiments show better results and better performance for the proposed method compared to the conventional SVC method. © 2010 IEEE.

Keywords: Scalable video coding

Image Fusion Based on NSCT and Fuzzy Logic

International Conference on Multimedia Technology

Authors: Xianyi Ren; Yijun Zheng; Tao Hu; Jihong Zhang. Affiliations: Shenzhen Key Laboratory of Visual Media Processing and Transmission, Shenzhen Institute of Information Technology, Shenzhen, China; College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China

The nonsubsampled contourlet transform (NSCT) can provide flexible multiresolution, anisotropy, and directional expansion for images. Compared with the original contourlet transform, it is shift-invariant and can overcome the pseudo-Gibbs phenomena around singularities. Fuzzy logic is an efficient intelligent method for handling uncertain information. In this paper, a novel image fusion algorithm based on the NSCT and fuzzy logic is proposed. Extensive experiments show that the proposed method improves subjective and objective results compared to some other fusion approaches.

Keywords: Fuzzy logic; Image fusion; Algorithm design and analysis; Discrete wavelet transforms; Fuzzy sets

Novel Quality Measures for Image Fusion Based on Structural Similarity and Visual Attention Mechanism

International Conference on Multimedia Technology

Authors: Xianyi Ren; Xiujian Liu; Tao Hu; Jihong Zhang. Affiliations: Shenzhen Key Laboratory of Visual Media Processing and Transmission, Shenzhen Institute of Information Technology, Shenzhen, China; College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, China

A novel objective quality measure for image fusion based on structural similarity and the visual attention mechanism (VAM) is presented. By giving higher weight to the salient areas in the input images, the quality measure can estimate how much visually meaningful information is preserved in the fused image. Correlation analysis between the objective measure and subjective evaluation showed that our measures are more consistent with human subjective evaluation.

Keywords: Visualization; Image fusion; Humans; Correlation; Indexes; Q measurement
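
A saliency-weighted structural-similarity score of the kind the abstract above describes might look like the sketch below. It is an assumed formulation, not the authors' measure: the per-window saliency values are taken as given (the abstract does not specify how visual attention is computed), and the weighting scheme and function names are illustrative only.

```python
def ssim(x, y, dynamic_range=255.0):
    """Structural similarity of two equal-length pixel windows, computed
    from their means, variances, and covariance with the usual constants."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    vx = sum((a - mx) ** 2 for a in x) / n
    vy = sum((b - my) ** 2 for b in y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    c1 = (0.01 * dynamic_range) ** 2
    c2 = (0.03 * dynamic_range) ** 2
    return ((2 * mx * my + c1) * (2 * cov + c2)) / (
        (mx * mx + my * my + c1) * (vx + vy + c2))


def fusion_quality(windows_a, windows_b, windows_f, sal_a, sal_b):
    """Average, over windows, of the SSIM between each input and the fused
    image, weighted towards the more salient input and the more salient windows."""
    num = den = 0.0
    for wa, wb, wf, sa, sb in zip(windows_a, windows_b, windows_f, sal_a, sal_b):
        total = sa + sb
        if total == 0:
            continue  # neither input is salient here; window is ignored
        lam = sa / total
        local = lam * ssim(wa, wf) + (1 - lam) * ssim(wb, wf)
        num += total * local
        den += total
    return num / den if den else 0.0
```

A fused image that reproduces whichever input is salient in each window scores 1.0 under this sketch, while one that reproduces the non-salient input scores lower.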

SVC bitstream extraction based on the importance of MGS slice

International Conference on Industrial and Information Systems, IIS

Authors: Liu Wei; Guo Sen; Chen Xu; Zhang Jihong. Affiliations: Institute of Information Technology, Shenzhen Key Laboratory of Visual Media Processing and Transmission, Shenzhen Institute of Information Technology, Shenzhen, China; College of Information Engineering, Shenzhen University, Shenzhen, China

Medium Grain Scalable (MGS) coding is widely used as the quality scalable video coding (SVC) method. In this paper, we study a bitstream extractor for MGS-based bitstreams. In MGS, the coded data corresponding to a quantization step size can be fragmented into at most 15 sub-layers, which are called MGS slices. The contribution of each slice to the video quality is evaluated by our proposed algorithm, and based on the importance of the MGS slices, an optimal extraction is proposed to obtain the best video quality. Compared to the conventional SVC method, the proposed method can both expand the range of supported bitrates and improve quality.

Keywords: Quantum cascade lasers; Variable speed drives; Decoding; Static VAr compensators; Encoding; PSNR

ReAL: Improving Image-Text Retrieval with Authentic Negative Repository Learning

ACM Transactions on Multimedia Computing, Communications, and Applications, 1000

Authors: Renjie Pan; Hua Yang; Xiangyu Zhao. Affiliations: Institute of Image Communication and Network Engineering, Shanghai Key Lab of Digital Media Processing and Transmission, Shanghai Jiao Tong University, China; Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University, China

Current methods for image-text retrieval commonly propose various fusion modules to achieve robust visual-textual alignment, primarily relying on in-batch learning to guide the matching process. Some follow-up methods seek to enlarge the number of negative samples to boost image-text contrastive learning. However, these methods often face challenges posed by semantic-consistent negatives, i.e., negative samples that share correspondence with the ground truth, leading to confusion in learning cross-modal semantics. To address this issue, we propose a novel Retrieve with Authentic negative repository Learning (ReAL) method, which constructs a specific Authentic Negative Repository filled with valuable negative sample pairs. By introducing a Unique Negative Filter with a Discriminative Triplet Ranking Loss, ReAL effectively filters out the semantic-consistent negatives through similarity distribution analysis and threshold learning. Moreover, existing fusion paradigms suffer from intricate use of fine-grained representations from word- and region-level instances to progressively refine the fused embedding. In this paper, we propose a lightweight Cluster Refinement Module to exploit cross-modal semantics in a 1-way-1-out paradigm. Each visual-textual alignment can spontaneously uncover correlations with adjacent alignments through aggregation and re-allocation, without the need for a redundant and cost-inefficient refinement stage. Furthermore, ReAL employs dual momentum encoders with two memory banks, expanding the selection range of the Authentic Negative Repository to include a broader set of negatives. Extensive experiments conducted on Flickr30K, MS-COCO, and the augmented Flickr30K (with more hard negatives) demonstrate the superiority and robustness of ReAL, while also showcasing its significantly reduced inference time compared to other competitive baselines.

Keywords: Image-text Retrieval; Authentic Negative Repository; Cross-modal Fusion
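
The idea of filtering semantic-consistent negatives before applying a triplet ranking loss, as described in the abstract above, can be sketched as follows. This is a simplified illustration, not ReAL itself: the abstract says the threshold is learned from the similarity distribution, whereas the sketch uses a fixed `threshold`, and the function name and default values are assumptions.

```python
def filtered_triplet_loss(pos_sim, neg_sims, margin=0.2, threshold=0.8):
    """Hinge-style triplet ranking loss on the hardest remaining negative,
    after discarding negatives whose similarity reaches the threshold
    (treated here as likely semantic-consistent with the ground truth)."""
    kept = [s for s in neg_sims if s < threshold]
    if not kept:
        return 0.0  # no trustworthy negatives left for this query
    hardest = max(kept)
    return max(0.0, margin + hardest - pos_sim)
```

Without the filter, a negative that actually matches the query would dominate the loss and push a correct pairing apart, which is the confusion the abstract refers to.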
