检索结果-内蒙古大学图书馆

Data Compression Conference (DCC)

作者： Yin, Shanzhi Chen, Bolin Wang, Shiqi Ye, Yan City Univ Hong Kong Hong Kong Peoples R China Alibaba Grp Hangzhou Peoples R China

ISBN: (纸本)9798350385885;9798350385878

generative face video coding (GFVC) can achieve high-quality visual face communication at ultra-low bit-rate ranges via strong facial prior learning and realistic generation. However, different kinds of feature representations hinder the interoperability of GFVC, as the bitstream generated from one type of feature representation can only be correctly understood by the corresponding decoder. In this paper, we make the first attempt to propose a face feature transcoding framework that enables translatability in GFVC. By integrating a face feature transcoder at the decoder side, received face features can be translated to decoder-specific ones for subsequent face reconstruction. Furthermore, the translation between different types of face features can be achieved using a unified transcoding framework, facilitating seamless interoperability between different facial representations and their associated decoders. Experimental results demonstrate that three main-stream GFVC codecs, each utilizing different face features, can be effectively adapted to one another while retaining promising coding performance, largely extending the generality of the GFVC system. The project page can be found at https://***/xyzysz/GFVC_Software-Decoder_Interoperability.

关键词： Decoding interoperability face video generative coding

来源：评论

学校读者我要写书评

暂无评论

Light-weighted Temporal Evolution Inference for generative Face Video Compression 26

Light-weighted Temporal Evolution Inference for Generative F...

引用

26th International Workshop on Multimedia Signal Processing

作者： Zhang, Zihan Chen, Bolin Yin, Shanzhi Wang, Shiqi Ye, Yan City Univ Hong Kong Hong Kong Peoples R China Alibaba Grp Sunnyvale CA USA

ISBN: (纸本)9798350387261;9798350387254

Recently, generative Face Video Compression (GFVC) has advanced the concept of Model-based coding (MBC) with promising rate-distortion performance relying on the strong inference capabilities of deep generative models. In particular, GFVC can capture temporal evolution of face video using compact representations (i.e., 2D/3D key-points, facial semantics, compact feature), thus achieving the quality and bandwidth trade-offs for ultra-low bit-rate communication. However, there remains an unaddressed challenge, i.e., the existing GFVC models are not light-weighted and low-latency enough for practical applications. To address these obstacles, this paper proposes a practical lightweight scheme based on the Compact Feature Temporal Evolution (CFTE) model, which aims to provide insights into practical deployments and efficient inference. Specifically, the lightweight network architecture is built with depth-wise convolutions and Inverted Residual Blocks to lower the computational complexity. Moreover, a feature-level knowledge distillation is further introduced to improve the performance of lightweight student CFTE model. Experimental results demonstrate that our proposed lightweight GFVC model can achieve an obvious complexity reduction, whilst maintaining competitive rate-distortion performance.

关键词： generative coding light-weighted model efficient inference

来源：评论

学校读者我要写书评

暂无评论

Overview of intelligent video coding: from model-based to learning-based approaches

引用

Visual Intelligence 2023年第1期1卷 1-19页

作者： Ma, Siwei Gao, Junlong Wang, Ruofan Chang, Jianhui Mao, Qi Huang, Zhimeng Jia, Chuanmin National Engineering Research Center of Visual Technology School of Computer Science Peking University Beijing 100871 China State Key Laboratory of Media Convergence and Communication Communication University of China Beijing 100024 China Wangxuan Institue of Computer Technology Peking University Beijing 100871 China

Intelligent video coding (IVC), which dates back to the late 1980s with the concept of encoding videos with knowledge and semantics, includes visual content compact representation models and methods enabling structural, detailed descriptions of visual information at different granularity levels (i.e., block, mesh, region, and object) and in different areas. It aims to support and facilitate a wide range of applications, such as visual media coding, content broadcasting, and ubiquitous multimedia computing. We present a high-level overview of the IVC technology from model-based coding (MBC) to learning-based coding (LBC). MBC mainly adopts a manually designed coding scheme to explicitly decompose videos to be coded into blocks or semantic components. Thanks to emerging deep learning technologies such as neural networks and generative models, LBC has become a rising topic in the coding area. In this paper, we first review the classical MBC approaches, followed by the LBC approaches for image and video data. We also discuss and overview our recent attempts at neural coding approaches, which are inspiring for both academic research and industrial implementation. Some critical yet less studied issues are discussed at the end of this paper. © The Author(s) 2023.

关键词： Multimedia communication Video compression Artificial intelligence Model-based coding generative coding Learning-based coding

来源：评论

学校读者我要写书评

暂无评论

"Value-adding" Analysis: Doing More With Qualitative Data

引用

INTERNATIONAL JOURNAL OF QUALITATIVE METHODS 2020年 19卷 1-13页

作者： Eakin, Joan M. Gladstone, Brenda Univ Toronto Dalla Lana Sch Publ Hlth Toronto ON Canada Univ Toronto Ctr Crit Qualitat Hlth Res Dalla Lana Sch Publ Hlth Toronto ON Canada

Much qualitative research produces little new knowledge. We argue that this is largely due to deficits of analysis. Researchers too seldom venture beyond cataloguing data into pre-existing concepts and scouting for "themes," and fail to exploit the distinctive powers of insight of qualitative methodology. The paper introduces a "value-adding" approach to qualitative analysis that aims to extend and enrich researchers' analytic interpretive practices and enhance the worth of the knowledge generated. We outline key features of this form of analysis, including how it is constituted by principles of interpretation, contextualization, criticality, and the "creative presence" of the researcher. Using concrete examples from our own research, we describe some analytic "devices" that can free up and stretch a researcher's analytic capacities, including putting reflexivity to work, treating everything as data, reading data for what is invisible, anomalous and "gestalt," engaging in "generative" coding, deploying heuristics for theorizing, and recognizing writing as a key analytic activity. We argue that at its core, value-adding analysis is a scientific craft rather than a scientific formula, a creative assemblage of reality rather than a procedural determination of it. The researcher is the primary generative and synthesizing mechanism for transforming empirically observed data into the key products of qualitative research-concepts, accounts and explanations. The ultimate value of value-adding analysis resides in its ability to generate new knowledge, including not just the "discovery" of things heretofore unknown but also the re-conceptualization of what is already known, and, importantly, the reframing and reconstitution of the research problem.

关键词： qualitative data analysis adding value critical interpretation generative coding creative presence of researcher writing as analysis

来源：评论

学校读者我要写书评

暂无评论

USAGE OF COMPUTER GENERATED IMAGERY IN VJ PERFORMANCE

USAGE OF COMPUTER GENERATED IMAGERY IN VJ PERFORMANCE

引用

作者： ANIL OZDEM Ihsan Dogramaci Bilkent University

学位级别：硕士

This thesis aims to study the usage of computer generated imagery in VJ Performance. The main workflow of VJ Practices has been divided into content, process, and output. The project includes the creation of computer generated visual material via generative/creative coding defined as content, VJ performance with these materials defined as process, and presentation of visuals via Video Projection Mapping Technology defined as output.

关键词： Audio Visual Art Computer Generated Imagery generative coding Live coding VJ Performance

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：