Federated learning inherently provides a certain level of privacy protection, which is nevertheless inadequate in many real-world scenarios. Existing privacy-preserving methods frequently incur unbearable time overheads or non-negligible deterioration in model performance, and thus suffer from a tradeoff between performance and privacy. In this work, we propose a novel Federated Privacy-Preserving Knowledge Transfer framework, namely FedPPKT, which employs data-free knowledge distillation in a meta-learning manner to rapidly generate pseudo data and perform privacy-preserving knowledge transfer. FedPPKT establishes a protective barrier between the original private data and the federated model, thereby ensuring user privacy. Furthermore, its few-round strategy reduces the number of communication rounds, further mitigating the risk of privacy exposure for user data. With the help of the meta generator, uneven local label distributions across clients are alleviated, mitigating data heterogeneity and improving model performance. Experiments show that FedPPKT outperforms state-of-the-art privacy-preserving federated learning methods. Our code is publicly available at https://***/HIT-weiqb/FedPPKT.
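As a concrete illustration of generator-driven data-free knowledge distillation, the sketch below alternates a generator step (synthesizing pseudo data the teacher is confident about, with a class-balance term addressing uneven label distributions) and a student distillation step. All module interfaces, losses, and hyperparameters here are assumptions for illustration, not the FedPPKT implementation.

```python
# Minimal data-free distillation sketch, loosely following the idea above.
import torch
import torch.nn.functional as F

def distill_step(generator, teacher, student, g_opt, s_opt,
                 batch_size=64, z_dim=100, num_classes=10, temp=4.0):
    """One alternating step; `teacher` is assumed frozen (requires_grad=False)."""
    # 1) Generator step: pseudo data the teacher classifies confidently,
    #    plus a class-balance term (minimizing negative entropy of the
    #    mean prediction pushes toward a uniform class distribution).
    z = torch.randn(batch_size, z_dim)
    y = torch.randint(0, num_classes, (batch_size,))
    x_fake = generator(z, y)                 # assumed conditional generator
    t_logits = teacher(x_fake)
    ce = F.cross_entropy(t_logits, y)
    p_mean = F.softmax(t_logits, dim=1).mean(dim=0)
    balance = (p_mean * (p_mean + 1e-8).log()).sum()
    g_opt.zero_grad()
    (ce + balance).backward()
    g_opt.step()

    # 2) Student step: knowledge transfers through pseudo data only, so the
    #    original private data never touches the global model.
    x_fake = x_fake.detach()
    with torch.no_grad():
        t_logits = teacher(x_fake)
    s_logits = student(x_fake)
    kd = F.kl_div(F.log_softmax(s_logits / temp, dim=1),
                  F.softmax(t_logits / temp, dim=1),
                  reduction="batchmean") * temp * temp
    s_opt.zero_grad()
    kd.backward()
    s_opt.step()
```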
With the rapid development of video-on-demand (VOD) and real-time streaming video technologies, the accurate objective assessment of streaming video Quality of Experience (QoE) has become a focal point for optimizing streaming-related technologies. However, due to the inherent transmission distortions caused by poor Quality of Service (QoS) conditions in streaming videos, such as intermittent stalling, rebuffering, and drastic changes in video sharpness due to bitrate fluctuations, evaluating streaming video QoE presents numerous challenges. This paper introduces a large and diverse in-the-wild streaming video QoE evaluation dataset: the SJLIVE-1k dataset. It addresses the limitations of existing datasets, which lack in-the-wild video sequences captured under real network conditions and contain insufficient video content. Furthermore, we propose an end-to-end objective QoE evaluation strategy that extracts video content and QoS features from the video itself without using any extra information. By using self-supervised contrastive learning as a "reminder" that bridges the gap between the different types of features, our approach achieves state-of-the-art results across three datasets. Our proposed dataset will be released to facilitate further research.
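The "reminder" described above can be read as a symmetric InfoNCE objective between paired content and QoS embeddings of the same clip; the sketch below, with assumed feature shapes and projection already applied, shows that standard form. The paper's exact loss may differ.

```python
# Hedged sketch: contrastive bridging of content and QoS features.
import torch
import torch.nn.functional as F

def contrastive_bridge_loss(content_feat, qos_feat, temperature=0.07):
    """content_feat, qos_feat: (B, D) embeddings from the two branches."""
    c = F.normalize(content_feat, dim=1)
    q = F.normalize(qos_feat, dim=1)
    logits = c @ q.t() / temperature          # (B, B) similarity matrix
    targets = torch.arange(c.size(0), device=c.device)
    # Matching clip pairs sit on the diagonal; all others act as negatives.
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))
```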
Currently, with the extensive application of digital cameras in dynamic capturing, factors such as camera jitter, defocus, and target motion induce various types and degrees of image blurring. Deep learning (DL) is a powerful technique that offers data-adaptive recovery without prior characterization of deblurring filter kernels. However, end-to-end networks can still be improved to restore regions with severe localized blurring. Therefore, we propose a multi-scale circular transformer (MSC-Former) employing averaged neighborhood attention (AvgNA) to solve this problem. It computes the local attention of each feature pixel by learning the correlation between the center and the surrounding windowed neighborhood, then produces integrated attention with direct averaging. We employ a multiscale circular strategy (MSCS) to compute attention at different spatial scales, expanding the receptive field while maintaining a low parameter count. It defines neighborhoods at different scales with concentric circular regions of varying radii, which enlarges the receptive field during attention computation while capturing spatial continuity across larger neighborhoods. Experimental results demonstrate that the proposed method surpasses recent state-of-the-art deblurring techniques on the benchmark dataset.
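One plausible reading of AvgNA and MSCS is sketched below, with square windows standing in for the concentric circular regions: each pixel attends to its local neighborhood at several scales, and the per-scale outputs are integrated by direct averaging. Shapes, window sizes, and this interpretation of the averaging step are assumptions, not the authors' code.

```python
# Sketch of windowed neighborhood attention with multi-scale averaging.
import torch
import torch.nn.functional as F

def neighborhood_attention(q, k, v, window):
    """q, k, v: (B, C, H, W); each pixel attends to its window x window patch."""
    B, C, H, W = q.shape
    pad = window // 2
    # Gather every pixel's neighborhood of keys and values.
    k_n = F.unfold(k, window, padding=pad).view(B, C, window * window, H * W)
    v_n = F.unfold(v, window, padding=pad).view(B, C, window * window, H * W)
    q_c = q.view(B, C, 1, H * W)
    # Correlation between the center query and its neighborhood keys.
    attn = (q_c * k_n).sum(dim=1, keepdim=True) / (C ** 0.5)
    attn = attn.softmax(dim=2)
    return (attn * v_n).sum(dim=2).view(B, C, H, W)

def msc_avgna(q, k, v, windows=(3, 7, 11)):
    # Growing windows approximate circular regions of varying radii; the
    # per-scale attention outputs are integrated by direct averaging.
    outs = [neighborhood_attention(q, k, v, w) for w in windows]
    return torch.stack(outs).mean(dim=0)
```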
The imaging quality of automotive cameras is crucial in complex driving environments. Therefore, it is essential to conduct subjective experiments that realistically reflect drivers' evaluation of the imaging quality of automotive cameras in real traffic scenarios. To accurately assess the imaging quality of automotive cameras, this paper proposes a no-reference quality assessment method whose quality scores are highly consistent with human subjective perception. This study first constructs a new image quality assessment dataset and obtains subjective quality scores through subjective experiments. To construct the dataset, a variety of realistic props are used to simulate scene elements that an automotive camera might capture, and the scenes are photographed with a wide range of cameras differing in sensor type, lens focus, and viewing angle, resulting in a diverse set of images. The objective quality assessment method proposed in this paper consists of an object detection network and a multi-branch quality evaluation network. The object detection network identifies and classifies scene elements, while the multi-branch quality evaluation network performs feature extraction and score regression on the various types of elements to effectively evaluate the imaging quality of automotive cameras. In the experiments, this no-reference quality assessment method is tested on our built dataset, and the results show that the proposed method exhibits the best performance compared with state-of-the-art image quality assessment methods.
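The sketch below illustrates the two-stage design under stated assumptions: detected element crops are routed by class to per-type regression branches, and the element scores are fused into an overall image score (here by simple averaging, which is our assumption; the paper's fusion may differ, as may the backbone and branch designs).

```python
# Illustrative multi-branch quality head fed by a detector's crops.
import torch
import torch.nn as nn

class MultiBranchIQA(nn.Module):
    def __init__(self, num_element_types, feat_dim=512):
        super().__init__()
        self.backbone = nn.Sequential(          # shared feature extractor
            nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, feat_dim, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten())
        # One regression head per element type (e.g., sign, vehicle, lane).
        self.branches = nn.ModuleList(
            nn.Sequential(nn.Linear(feat_dim, 128), nn.ReLU(),
                          nn.Linear(128, 1))
            for _ in range(num_element_types))

    def forward(self, crops, element_types):
        """crops: (N, 3, H, W) element patches from the detection network;
        element_types: length-N class indices assigned by the detector."""
        feats = self.backbone(crops)
        scores = torch.stack([self.branches[int(t)](f)
                              for f, t in zip(feats, element_types)])
        return scores.mean()   # assumed fusion: average over elements
```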
Open vocabulary object detection (OVD), which detects novel categories through detectors trained on base categories, has achieved remarkable advancement attributable to large-scale vision-language models such as CLIP. Prior OVD works mainly focus on improving the classification accuracy of proposals while neglecting localization ability for novel categories. In this work, we propose IoU-aware language-image model tuning (IoU-CLIP) for open vocabulary object detection. Specifically, we construct a region image dataset with varying IoU values and adopt these IoU values as labels to fine-tune the CLIP model, learning IoU-aware, class-agnostic semantic prompts and visual embeddings. The fine-tuned IoU-CLIP can predict IoU scores for proposals, which interact with the classification scores. Meanwhile, the IoU-aware, class-agnostic visual embeddings are utilized for box regression to enhance the generalization of the localization capability. We evaluate our method on the COCO and LVIS OVD benchmarks, outperforming the baseline (RegionCLIP) by 5.5% AP50 and 5.8% AP on novel categories, respectively, achieving state-of-the-art performance.
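At inference, the predicted IoU score can interact with the classification score, for example through the geometric-mean fusion sketched below; the fusion form and the weight alpha are assumptions rather than the paper's exact rule.

```python
# Hedged sketch of classification/IoU score interaction per proposal.
import torch

def fuse_scores(cls_scores, iou_scores, alpha=0.5):
    """cls_scores: (N, K) per-class probabilities for N proposals;
    iou_scores: (N, 1) localization quality predicted by IoU-CLIP."""
    eps = 1e-6  # avoid 0 ** alpha edge cases
    return (cls_scores.clamp_min(eps) ** alpha) * \
           (iou_scores.clamp_min(eps) ** (1.0 - alpha))
```

With alpha = 0.5 this downweights well-classified but poorly localized proposals, which is the usual motivation for letting an IoU estimate interact with classification confidence.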
Lighting conditions significantly affect the quality of both real and AI-generated images. Facial images are particularly sensitive to lighting due to their detailed nature and the importance of facial features in conveying identity. Poor lighting can easily obscure these critical details. To address this issue, various portrait relighting methods have been developed to adjust the lighting in improperly exposed images. However, these methods often encounter challenges such as overexposure, underexposure, and detail loss in the relighted portraits. Consequently, there is a need for effective quality assessment and control of relighted human heads (RHHs). In this study, a simple proposed baseline and three typical relighting methods are applied to six selected human head (HH) images, producing a quality assessment dataset named ReLI-QA that comprises 840 RHHs. A multidimensional subjective quality assessment method based on visual guidance is proposed to accurately evaluate the visual quality of each RHH in the dataset. Analysis of the subjective experiments shows that the quality of RHHs is affected by multiple factors. Finally, based on ReLI-QA, some typical image quality assessment (IQA) methods are selected for benchmark experiments. The experimental results show the limitations of the existing methods in RHH quality assessment. The dataset and code for this research have been released at https://***/zyj-2000/ReLI-QA.
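For reference, IQA benchmark experiments such as these typically report the Pearson (PLCC) and Spearman (SRCC) correlations between each method's objective predictions and the subjective scores; a minimal sketch with placeholder array names:

```python
# Standard correlation metrics for benchmarking IQA methods against MOS.
from scipy.stats import pearsonr, spearmanr

def iqa_correlations(predicted, mos):
    """predicted: objective quality scores; mos: subjective mean opinion scores."""
    plcc, _ = pearsonr(predicted, mos)   # linear consistency
    srcc, _ = spearmanr(predicted, mos)  # rank (monotonic) consistency
    return plcc, srcc
```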
HTTP adaptive streaming (HAS) constructs bitrate ladders to deliver videos with the best possible quality under varying network conditions. Though per-shot content adaptive encoding (CAE) largely improves compression efficiency by constructing the optimal bitrate ladder for each video shot, it suffers from excessive encoding complexity, as all the points in the operating space (typically resolution × bitrate) need to be encoded and compared. To address this issue, this paper proposes an efficient bitrate ladder construction method that encodes only a subset of operating points, then uses curve fitting and inter-curve prediction to estimate the rate-distortion (RD) performance of the remaining points. The proposed method enables low-complexity ladder construction even for high-dimensional operating spaces that incorporate dimensions such as encoding presets. Experiments show that this method achieves RD performance comparable to the original per-shot CAE with only 42% of the encoding points. Even when the encoded points are reduced to 3.6% of those in the original CAE, it achieves a 15% BD-rate improvement over a fixed bitrate ladder.
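A minimal sketch of the curve-fitting step: fit a parametric RD model to the few encoded points of one resolution's curve, then query it at unencoded bitrates instead of encoding them. The logarithmic model, the illustrative numbers, and the initial guess are assumptions, not the paper's exact choices.

```python
# Fit an assumed logarithmic RD model to a sparse set of encoded points.
import numpy as np
from scipy.optimize import curve_fit

def rd_model(rate, a, b, c):
    # Quality grows roughly logarithmically with bitrate.
    return a * np.log(rate + c) + b

# Suppose only three operating points of one resolution were encoded.
rates = np.array([500.0, 2000.0, 8000.0])     # kbps (illustrative)
quality = np.array([76.2, 86.2, 97.0])        # e.g., VMAF (illustrative)
params, _ = curve_fit(rd_model, rates, quality, p0=(5.0, 20.0, 50.0))

# Estimate the RD performance of an unencoded point from the fitted curve.
est_quality = rd_model(4000.0, *params)
```

Inter-curve prediction would then reuse such fitted curves across neighboring resolutions or presets, so that even fewer points per curve need actual encoding.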
Video-based point cloud compression (V-PCC) converts dynamic point cloud data into video sequences, using traditional video codecs for efficient encoding. However, this lossy compression scheme introduces artifacts that degrade the color attributes of the data. This paper introduces a framework designed to enhance the color quality of V-PCC compressed point clouds. We propose the lightweight de-compression Unet (LDC-Unet), a 2D neural network, to optimize the projection maps generated during V-PCC encoding. The optimized 2D maps are then back-projected to 3D space to enhance the corresponding point cloud attributes. Additionally, we introduce a transfer learning strategy and develop a customized natural image dataset for the initial training; the model is then fine-tuned using the projection maps of the compressed point clouds. This strategy effectively addresses the scarcity of point cloud training data. Our experiments, conducted on the public 8i voxelized full bodies long sequences (8iVSLF) dataset, demonstrate the effectiveness of the proposed method in improving color quality.
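The two-stage transfer-learning schedule can be sketched as below; the training loop, data loaders, L1 restoration loss, and learning rates are placeholder assumptions, with only the pretrain-then-fine-tune structure taken from the abstract.

```python
# Sketch: pretrain on natural images, then fine-tune on projection maps.
import torch
import torch.nn as nn

def train(model, loader, epochs, lr):
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.L1Loss()   # assumed restoration loss
    for _ in range(epochs):
        for degraded, clean in loader:
            opt.zero_grad()
            loss = loss_fn(model(degraded), clean)
            loss.backward()
            opt.step()

# Stage 1: initial training on the customized natural-image dataset.
#   train(ldc_unet, natural_image_loader, epochs=50, lr=1e-4)
# Stage 2: fine-tune on projection maps of compressed point clouds, at a
# lower learning rate so the pretrained features are preserved.
#   train(ldc_unet, projection_map_loader, epochs=10, lr=1e-5)
```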
We present a new image compression paradigm to achieve "intelligently coding for machine" by cleverly leveraging the common sense of Large Multimodal Models (LMMs). We are motivated by the evidence that large language/multimodal models are powerful general-purpose semantic predictors for understanding the real world. Unlike traditional image compression, which is typically optimized for human eyes, the image coding for machines (ICM) framework we focus on requires the compressed bitstream to better serve different downstream intelligent analysis tasks. To this end, we employ an LMM to tell the codec what to compress: 1) we first utilize the powerful semantic understanding of LMMs, with respect to object grounding, identification, and importance ranking via prompts, to disentangle the image content before compression; 2) based on these semantic priors, we then encode and transmit the objects of the image in order, as a structured bitstream. In this way, diverse vision benchmarks, including image classification, object detection, instance segmentation, etc., can be well supported by such a semantically structured bitstream. We dub our method "SDComp" for "Semantically Disentangled Compression" and compare it with state-of-the-art codecs on a wide variety of vision tasks. The SDComp codec yields more flexible reconstruction, promising decoded visual quality, and more generic, satisfactory support for intelligent tasks.
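A hedged sketch of the "LMM tells the codec what to compress" flow: prompt a multimodal model for grounded objects and an importance ranking, then encode regions in that order into a structured bitstream. The prompt wording, the query_lmm helper, and the encode_region codec are all hypothetical placeholders, not SDComp's actual interfaces.

```python
# Hypothetical pipeline: semantic disentanglement before region-wise coding.
import json

PROMPT = ("List the objects in this image with bounding boxes and rank them "
          "by importance for downstream analysis. Answer as JSON: "
          '[{"label": ..., "box": [x1, y1, x2, y2], "rank": ...}]')

def semantically_structured_encode(image, query_lmm, encode_region):
    objects = json.loads(query_lmm(image, PROMPT))   # grounding + ranking
    objects.sort(key=lambda o: o["rank"])            # most important first
    # Encode object regions in importance order, then the residual
    # background, so a truncated bitstream still carries the content
    # that downstream analysis tasks need most.
    chunks = [encode_region(image, o["box"]) for o in objects]
    chunks.append(encode_region(image, None))        # background / residual
    return b"".join(chunks)
```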
Currently, there is high demand for neural network-based image compression codecs. These codecs employ non-linear transforms to create compact bit representations and facilitate faster coding speeds on devices compared to the hand-crafted transforms used in classical frameworks. The scientific and industrial communities are highly interested in these properties, leading to the standardization effort of JPEG-AI. The JPEG-AI verification model has been released and is currently under development for standardization. Utilizing neural networks, it can outperform the classic codec VVC intra by over 10% BD-rate at the base operation point. Researchers attribute this success to flexible bit distribution in the spatial domain, in contrast to the VVC intra anchor, which is generated at a constant quality point. However, our study reveals that VVC intra achieves an even more adaptable bit distribution structure through its use of variable block sizes. Based on these observations, we propose a spatial bit allocation method to optimize the JPEG-AI verification model's bit distribution and enhance visual quality. Furthermore, by applying the VVC bit distribution strategy, the objective performance of the JPEG-AI verification model can be further improved, yielding a maximum gain of 0.45 dB in PSNR-Y.
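To make the idea of spatial bit allocation concrete, the sketch below modulates a learned codec's latent with a per-region importance map before quantization, so that important areas receive a finer effective step size and hence more bits. This illustrates the general mechanism under stated assumptions, not the exact method applied to the JPEG-AI verification model.

```python
# Hedged sketch of spatially adaptive quantization in a learned codec.
import torch

def spatially_allocated_quantize(latent, importance_map, base_step=1.0,
                                 strength=0.5):
    """latent: (B, C, h, w) analysis-transform output;
    importance_map: (B, 1, h, w) in [0, 1], higher = more bits."""
    # Smaller effective quantization step where importance is high.
    step = base_step * (1.0 - strength * importance_map)
    q = torch.round(latent / step)   # per-position rounding (quantization)
    return q * step                  # dequantized latent for synthesis
```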