检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

时间限定

出版年份：

文献类型

图书期刊文献学位论文多媒体

馆藏选择

电子馆藏纸本馆藏

核心期刊

全部期刊 SCI 收录期刊 SSCI 收录期刊 EI 收录期刊 CSCD 收录期刊 CSSCI 收录期刊

语言

中文英文

文献类型

期刊文献图书学位论文标准纸本馆藏

帮助

文字说明：

T=题名（书名、题名），A=作者（责任者），K=主题词，P=出版物名称，PU=出版社名称，O=机构（作者单位、学位授予单位、专利申请人），L=中图分类号，C=学科分类号，U=全部字段，Y=年（出版发行年、学位年度、标准发布年）

检索规则说明：

AND代表“并且”；OR代表“或者”；NOT代表“不包含”；(注意必须大写,运算符两边需空一格)

检索范例：

范例一：(K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
范例二：P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016

分类表

所选分类

>> <<

限定检索结果

文献类型

3,429 篇 会议
27 篇 期刊文献
14 册 图书

馆藏范围

3,470 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

1,946 篇 工学
- 1,271 篇 计算机科学与技术...
- 955 篇 信息与通信工程
- 686 篇 软件工程
- 520 篇 电气工程
- 190 篇 光学工程
- 108 篇 生物工程
- 79 篇 电子科学与技术（可...
- 62 篇 生物医学工程（可授...
- 60 篇 仪器科学与技术
- 60 篇 控制科学与工程
- 54 篇 机械工程
- 33 篇 化学工程与技术
- 32 篇 网络空间安全
- 23 篇 动力工程及工程热...
- 23 篇 安全科学与工程
- 13 篇 土木工程
- 13 篇 交通运输工程
959 篇 医学
- 951 篇 临床医学
- 29 篇 基础医学(可授医学...
- 26 篇 药学(可授医学、理...
774 篇 理学
- 574 篇 物理学
- 192 篇 数学
- 114 篇 生物学
- 53 篇 统计学（可授理学、...
- 33 篇 化学
- 29 篇 系统科学
132 篇 管理学
- 71 篇 管理科学与工程(可...
- 69 篇 图书情报与档案管...
- 17 篇 工商管理
30 篇 法学
- 28 篇 社会学
11 篇 军事学
9 篇 文学
5 篇 经济学
5 篇 农学
4 篇 教育学

主题

418 篇 image coding
349 篇 visual communica...
308 篇 image processing
293 篇 visualization
222 篇 feature extracti...
177 篇 image segmentati...
147 篇 image compressio...
139 篇 training
136 篇 video coding
130 篇 image reconstruc...
117 篇 image color anal...
107 篇 cameras
103 篇 image quality
100 篇 deep learning
95 篇 image enhancemen...
88 篇 image edge detec...
85 篇 humans
83 篇 three-dimensiona...
77 篇 motion estimatio...
76 篇 decoding

机构

36 篇 shanghai jiao to...
29 篇 institute of ima...
24 篇 school of electr...
20 篇 university of sc...
18 篇 shanghai jiao to...
16 篇 shanghai jiao to...
16 篇 tianjin univ sch...
16 篇 beijing universi...
11 篇 university of el...
11 篇 cas key laborato...
11 篇 tsinghua univ de...
10 篇 univ sci & techn...
10 篇 peking univ inst...
10 篇 institute of ima...
9 篇 zhejiang univers...
9 篇 tsinghua univ de...
9 篇 school of electr...
9 篇 xidian univ sch ...
9 篇 shanghai jiao to...
8 篇 school of remote...

作者

34 篇 zhai guangtao
26 篇 sumei li
25 篇 song li
22 篇 li sumei
21 篇 guangtao zhai
18 篇 li li
18 篇 li song
18 篇 min xiongkuo
16 篇 dong liu
16 篇 yang xiaokang
16 篇 shan liu
15 篇 andré kaup
14 篇 chen zhibo
13 篇 xie rong
13 篇 xiongkuo min
12 篇 gao wen
11 篇 heming sun
11 篇 zhibo chen
11 篇 zhenzhong chen
11 篇 gao zhiyong

语言

3,406 篇 英文
49 篇 土耳其文
22 篇 中文
7 篇 其他

检索条件"任意字段=Conference on Visual Communications and Image Processing"

共 3470 条记录，以下是371-380 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Deviation Control for Learned image Compression

Deviation Control for Learned Image Compression

引用

IEEE visual communications and image processing (VCIP)

作者： Yuqi Li Haotian Zhang Xiaomin Song Zheng Liu Huiming Zheng Li Li Dong Liu University of Science and Technology of China Sichuan Xinshi Chuangwei Ultra HD Technology Co. Ltd

ISBN: (数字)9798331529543

ISBN: (纸本)9798331529550

Most approaches in learned image compression follow the transform coding scheme. The characteristics of latent variables transformed from images significantly influence the performance of codecs. In this paper, we present visual analyses on latent features of learned image compression and find that the latent variables are spread over a wide range, which may lead to complex entropy coding processes. To address this, we introduce a Deviation Control (DC) method, which applies a constraint loss on latent features and entropy parameter μ. Training with DC loss, we obtain latent features with smaller values of coding symbols and σ, effectively reducing entropy coding complexity. Our experimental results show that the plug-and-play DC loss reduces entropy coding time by 30-40% and improves compression performance.

关键词： Training Analytical models visualization image coding visual communication Transform coding Symbols Entropy coding Entropy Complexity theory

来源：评论

学校读者我要写书评

暂无评论

An Efficient Method for Real-Time image Exposure Correction

An Efficient Method for Real-Time Image Exposure Correction

引用

IEEE visual communications and image processing (VCIP)

作者： Jie Yang Yuantong Zhang Daiqin Yang Zhenzhong Chen School of Remote Sensing and Information Engineering Wuhan University Wuhan China

Exposure errors in images, including both underexposure and overexposure, significantly diminish images’ contrast and visual appeal. Existing deep learning-based exposure correction methods either require large networks or longer processing time for inference and are thus not applicable for embedded devices and real-time applications. To address these issues, a lightweight network is proposed in this paper to correct exposure errors with limited memory occupation and inference steps. It adopts the Laplacian pyramid to incrementally recover the color and details of the image through a layer-by-layer procedure. A structural re-parameterization structure is designed to both reduce model size for inference speed up and improve performance with a multi-branch learning structure. Extensive experiments demonstrate that our method achieves a better performance-efficiency trade-off than other exposure correction methods.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Vision Transformer and Bidirectional RoBERTa: A Hybrid image Captioning Model Between VirTex and CPTR 12th

Vision Transformer and Bidirectional RoBERTa: A Hybrid Imag...

引用

12th International Advanced Computing conference, IACC 2022

作者： Lam, Khang Nhut Le, Diem-Kieu Thi Ngo, Truong Dinh Kalita, Jugal Can Tho University Can Tho Viet Nam University of Colorado Colorado Springs United States

ISBN: (纸本)9783031356407

image captioning neural networks are trained simultaneously on image recognition sub-models and natural language processing sub-models to generate description sentences for images. This paper presents several image captioning models based on the encoder-decoder framework. We change the neural sub-models used for the encoder as well as the decoder, and make comparisons. First, we experiment with several ResNet architectures (viz., ResNet-50, ResNet-101, and ResNet-152) as encoders, and Transformer or bidirectional Transformer models as decoders. Second, we use the combination of the Vision Transformer as a visual encoder, and the standard Transformer or RoBERTa as the language decoder. Finally, we propose an image captioning model using Vision Transformer for encoding images and bidirectional Transformer for predicting image captions. The models are trained on the Flickr8k dataset in English and Vietnamese and evaluated using the BLEU metric. The combination model between the Vision Transformer and the bidirectional RoBERTa model outperforms the existing image captioning models, including VirTex and CPTR models. The BLEU-1, BLEU-2, BLEU-3, and BLEU-4 scores of our best image captioning model are 0.870, 0.661, 0.443, and 0.331 on the English dataset, and 0.829, 0.647, 0.483, and 0.387 on the Vietnamese dataset. © 2023, Springer Nature Switzerland AG.

关键词： image recognition

来源：评论

学校读者我要写书评

暂无评论

Low-complexity learning-based intra prediction with direction-dependent adaptive weights for beyond VVC

Low-complexity learning-based intra prediction with directio...

引用

IEEE visual communications and image processing (VCIP)

作者： Haruhisa Kato Yoshitaka Kidani Kei Kawamura KDDI Research Inc. Saitama Japan

ISBN: (数字)9798331529543

ISBN: (纸本)9798331529550

This paper introduces an advanced intra prediction method designed for the Enhanced Compression Model (ECM), which is the reference software for beyond versatile video coding (VVC) standard. It employs a learning-based method to adaptively assign weights for a weighted average across neighboring samples, resulting in more precise prediction samples. The proposed method derives optimized weights for each intra prediction mode, for each block size, and for each sample position. To achieve a reasonable balance between encoding time and prediction accuracy, the conventional intra prediction mode is shared with the proposed method. Experimental evaluations have demonstrated that the proposed method provides bitrate reduction of up to 0.4%.

关键词： Video coding Learning systems image coding Accuracy visual communication Bit rate Video sequences Predictive models Software Standards

来源：评论

学校读者我要写书评

暂无评论

Efficient Context and Saliency Aware Transformer Network for No-Reference image Quality Assessment

Efficient Context and Saliency Aware Transformer Network for...

引用

IEEE visual communications and image processing (VCIP)

作者： Hui Li Luxi Wang Yingming Li College of Information Science and Electronic Engineering Zhejiang University Hangzhou China

No-Reference image Quality Assessment (NR-IQA) aims to estimate the perceptual image quality without access to reference images. To deal with it effectively and efficiently, in this work we propose a Context and Saliency aware Transformer Network (CSTNet), which is built based on a lightweight pyramid Vision Transformer (ViT). Specifically, a Multi-scale Context Aware Refinement (MCAR) block is devised to fully leverage hierarchical context features extracted by the ViT backbone. Further, saliency map prediction is incorporated as a sub-task to simulate the human attention on salient regions when perceiving images. Extensive experiments on public image quality datasets demonstrate its efficiency and superiority compared to the state-of-the-art models.

关键词：

来源：评论

学校读者我要写书评

暂无评论

An Efficient ConvNet for Learned image Compression with Transformer-Style Architecture

An Efficient ConvNet for Learned Image Compression with Tran...

引用

IEEE visual communications and image processing (VCIP)

作者： Haihang Ruan Feng Wang Yan Wang Institute for AI Industry Research (AIR) Tsinghua University School of Electronic and Computer Engineering Peking University

Recently, transformer-based and convolution-based methods have achieved significant results in learned image compression. By comparing the design of convolutional network (convnet) and transformers, we replace the self-attention with convolution to capture spatial and channel adaptability. We propose a simple attention module (SAM) with transformer style. Combining the proposed SAM with channel-wise and checkerboard entropy model, we propose an efficient end-to-end learned image compression method. It is a simple method but obtains strong result and efficient coding speed. Experiments demonstrate that our method achieves competitive results by comparing with previous learning-based methods and conventional image codecs.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Using Regularity Unit As Guidance For Summarization-Based image Resizing

Using Regularity Unit As Guidance For Summarization-Based Im...

引用

IEEE International conference on visual communications and image processing (VCIP) - visual communications in the Era of AI and Limited Resources

作者： Hsiao, Fang-Tsung Lin, Yi-Hsien Lu, Yi-Chang Natl Taiwan Univ Dept Elect Engn Taipei 10617 Taiwan Natl Taiwan Univ Grad Inst Elect Engn Taipei 10617 Taiwan

ISBN: (纸本)9781728185514

In this paper, we propose a novel algorithm for summarization-based image resizing. In the past, a process of detecting precise locations of repeating patterns is required before the pattern removal step in resizing. However, it is difficult to find repeating patterns which are illuminated under different lighting conditions and viewed from different perspectives. To solve the problem, we first identify the regularity unit of repeating patterns by statistics. Then we can use the regularity unit for shift-map optimization to obtain a better resized image. The experimental results show that our method is competitive with other well-known methods.

关键词： image resizing image summarization repeating pattern removal

来源：评论

学校读者我要写书评

暂无评论

An image Retrieval System Using Deep Learning to Extract High-Level Features 14th

An Image Retrieval System Using Deep Learning to Extract Hig...

引用

14th International conference on Computational Collective Intelligence (ICCCI)

作者： Jabnoun, Jihed Haffar, Nafaa Zrigui, Ahmed Nsir, Sirine Nicolas, Henri Trigui, Aymen Univ Monastir Res Lab Algebra Numbers Theory & Intelligent Syst Monastir Tunisia Univ Bordeaux LaBRI Lab Talence France DB Consulting 4 Rue Simone de Beauvoir F-94140 Alfortville France

ISBN: (纸本)9783031162107;9783031162091

The usual procedure used in Content Based image retrieval (CBIR), is to extract some useful low-level features such as color, texture and shape from the query image and retrieve images that have a similar set of features. However, the problem with using low-level features is the semantic gap between image feature representation and human visual understanding. That is why many researchers are devoted for improving content-based image retrieval methods with a particular focus on reducing the semantic gap between low-level features and human visual perceptions. Those researchers are mainly focused on combining low level features together to have a better representation of the content of an image, which make it closer to the human visual perception but still not close enough to reduce the semantic gap. In this paper we'll start by a comprehensive review on the recent researches in the field of image Retrieval, then we propose a CBIR system based on convolutional neural network and transfer learning to extract high-level features, as an initiative part of a larger project that aims to retrieve and collect images containing the Arabic language for natural language processing tasks.

关键词： image retrieval CNNs Features extraction Transfer learning

来源：评论

学校读者我要写书评

暂无评论

KonIQ-10k-LT: Overcoming Score Priors in Blind image Quality Assessment Under Imbalanced Distributions

KonIQ-10k-LT: Overcoming Score Priors in Blind Image Quality...

引用

IEEE visual communications and image processing (VCIP)

作者： Desen Yuan Lei Wang University of Electronic Science and Technology of China

ISBN: (数字)9798331529543

ISBN: (纸本)9798331529550

Blind image Quality Assessment (BIQA) is essential in computational vision for predicting the visual quality of digital images without reference counterparts. Despite advancements through convolutional neural networks (CNNs), a significant challenge in BIQA remains the long-tail distribution of image quality scores, leading to biased training and reduced model generalization. To address this, we restructured the KonIQ-10k dataset to create an imbalanced version named KonIQ-10k-LT, manipulating the distribution of image quality scores to have opposing distributions in the training and validation sets. This restructuring increases the proportion of certain quality scores in the training set while decreasing them in the validation set. Experimental results show a significant performance decline of BIQA models on the KonIQ-10k-LT dataset compared to the original KonIQ-10k, highlighting the challenge posed by the long-tail distribution. To mitigate this issue, we propose a Proportion Weighted Balancing (PWB) method as a baseline, designed to enhance the robustness and generalization ability of BIQA models. Our findings demonstrate that the proposed WB method improves the performance and reliability of BIQA models under these challenging conditions.

关键词： image quality Training visualization Heavily-tailed distribution visual communication image processing Digital images Computational modeling Robustness Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Effect of latency on social presence in traditional video conference and VR conference: a comparative study

Effect of latency on social presence in traditional video co...

引用

IEEE visual communications and image processing (VCIP)

作者： Shengnan Wang Jiarun Song Anthony Trioux Miao Wang Yusong Gao Fuzheng Yang School of Telecommunications Engineering Xidian University Xi’an China

Virtual reality (VR) conference, as a typical social VR application, has gained popularity in recent years. It offers users located at different locations a fully immersive experience and a sense of togetherness. However, the remote communication also introduces inevitable latencies, which may adversely affect the so-called social presence. There is still a lack of research on the effect of latency on social presence. To fill the gap, this paper aims to examine the impact of latency on social presence of VR conference and contrast it with that of traditional video conference. Here, the social presence is measured using the Networked Minds Social Presence Inventory (NMSPI). We design and conduct two conversation-based subjective tests for both types of conference and compare the impact of the latency based on the test results. The conclusions of these studies can be used as guidelines for VR service providers to optimize their conference systems.

关键词：

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共347页 << < 34 35 36 37 38 39 40 41 42 43 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：