ISBN (print): 9798331529543; 9798331529550
In streaming media services, video transcoding is a common practice to alleviate bandwidth demands. Unfortunately, traditional methods employing a uniform rate factor (RF) across all videos often result in significant inefficiencies. Content-adaptive encoding (CAE) techniques address this by dynamically adjusting encoding parameters based on video content characteristics. However, existing CAE methods are often tightly coupled with specific encoding strategies, leading to inflexibility. In this paper, we propose a model that predicts both RF-quality and RF-bitrate curves, which can be combined to derive a comprehensive bitrate-quality curve. This approach facilitates flexible adjustments to the encoding strategy without necessitating model retraining. The model leverages codec features, content features, and anchor features to predict the bitrate-quality curve accurately. Additionally, we introduce an anchor suspension method to enhance prediction accuracy. Experiments confirm that the actual quality metric (VMAF) of the compressed video stays within +/- 1 of the target, achieving an accuracy of 99.14%. By combining our quality improvement strategy with the rate-quality curve prediction model, we conducted online A/B tests, observing a +0.107% improvement in both video views and video completions and a +0.064% increase in app usage duration. Our model has been deployed in the Xiaohongshu App.
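The curve derivation the abstract describes can be illustrated with a small sketch: given RF-quality and RF-bitrate curves predicted at a few anchor RFs, the two compose into a bitrate-quality curve, and the RF for a target VMAF follows by inverting the monotone quality curve. All curve values, the anchor RFs, and the function names below are illustrative assumptions, not figures from the paper.

```python
# Sketch: derive a bitrate-quality mapping from predicted RF->VMAF and
# RF->bitrate curves and pick the RF that hits a target VMAF.
# The sampled values below are hypothetical per-video predictions.

def interp(x, xs, ys):
    """Piecewise-linear interpolation of y(x) over sorted sample points."""
    for (x0, y0), (x1, y1) in zip(zip(xs, ys), zip(xs[1:], ys[1:])):
        if x0 <= x <= x1:
            t = (x - x0) / (x1 - x0)
            return y0 + t * (y1 - y0)
    raise ValueError("x outside sampled range")

rfs     = [18, 23, 28, 33]            # anchor rate factors (lower = higher quality)
vmaf    = [96.0, 91.0, 84.0, 75.0]    # predicted RF-quality curve
bitrate = [5200, 2600, 1300, 700]     # predicted RF-bitrate curve (kbps)

def rf_for_target_vmaf(target):
    """Invert the monotone RF-quality curve to find the RF hitting `target`."""
    # VMAF decreases with RF, so interpolate over the reversed axis.
    return interp(target, vmaf[::-1], rfs[::-1])

rf = rf_for_target_vmaf(88.0)         # RF expected to land near VMAF 88
kbps = interp(rf, rfs, bitrate)       # corresponding point on the bitrate curve
```

Because the encoding strategy only consumes the derived bitrate-quality curve, a different target (quality cap, bitrate cap) can be applied without retraining, which is the flexibility the paper emphasizes.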
ISBN (print): 9798331529543; 9798331529550
AI-generated images (AGIs) are increasingly utilized across diverse domains due to their ability to quickly produce high-quality visuals. However, assessing the quality of AGIs remains challenging due to their inherent variability and distinctive distortions. To address these challenges, we propose a novel AGI quality assessment method named SIRQA, which enhances feature representation by integrating visual features with textual prompts, effectively measuring the alignment between the generated images and the described content to improve the precision of quality assessment. Specifically, SIRQA employs self-ranking and inter-ranking mechanisms to refine feature representation. The self-ranking mechanism maintains consistency between feature distances and sampling scales, ensuring that features from similar sampling scales are positioned closer together. Additionally, the inter-ranking mechanism sorts the weighted similarity scores between images and prompts to align with the ranking in the label space. Extensive experiments on the AGIQA3K and PKUI2IQA datasets show that our SIRQA outperforms eight state-of-the-art algorithms in terms of both Spearman's rank correlation coefficient (SRCC) and Pearson linear correlation coefficient (PLCC).
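The inter-ranking idea, ordering image-prompt similarity scores to agree with the label-space ranking, can be sketched as a pairwise objective. The hinge formulation below is one common way to realize such an ordering constraint; the paper's exact loss, weighting, and margin may differ.

```python
# Sketch of an inter-ranking-style objective: penalize image-prompt similarity
# scores whose pairwise order disagrees with the order of the quality labels.

def pairwise_ranking_loss(scores, labels, margin=0.1):
    """Sum of hinge penalties for score pairs ordered against their labels."""
    loss = 0.0
    n = len(scores)
    for i in range(n):
        for j in range(n):
            if labels[i] > labels[j]:
                # scores[i] should exceed scores[j] by at least `margin`
                loss += max(0.0, margin - (scores[i] - scores[j]))
    return loss
```

A correctly ordered score list with sufficient gaps incurs zero loss, while any inversion relative to the labels contributes a positive penalty that a gradient-based trainer would push against.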
ISBN (print): 9798331529543; 9798331529550
Neural Radiance Fields (NeRF) have demonstrated exceptional performance in generating novel views of scenes by learning implicit volumetric representations from calibrated RGB images, without depth information. A major limitation is the need for large training datasets in neural network-based view synthesis frameworks. The challenge of effective data augmentation for view synthesis remains unresolved. NeRF models require extensive scene coverage from multiple views to accurately estimate radiance and density. Insufficient coverage reduces the model's ability to interpolate or extrapolate unseen parts of the scene effectively. In this paper, we propose a novel pipeline that addresses this data augmentation issue using depth map information. We use depth image-based rendering (DIBR) to compensate for the shortage of training views for NeRF. Experimental results indicate that our approach enhances the quality of rendered images using the NeRF framework, achieving an average peak signal-to-noise ratio (PSNR) increase of 7.2 dB, with a maximum improvement of 12 dB.
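DIBR in its simplest horizontal-shift form can be sketched in a few lines: each pixel is forward-warped by a disparity derived from its depth to synthesize a virtual view. This is a generic illustration of the technique, not the paper's pipeline; real systems also handle hole filling and the baseline/focal product here is a placeholder.

```python
# Sketch of depth image-based rendering (DIBR): forward-warp one image row to
# a virtual viewpoint using per-pixel depth; None marks disoccluded pixels.

def dibr_warp(row, depth_row, baseline_focal=64.0):
    """Warp `row` by disparity = baseline*focal / depth, with a z-buffer."""
    out = [None] * len(row)
    zbuf = [float("inf")] * len(row)
    for x, (v, z) in enumerate(zip(row, depth_row)):
        disp = int(round(baseline_focal / z))    # nearer pixels shift more
        nx = x - disp
        if 0 <= nx < len(row) and z < zbuf[nx]:  # keep the nearest surface
            out[nx], zbuf[nx] = v, z
    return out
```

Warped views generated this way from the existing images and their depth maps become extra training samples, which is the augmentation role DIBR plays in the pipeline described above.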
ISBN (print): 9798331529543; 9798331529550
The structural similarity of point clouds presents challenges in accurately recognizing and segmenting semantic information at the demarcation points of complex scenes or objects. In this study, we propose a multi-scale graph transformer network (MGTN) for 3D point cloud semantic segmentation. First, a multi-scale graph convolution (MSG-Conv) is devised to address the limitations faced by existing methods when simultaneously extracting local and global features of point cloud data with varying densities. Subsequently, we employ a graph-transformer (G-T) module to enhance edge details and spatial position information in the point cloud, thereby improving recognition accuracy for small objects and easily confused elements such as columns and beams. Extensive testing on the ShapeNet parts and S3DIS datasets was conducted to demonstrate the effectiveness of MGTN. Compared to the baseline network DGCNN, our proposed MGTN achieves substantial performance improvements, with notable mIoU increases of 1.5% and 18.5% on the ShapeNet parts and S3DIS datasets, respectively. Additionally, MGTN outperforms the recent CFSA-Net by 2.3% and 3.4% in OA and mIoU, respectively.
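The multi-scale neighborhood idea behind an MSG-Conv-style layer can be sketched as building kNN graphs at several values of k, so that both sparse and dense regions yield usable local structure. The function names and scales below are illustrative; the paper's actual convolution aggregates learned edge features over such neighborhoods.

```python
# Sketch: per-point neighbor sets at multiple scales, the graph-construction
# step that a multi-scale graph convolution would consume.

def knn(points, i, k):
    """Indices of the k nearest neighbors of point i (excluding itself)."""
    order = sorted(range(len(points)),
                   key=lambda j: sum((a - b) ** 2
                                     for a, b in zip(points[i], points[j])))
    return order[1:k + 1]

def multi_scale_neighbors(points, i, scales=(2, 4)):
    """One neighbor set per scale; features from each are fused downstream."""
    return {k: knn(points, i, k) for k in scales}
```

A small k captures fine local geometry while a large k approximates broader context, which is how a multi-scale graph layer sees both at once.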
ISBN (print): 9798331529543; 9798331529550
Due to the substantial storage requirements of 4D medical images, achieving efficient compression of such images is a crucial topic. Existing traditional image/video coding methods have achieved remarkable results in most compression tasks, but their performance in encoding 4D medical images remains poor. This is because these methods cannot fully exploit the spatio-temporal correlations in 4D images. Recently, implicit neural representation (INR) based image/video compression methods have made significant progress, with coding performance comparable to traditional methods. However, like traditional methods, they also suffer from significant performance losses in 4D medical image compression. In this paper, we propose an efficient hybrid representation framework, which comprises six learnable feature planes and a tiny MLP decoder. This framework alleviates the inability of previous methods to utilize the spatio-temporal correlations in 4D medical images, enabling it to capture this information more effectively. We also introduce a novel adaptive plane scaling strategy that allocates the number of parameters in each plane based on the resolution of the image. This design allows the model to further enhance reconstruction quality at the same compression ratio. Extensive experiments show that our model achieves better RD performance than traditional and INR-based methods, and it also offers faster encoding speeds than INR-based methods.
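The six-plane decomposition can be sketched as follows: a 4D coordinate (x, y, z, t) has six axis pairs (xy, xz, xt, yz, yt, zt), each indexing one learnable 2D feature plane, and the gathered features feed a small MLP decoder. Nearest-neighbor lookup is used below for brevity (bilinear sampling is the usual choice), and the data layout is an assumption for illustration.

```python
# Sketch: project a 4D coordinate onto six 2D feature planes and gather one
# feature per plane; the concatenation would go to a tiny MLP decoder.

PAIRS = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]  # axis pair per plane

def sample_planes(planes, coord):
    """planes: six 2D grids; coord: integer (x, y, z, t) for simplicity."""
    feats = []
    for plane, (a, b) in zip(planes, PAIRS):
        u, v = coord[a], coord[b]
        feats.append(plane[u][v])
    return feats
```

Because every plane covers one spatio-temporal axis pair, correlations along any pair of the four dimensions are captured by at least one plane, which is the property the framework relies on.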
ISBN (print): 9798350388978; 9798350388961
In computer vision applications, image enhancement is important for improving image quality and extracting meaningful information. Noise removal is a commonly used technique in image enhancement. In this study, the Batch Renormalization Denoising Network (BRDNet), which performs well in noise removal, is used as the base model, and the Bottleneck Attention Module (BAM) is incorporated to improve its performance. The proposed method is tested on different datasets with different noise levels, and the results are compared. In quantitative experiments, an increase in the PSNR metric was observed, and the visual results were found to be closer to the target images.
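The channel-attention half of a BAM-style module can be sketched as: squeeze each channel to a scalar by global average pooling, pass it through a tiny bottleneck, and gate the channel with a sigmoid. The scalar weights below are a toy stand-in for the module's fully connected layers, and the actual BAM also has a spatial branch; this is a simplified illustration, not the paper's implementation.

```python
# Sketch of channel attention: pool -> bottleneck -> sigmoid gate -> rescale.

import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_attention(feature_maps, w1, w2):
    """feature_maps: list of 2D channels; w1/w2: toy bottleneck weights."""
    # Global average pool each channel to one scalar.
    pooled = [sum(map(sum, ch)) / (len(ch) * len(ch[0])) for ch in feature_maps]
    hidden = [max(0.0, p * w1) for p in pooled]          # ReLU bottleneck
    gates = [sigmoid(h * w2) for h in hidden]            # per-channel gate in (0, 1)
    return [[[v * g for v in row] for row in ch]
            for ch, g in zip(feature_maps, gates)]
```

A gate near 1 passes the channel through unchanged, while a gate near 0 suppresses it, letting the denoiser emphasize informative channels.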
ISBN (digital): 9783031585357
ISBN (print): 9783031585340; 9783031585357
Captioning an image is the process of describing it with syntactically and semantically meaningful terms. An image caption generator is developed by integrating computer vision and natural language processing technology. Although numerous techniques for generating image captions have been developed, the results remain inadequate, and further research in this area is still needed. The human process of describing an image (seeing, focusing, and captioning) corresponds to feature representation, visual encoding, and language generation in image captioning systems. This study presents the construction of a simple deep learning-based image captioning model and investigates the efficacy of different visual encoding methods employed in the model. We have analyzed and compared the performance of six different pre-trained CNN visual encoding models using Bilingual Evaluation Understudy (BLEU) scores.
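The BLEU metric used to compare the encoders can be sketched in its simplest form, BLEU-1: unigram precision of the candidate caption against a reference, with the standard brevity penalty. Real evaluations combine n-grams up to order 4, apply clipping per n-gram, and allow multiple references; this is a minimal single-reference illustration.

```python
# Sketch: BLEU-1 = brevity penalty * clipped unigram precision.

import math
from collections import Counter

def bleu1(candidate, reference):
    cand, ref = candidate.split(), reference.split()
    # Clipped overlap: each candidate word counts at most as often as in the reference.
    overlap = sum((Counter(cand) & Counter(ref)).values())
    precision = overlap / len(cand)
    # Penalize candidates shorter than the reference.
    bp = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / len(cand))
    return bp * precision
```

An identical candidate scores 1.0, and any missing or spurious words pull the score below 1, which is the ordering the study relies on when ranking encoders.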
ISBN (print): 9798350343557
Automod is an artificial intelligence content moderation system that detects similarities and inconsistencies in user-generated visual content (images and videos). With the similarity module installed, labor savings of 15% were achieved, and the nonconformity detection models achieved F1 scores of 90% and higher. More than 100,000 images can be evaluated daily, and the system's load capacity was tested. Similarly, keyframes extracted from the at least 65,000 videos that can be evaluated daily were passed through the nonconformity models, and a load test was applied.
ISBN (print): 9798350343557
In this study, the alignment of video-text and image-text datasets is studied. First, similarities are calculated over the texts in the two datasets. A retrieval setup based on visual similarities is then applied to the subset created from the calculated text similarities. A BERT-based embedding method is applied to both the raw and the cleaned texts. As visual features, object-based and CLIP-based methods are used to represent video frames. According to the results, alignment with CLIP features achieves the best results on the subset created by filtering with raw text.
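The text-side filtering step can be sketched with cosine similarity: embed the captions (BERT in the study; toy vectors below), score every cross-dataset pair, and keep those above a threshold as the subset that visual retrieval then re-ranks. The embeddings, threshold, and function names are illustrative assumptions.

```python
# Sketch: cosine-similarity filtering between two sets of text embeddings.

import math

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def best_matches(video_texts, image_texts, threshold=0.8):
    """Pairs (i, j) whose embeddings are similar enough to enter the subset."""
    return [(i, j)
            for i, u in enumerate(video_texts)
            for j, v in enumerate(image_texts)
            if cosine(u, v) >= threshold]
```

Only the surviving pairs are scored with the (more expensive) visual features, which keeps the retrieval stage tractable on large datasets.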
ISBN (print): 9798350350463; 9798350350456
Perceptual quality metrics derived from deep features have significantly advanced the modelling of how the Human Visual System (HVS) perceives the quality of visual content. In this work, we study the effectiveness of fine-tuning three standard convolutional neural networks (CNNs), viz. ResNet50, VGG16 and MobileNetV2, to predict the quality of stereoscopic images in the no-reference setting. This work also aims to understand the impact of using disparity maps for quality prediction. Interestingly, our experiments demonstrate that disparity maps do not significantly improve perceptual quality estimation in the deep learning framework. To the best of our knowledge, this is the first study that explores the impact of disparity together with the chosen models for stereoscopic image quality assessment. We present a detailed study of our experiments with various architectural configurations on the LIVE Phase I and II datasets. Further, our results demonstrate the innate capability of deep features for quality prediction. Finally, simple fine-tuning of the models yields solutions that compete with state-of-the-art patch-based stereoscopic image quality assessment methods.