检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

4,257 篇 会议
64 篇 期刊文献
23 册 图书

馆藏范围

4,344 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

2,428 篇 工学
- 1,613 篇 计算机科学与技术...
- 1,167 篇 信息与通信工程
- 944 篇 软件工程
- 670 篇 电气工程
- 280 篇 光学工程
- 144 篇 电子科学与技术（可...
- 125 篇 生物工程
- 122 篇 控制科学与工程
- 88 篇 生物医学工程（可授...
- 79 篇 仪器科学与技术
- 68 篇 机械工程
- 43 篇 化学工程与技术
- 33 篇 网络空间安全
- 25 篇 动力工程及工程热...
- 25 篇 安全科学与工程
- 21 篇 测绘科学与技术
- 21 篇 轻工技术与工程
1,097 篇 医学
- 1,086 篇 临床医学
- 38 篇 基础医学(可授医学...
- 33 篇 药学(可授医学、理...
1,023 篇 理学
- 745 篇 物理学
- 350 篇 数学
- 134 篇 生物学
- 99 篇 统计学（可授理学、...
- 44 篇 化学
- 32 篇 系统科学
191 篇 管理学
- 109 篇 图书情报与档案管...
- 92 篇 管理科学与工程(可...
- 30 篇 工商管理
32 篇 法学
- 29 篇 社会学
12 篇 军事学
9 篇 文学
7 篇 教育学
6 篇 经济学
5 篇 农学

主题

531 篇 image coding
460 篇 image processing
351 篇 visual communica...
310 篇 visualization
253 篇 feature extracti...
223 篇 image segmentati...
173 篇 image compressio...
166 篇 image reconstruc...
149 篇 video coding
146 篇 cameras
134 篇 training
132 篇 humans
124 篇 image quality
120 篇 image color anal...
111 篇 image enhancemen...
111 篇 signal processin...
110 篇 image retrieval
103 篇 image edge detec...
102 篇 decoding
100 篇 deep learning

机构

36 篇 shanghai jiao to...
29 篇 institute of ima...
24 篇 school of electr...
20 篇 university of sc...
18 篇 shanghai jiao to...
16 篇 shanghai jiao to...
16 篇 tianjin univ sch...
16 篇 beijing universi...
12 篇 institute for in...
12 篇 school of electr...
11 篇 university of el...
11 篇 cas key laborato...
11 篇 tsinghua univ de...
10 篇 univ sci & techn...
10 篇 peking univ inst...
10 篇 tsinghua univ de...
10 篇 institute of ima...
9 篇 zhejiang univers...
9 篇 smart computer v...
9 篇 xidian univ sch ...

作者

34 篇 zhai guangtao
26 篇 sumei li
25 篇 song li
22 篇 li sumei
21 篇 guangtao zhai
19 篇 li li
18 篇 li song
18 篇 min xiongkuo
16 篇 dong liu
16 篇 shan liu
15 篇 andré kaup
15 篇 yang xiaokang
14 篇 chen zhibo
13 篇 xie rong
13 篇 xiongkuo min
13 篇 gao wen
11 篇 m. vetterli
11 篇 heming sun
11 篇 zhibo chen
11 篇 zhenzhong chen

语言

4,253 篇 英文
66 篇 土耳其文
26 篇 中文
12 篇 其他
1 篇 法文

检索条件"任意字段=Conference on Visual Communications and Image Processing 2004"

共 4344 条记录，以下是71-80 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

image Captioning with Local-Global visual Interaction Network 29th

Image Captioning with Local-Global Visual Interaction Networ...

引用

29th International conference on Neural Information processing

作者： Wang, Changzhi Gu, Xiaodong Fudan Univ Dept Elect Engn Shanghai 200438 Peoples R China

ISBN: (纸本)9789819916443;9789819916450

Existing attention based image captioning approaches treat local feature and global feature in the image individually, neglecting the intrinsic interaction between them that provides important guidance for generating caption. To alleviate above issue, in this paper we propose a novel Local-Global visual Interaction Network (LGVIN) that novelly explores the interactions between local feature and global feature. Specifically, we devise a new visual interaction graph network that mainly consists of visual interaction encoding module and visual interaction fusion module. The former implicitly encodes the visual relationships between local feature and global feature to obtain an enhanced visual representation containing rich local-global feature relationship. The latter fuses the previously obtained multiple relationship features to further enrich different-level relationship attribute information. In addition, we introduce a new relationship attention based LSTM module to guide the word generation by dynamically focusing on the previously output fusion relationship information. Extensive experimental results show that the superiority of our LGVIN approach, and our model obviously outperforms the current similar relationship based image captioning methods.

关键词： image captioning visual interaction Graph network

来源：评论

学校读者我要写书评

暂无评论

visual Instruction Inversion: image Editing via visual Prompting 37

Visual Instruction Inversion: Image Editing via Visual Promp...

引用

37th conference on Neural Information processing Systems (NeurIPS)

作者： Nguyen, Thao Li, Yuheng Ojha, Utkarsh Lee, Yong Jae Univ Wisconsin Madison Madison WI 53706 USA

ISBN: (纸本)9781713899921

Text-conditioned image editing has emerged as a powerful tool for editing images. However, in many situations, language can be ambiguous and ineffective in describing specific image edits. When faced with such challenges, visual prompts can be a more informative and intuitive way to convey the desired edit. We present a method for image editing via visual prompting. Given example pairs that represent the "before" and "after" images of an edit, our approach learns a text-based editing direction that can be used to perform the same edit on new images. We leverage the rich, pretrained editing capabilities of text-to-image diffusion models by inverting visual prompts into editing instructions. Our results show that even with just one example pair, we can achieve competitive results compared to state-of-the-art text-conditioned image editing frameworks.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Dynamic Target Tracking Method of Surveillance Video Based on image processing 3

Dynamic Target Tracking Method of Surveillance Video Based o...

引用

3rd Asia-Pacific conference on communications Technology and Computer Science (ACCTCS)

作者： Chen, Yiping Yuan, Fengshan Guangzhou Inst Sci & Technol Dept Comp Sci & Engn Guangzhou 510540 Peoples R China

ISBN: (纸本)9798350310801

In surveillance video, target tracking is an important part. Based on image processing technology, this paper studies a real-time and effective method to collect and recognize camera motion information. Firstly, the influence of visual dead angle and illumination on recognition is analyzed. Secondly, according to the characteristic of background light intensity, the corresponding algorithm is designed to realize the positioning and tracking control strategy of the target and surrounding environment scenery. Finally, the correctness of the method is verified by MATLAB simulation software, so as to obtain a better and scalable scheme, which is more economical and feasible after the occlusion rate is minimized.

关键词： image processing Surveillance Video Dynamic Target Tracking Method

来源：评论

学校读者我要写书评

暂无评论

Distance Weighted Refining Segmentation Method for visual Quality Improvement in V-PCC

Distance Weighted Refining Segmentation Method for Visual Qu...

引用

2023 IEEE International conference on visual communications and image processing, VCIP 2023

作者： Lee, Min Ku Kim, Yong-Hwan Korea Electronics Technology Institute Intelligent Image Processing Research Center Seongnam-si Korea Republic of

ISBN: (纸本)9798350359855

In this paper, we propose an innovative method for refining segmentation method that improves the visual quality of Video-based Point Cloud Compression (V-PCC) encoder. Recently standardized as an international standard by MPEG, V-PCC standard provides state-of-The-Art performance in compressing dynamic and dense point cloud object. However, lossy V-PCC encoder has an unavoidable problem of visual quality degradation due to lost points. When converting a 3D point cloud to 2D patches in the V-PCC encoder, some points constituting a point cloud are not converted. In particular, in the refining segmentation of the 2D patch generation process, points that are changed the projection plane due to over-smoothing can be discarded. We propose a distance weighted refining segmentation method that reduces the number of missed points to improve visual quality. Experimental results show a noticeable improvement in visual quality with minor coding gain. © 2023 IEEE.

关键词： Refining

来源：评论

学校读者我要写书评

暂无评论

A Hybrid CNN-Tree Based Model for Enhanced image Classification Performance 32

A Hybrid CNN-Tree Based Model for Enhanced Image Classificat...

引用

32nd IEEE Signal processing and communications Applications conference (SIU)

作者： Aydin, Musa Kus, Zeki Akcelik, Zeliha Kaya Fatih Sultan Mehmet Vakif Univ Bilgisayar Muhendisligi Istanbul Turkiye

ISBN: (纸本)9798350388978;9798350388961

Blood cells play an essential role in various bodily functions, such as protection against infections and the body's defense. The accurate classification of blood cells, generally grouped as red, white, and platelets is important for clinical diagnosis and hematological analysis. However, identifying these cells is a specialized and time-consuming process. Therefore, there is a hot-topic for high-precision automatic blood cell classification methods. Convolutional neural networks (CNNs) are a deep learning model used for visual data analysis and are very powerful in extracting features from data. In this study, we propose a hybrid classification model that combines the feature extraction power of CNNs with the ensemble-based prediction capabilities of Random Forest and XGBoost algorithms. The proposed hybrid model is compared with different methods on the BloodMNIST dataset in terms of classification performance and inference time. The results show that the tree-based methods outperform CNN by up to 8.49 and 11.62 points and achieve up to 82.9 times better inference times than other methods.

关键词： CNN feature extraction Random forest XGBoost Blood cell classification

来源：评论

学校读者我要写书评

暂无评论

Quality Assessment of Screen Content images Based on Multi-Pathway Convolutional Neural Network

Quality Assessment of Screen Content Images Based on Multi-P...

引用

IEEE International conference on visual communications and image processing (VCIP)

作者： Li, Mengyao Li, Sumei Tianjin Univ Sch Elect & Informat Engn Tianjin Peoples R China

ISBN: (纸本)9781665475921

In this paper, considering the retinal structure of human eye, and the composition characteristics of screen content images (SCIs), a multi-pathway convolutional neural network (CNN) with picture-text competition is proposed for SCIs quality assessment. According to the visual mechanism of human retina, we design a retinal structure simulation module, which uses multiple parallel convolution pathways to simulate the parallel transmission of visual signals by bipolar cells and uses a multi-pathway feature fusion (MPFF) module to allocate the weight for each channel to simulate horizontal cells' regulation of the information transmission. In addition, we design an adaptive feature extraction and competition module (AFEC) to directly extract the features of textural and pictorial regions and distribute the weight. Furthermore, the attention module combined with deformable convolution and channel attention can accurately extract image edge features and reduce redundancy of information. Experimental results show that the proposed method is superior to the mainstream methods.

关键词： image quality assessment screen content image convolutional neural network

来源：评论

学校读者我要写书评

暂无评论

image Captioning with Multiple Perspectives—A visual Context-Based Approach 5th

Image Captioning with Multiple Perspectives—A Visual Contex...

引用

5th International conference on Computing and Network communications, CoCoNet 2023

作者： Ashwin, G. Chaitanya, V. Kishan Rohith Nair, Priyanka C. Department of Computer Science and Engineering Amrita School of Computing Amrita Vishwa Vidyapeetham Bengaluru560035 India

ISBN: (纸本)9789819747108

This study tackles the difficult issues of image captioning while negotiating the complexity of visual data processing. The complexity of visual data and the associated processing requirements make image captioning a daunting task. This research describes an effective method for image captioning that makes advantage of attentional mechanisms. Several models, including Transformer, VGG16, VGG19, Inception, RNN decoders, and Bahdanau’s attention mechanism, are contrasted and analyzed to demonstrate the benefits of integrating attention processes. By allowing the model to focus on the appropriate visual component, attention improves the accuracy of the generated captions. Transformer models, in particular, outperform other methods by capturing complicated relationships and delivering accurate output, and they have the highest BLEU Score of 70. The findings emphasize the significance of attention mechanisms and the relevance of selecting a suitable model architecture to maximize picture captioning performance. The variety of potential applications emphasizes the benefits and potential impact of image captioning in a broad range of scenarios. Several use cases are found beneficial, including supporting the visually impaired, improving product descriptions in e-commerce, assisting medical diagnosis, improving image search ability, and allowing effective communication and comprehension of visual information. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Diagnosis

来源：评论

学校读者我要写书评

暂无评论

Research on Some Key Technologies of Deep Learning in the Field of Computer Vision 3

Research on Some Key Technologies of Deep Learning in the Fi...

引用

3rd International conference for Innovation in Technology, INOCON 2024

作者： Mao, Yirong Mao, Rui Yunnan Communications Vocational and Technical College 650500 China

ISBN: (纸本)9798350381931

The role of computer vision technology in the field of artificial intelligence development is very important, but there is a problem of poor application effect of key technologies. Traditional neural network algorithms cannot solve the problems of image classification and inaccurate image detection in computer visual perception tasks. In the tide of artificial intelligence, the combination of computer vision and deep learning has become an important force to promote technological progress. By imitating the processing mechanism of images and videos by human brain, deep learning has not only made a breakthrough in understanding complex visual information, but also demonstrated its incomparable ability in many fields. © 2024 IEEE.

关键词： image classification

来源：评论

学校读者我要写书评

暂无评论

No-reference Stereoscopic image Quality Assessment Based on Parallel Multi-scale Perception

No-reference Stereoscopic Image Quality Assessment Based on ...

引用

IEEE International conference on visual communications and image processing (VCIP)

作者： Zhang, Ziyi Li, Sumei Tianjin Univ Sch Elect & Informat Engn Tianjin Peoples R China

ISBN: (纸本)9781665475921

With the rapid development of 3D technologies, effective no-reference stereoscopic image quality assessment (NR-SIQA) methods are in great demand. In this paper, we propose a parallel multi-scale feature extraction convolution neural network (CNN) model combined with novel binocular feature interaction consistent with human visual system (HVS). In order to simulate the characteristics of HVS sensing multi-scale information at the same time, parallel multi-scale feature extraction module (PMSFM) followed by compensation information is proposed. And modified convolutional block attention module (MCBAM) with less computational complexity is designed to generate visual attention maps for the multi-scale features extracted by the PMSFM. In addition, we employ cross-stacked strategy for multi-level binocular fusion maps and binocular disparity maps to simulate the hierarchical perception characteristics of HVS. Experimental results show that our method is superior to the state-of-the-art metrics and achieves an excellent performance.

关键词： no-reference stereoscopic image quality assessment (NR-SIQA) convolution neural network (CNN) human visual system (HVS)

来源：评论

学校读者我要写书评

暂无评论

DesnowFormer: an effective transformer-based image desnowing network

DesnowFormer: an effective transformer-based image desnowing...

引用

IEEE International conference on visual communications and image processing (VCIP)

作者： Zhang, Ting Jiang, Nanfeng Lin, Junhong Lin, Jielian Zhao, Tiesong Fuzhou Univ Coll Phys & Informat Engn Fujian Key Lab Intelligent Proc & Wireless Transm Fuzhou Peoples R China

ISBN: (纸本)9781665475921

Single image desnowing is an important and challenge task for lots of computer vision applications, such as visual tracking and video surveillance. Although existing deep learning-based methods have achieved promising results, most of them rely on the local deep features and neglect global relationship information between the local regions. Therefore, inevitably leading to over-smooth or detail loss results. To solve this issue, we design a UNet-based end-to-end architecture for image desnowing. Specially, to better characterize global information and preserve image detail, we combine Window-based Self-Attention (WSA) transformer block with Residue Spatial Attention (RSA) to build basic unit of our network. Besides, to protect the structure of the image effectively, we also introduce a Residue Channel (RC) loss to guide high-quality image restoration. Extensive experimental results on both synthetic and real-world datasets demonstrate that the proposed model achieves new state-of-the-art results.

关键词： image Desnowing Transformer block Residual Spatial Attention

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共435页 << < 4 5 6 7 8 9 10 11 12 13 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：