ISBN: (Print) 9789819786848; 9789819786855
With the development of Artificial Intelligence Generated Content (AIGC), fake image detection has become increasingly challenging. Leveraging the advanced capabilities of large language models (LLMs) in sequence prediction, we propose a novel perspective on fake image detection by fine-tuning pure LLMs. We introduce Fake-GPT, an LLM with 7 billion parameters that can differentiate between real and fake images. Unlike conventional image processing models, our approach directly processes RGB pixel values without relying on any position embedding or visual-language feature alignment, thereby reducing model complexity and processing steps. Our research demonstrates the effective application of LLMs to detecting fake images, thereby expanding their use in non-textual domains. Extensive experiments conducted on various deepfake datasets show that Fake-GPT achieves competitive results compared with conventional image processing models, underscoring its potential as a new paradigm in the realm of image authentication.
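The abstract does not specify how pixels are serialized or classified; as a rough illustration of feeding raw RGB values to a position-embedding-free sequence model, here is a minimal toy sketch (all module names, dimensions, and the pooling head are assumptions, not the authors' 7B architecture):

```python
import torch
import torch.nn as nn

# Hypothetical toy analogue of Fake-GPT's input pipeline: raw 8-bit RGB values
# become tokens, and no position embedding is added, per the abstract.
class PixelSequenceClassifier(nn.Module):
    def __init__(self, vocab_size=256, d_model=128, n_layers=2, n_heads=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)    # one token per 8-bit value
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, 2)                 # real vs. fake logits

    def forward(self, images):                            # images: (B, 3, H, W) uint8
        tokens = images.flatten(1).long()                 # row-major RGB serialization
        h = self.backbone(self.embed(tokens))             # note: no position embedding
        return self.head(h.mean(dim=1))                   # pool over sequence, classify

model = PixelSequenceClassifier()
batch = torch.randint(0, 256, (2, 3, 16, 16), dtype=torch.uint8)
logits = model(batch)                                     # shape (2, 2)
```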
ISBN: (Print) 9789819785049; 9789819785056
Vision Mamba (VMamba) has recently attracted great research attention due to its ability to obtain a global receptive field with linear computational complexity. However, similar to the Vision Transformer (ViT), its patch-division mechanism leaves it with insufficient ability to describe local details. To address this issue, we design a dual-stream network that combines VMamba and CNN, aiming to give the network both the global receptive field of VMamba and the local detail description capability of CNN; both characteristics are crucial for remote sensing image semantic segmentation. The two streams are supervised and trained through independent loss functions. To enable sufficient information exchange between the two branches, we introduce an auto-scaling fusion module that bridges the semantic gap between VMamba and CNN. Experiments demonstrate that the proposed method outperforms state-of-the-art methods on multiple remote sensing semantic segmentation datasets.
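A minimal sketch of how such a dual-stream design with independent supervision and a gated fusion could look; the sigmoid-gated blend and the plain-conv stand-in for the VMamba branch below are assumptions, since the abstract does not detail the auto-scaling fusion:

```python
import torch
import torch.nn as nn

class AutoScalingFusion(nn.Module):
    """Assumed form of the auto-scaling fusion module: a learned sigmoid gate
    blends the two streams to bridge their semantic gap."""
    def __init__(self, channels):
        super().__init__()
        self.gate = nn.Sequential(nn.Conv2d(2 * channels, channels, 1), nn.Sigmoid())

    def forward(self, f_vmamba, f_cnn):
        g = self.gate(torch.cat([f_vmamba, f_cnn], dim=1))
        return g * f_vmamba + (1 - g) * f_cnn

class DualStreamSeg(nn.Module):
    def __init__(self, channels=64, n_classes=6):
        super().__init__()
        # A plain conv stands in for the VMamba branch; the real model would
        # use selective-scan blocks to obtain the global receptive field.
        self.vmamba = nn.Sequential(nn.Conv2d(3, channels, 7, padding=3), nn.ReLU())
        self.cnn = nn.Sequential(nn.Conv2d(3, channels, 3, padding=1), nn.ReLU())
        self.fuse = AutoScalingFusion(channels)
        self.head_vm = nn.Conv2d(channels, n_classes, 1)    # per-stream heads allow the
        self.head_cnn = nn.Conv2d(channels, n_classes, 1)   # independent loss functions
        self.head_fused = nn.Conv2d(channels, n_classes, 1)

    def forward(self, x):
        fv, fc = self.vmamba(x), self.cnn(x)
        return self.head_fused(self.fuse(fv, fc)), self.head_vm(fv), self.head_cnn(fc)
```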
ISBN: (Print) 9789819786190; 9789819786206
Currently, large vision-language models have made promising progress on many downstream tasks. However, they still face many challenges in fine-grained visual understanding tasks such as object attribute comprehension. Moreover, while there have been growing efforts to evaluate large vision-language models, there is a lack of in-depth study of attribute comprehension and of the visual-language fine-tuning process. In this paper, we propose to evaluate the attribute comprehension ability of large vision-language models from two perspectives: attribute recognition and attribute hierarchy understanding. We evaluate three vision-language interactions: visual question answering, image-text matching (ITM), and image-text cosine similarity (ITC). Furthermore, we explore the factors affecting attribute comprehension during fine-tuning. Through a series of quantitative and qualitative experiments, we report three main findings: (1) Large vision-language models possess good attribute recognition ability, but their hierarchical understanding ability is relatively limited. (2) Compared to ITC, ITM exhibits a superior capability for capturing finer details, making it more suitable for attribute understanding tasks. (3) The attribute information in the captions used for fine-tuning plays a crucial role in attribute understanding. We hope this work can help guide future progress in fine-grained visual understanding of large vision-language models. The code will be available at Attribute-Comprehension-of-VLMs.
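For readers unfamiliar with the ITC/ITM distinction the findings rest on, a schematic sketch follows: ITC scores a pair by cosine similarity of independently encoded embeddings, while ITM fuses the two modalities with cross-attention before a binary match head, which is why it can capture finer details. Dimensions and module choices below are illustrative, not the evaluated models':

```python
import torch
import torch.nn.functional as F

def itc_score(img_emb, txt_emb):
    """ITC: cosine similarity of independently encoded modalities (no fusion)."""
    return F.cosine_similarity(img_emb, txt_emb, dim=-1)

class ITMHead(torch.nn.Module):
    """ITM: a cross-modal encoder attends text tokens to image tokens, then a
    binary head scores match/no-match -- the joint encoding is what lets ITM
    pick up finer attribute details than ITC."""
    def __init__(self, d=256):
        super().__init__()
        self.cross = torch.nn.MultiheadAttention(d, num_heads=4, batch_first=True)
        self.head = torch.nn.Linear(d, 2)

    def forward(self, img_tokens, txt_tokens):   # (B, N, d), (B, T, d)
        fused, _ = self.cross(txt_tokens, img_tokens, img_tokens)
        return self.head(fused.mean(dim=1))      # logits for (no-match, match)
```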
ISBN: (Print) 9789819785100; 9789819785117
Species diversity is one of the major differences between animal action recognition and human action recognition, resulting in a series of challenges, e.g., action manifestation diversity, concurrent actions, and long-tailed distributions in datasets. As the same action can manifest very differently across animal species due to their physiological differences, it is crucial for models to learn to distinguish the varied visual content under the same label from species-aware perspectives. However, previous works mainly applied single-species recognition methods to animal datasets, without considering species diversity in animal action recognition. To fill this gap, we propose a novel animal action recognition approach, Species-Aware Guidance (SAG), which provides species-specific guidance by exploiting pre-trained vision-language knowledge. First, we add word-level species semantics to visual embeddings as guidance, leading the model to focus on relevant regions of target animals in subsequent visual understanding. Then, we apply spatiotemporal modeling at both global and local granularity via a two-branch module to obtain a cross-modal video representation. Finally, sentence-level species-aware semantics is fused with action labels as an overall query, guiding the video representation to output the final action label via the decoder. On two widely used public benchmarks of animal action recognition, covering both single-label and multi-label scenarios, SAG achieves state-of-the-art performance, e.g., on Animal Kingdom (↑5.0%) and MammalNet (↑27.0%) compared with existing methods, and it particularly alleviates the long-tailed distribution problem, demonstrating the effectiveness of species guidance under limited training data.
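A minimal sketch of the first step as described (word-level species guidance injected into visual embeddings); the broadcast-add below is an assumed realization, and the paper may fuse the two differently:

```python
import torch

def species_guided_tokens(visual_tokens, species_word_emb):
    """Word-level species guidance: inject the species word embedding into the
    visual embeddings so later layers focus on the target animal. The
    broadcast-add is an assumed fusion operator."""
    # visual_tokens: (B, N, D) patch embeddings; species_word_emb: (B, D)
    return visual_tokens + species_word_emb.unsqueeze(1)

tokens = torch.randn(2, 196, 512)                 # e.g., 14x14 patches from a frame
species = torch.randn(2, 512)                     # text embedding of "zebra", "otter", ...
guided = species_guided_tokens(tokens, species)   # (2, 196, 512)
```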
ISBN: (Print) 9789819784950; 9789819784967
The task of medical image recognition is notably complicated by the presence of varied and multiple pathological indications, presenting a unique challenge in multi-label classification with unseen labels. This complexity underlines the need for computer-aided diagnosis methods employing multi-label zero-shot learning. Recent advancements in pre-trained vision-language models (VLMs) have showcased notable zero-shot classification abilities on medical images. However, these methods have limitations in leveraging extensive pre-trained knowledge from broader datasets, and they often depend on manual prompt construction by expert radiologists. By automating prompt tuning, prompt learning techniques have emerged as an efficient way to adapt VLMs to downstream tasks. Yet, existing CoOp-based strategies fall short in producing class-specific prompts for unseen categories, limiting generalizability in fine-grained scenarios. To overcome these constraints, we introduce a novel prompt generation approach inspired by text generation in natural language processing (NLP). Our method, named Pseudo-Prompt Generating (PsPG), capitalizes on the prior knowledge of multi-modal features. Featuring an RNN-based decoder, PsPG autoregressively generates class-tailored embedding vectors, i.e., pseudo-prompts. Comparative evaluations on various multi-label chest radiograph datasets affirm the superiority of our approach over leading medical vision-language and multi-label prompt learning methods. The source code is available at https://***/fallingnight/PsPG.
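A minimal sketch of the autoregressive pseudo-prompt idea, assuming a GRU cell conditioned on a fused multi-modal feature; the real PsPG decoder's conditioning, dimensions, and stopping rule are not given in the abstract:

```python
import torch
import torch.nn as nn

class PseudoPromptDecoder(nn.Module):
    """Sketch of PsPG's core idea: an RNN autoregressively emits a sequence of
    pseudo-prompt embeddings conditioned on multi-modal features. The GRU cell,
    dimensions, and conditioning below are assumptions."""
    def __init__(self, d=512, n_ctx=8):
        super().__init__()
        self.n_ctx = n_ctx
        self.cell = nn.GRUCell(d, d)
        self.proj = nn.Linear(d, d)

    def forward(self, cond):                     # cond: (B, D) fused image/class feature
        h, tok, prompts = cond, torch.zeros_like(cond), []
        for _ in range(self.n_ctx):
            h = self.cell(tok, h)                # autoregressive step
            tok = self.proj(h)                   # next pseudo-prompt vector
            prompts.append(tok)
        return torch.stack(prompts, dim=1)       # (B, n_ctx, D), fed to the text encoder
```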
ISBN: (Print) 9789819785049; 9789819785056
Partially observable multi-agent cooperation (POMAC) is a popular task in multi-agent systems, where recognized environments such as the StarCraft Multi-Agent Challenge play a vital role in algorithm development and testing. However, POMAC in the real world often involves situations beyond the simulation scope of current environments, such as asynchronous cooperation, which largely limits the development of multi-agent cooperation algorithms. To close this gap, we propose the WarGame Challenge (WGC), which provides four sub-environments reflecting reality-inspired characteristics of POMAC: cooperation with asynchronous actions, strongly stochastic environments, changeable agents, and asymmetric opponents. Along with the benchmark, we integrate the PyMARL package and provide baseline multi-agent reinforcement learning algorithms for researchers' use. The code and projects will be released after the paper review process.
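Since WGC ships with PyMARL baselines, its environments presumably expose an SMAC-style stepping interface; the rollout loop below is a guess at that interface (method names like get_avail_agent_actions and the step return signature are assumptions), illustrating how asynchronous cooperation can surface as per-tick action-availability masks:

```python
import random

def rollout(env, n_agents, max_steps=100):
    """Random-policy episode against an assumed SMAC-style environment.
    Per-tick availability masks are one way to express asynchronous
    cooperation: an agent with no available actions sits out that step."""
    env.reset()
    total_reward, done, t = 0.0, False, 0
    while not done and t < max_steps:
        actions = []
        for i in range(n_agents):
            avail = env.get_avail_agent_actions(i)            # assumed method
            choices = [a for a, ok in enumerate(avail) if ok]
            actions.append(random.choice(choices) if choices else 0)
        reward, done, info = env.step(actions)                # assumed return triple
        total_reward += reward
        t += 1
    return total_reward
```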
ISBN: (Print) 9789819787913; 9789819787920
In the field of autonomous driving, environmental perception is crucial for driving safety. Addressing the limitations of existing visual perception methods in complex scenarios, this study proposes a deformable depth visual perception framework based on a multi-camera system. The framework processes multi-camera data through a feature extraction network to generate and fuse multi-scale features. A deformable depth prediction mechanism incorporating ego-vehicle temporal difference features is then introduced to improve the model's depth prediction accuracy. Experimental results show that on the NuScenes dataset, our method achieves a detection accuracy (mAP) of 0.508 using only 5 random cameras out of 6, surpassing existing techniques such as Lift-Splat (0.446), RC-BEVFusion (0.476), and SOGDet-SE (0.474). Future research will focus on improving prediction accuracy for distant vehicles to further enhance model performance.
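One plausible reading of the deformable depth prediction mechanism, sketched with torchvision's deform_conv2d: the ego-vehicle temporal difference of features drives the sampling offsets before a categorical depth head. This is an assumption throughout; the abstract does not detail the mechanism:

```python
import torch
import torch.nn as nn
from torchvision.ops import deform_conv2d

class TemporalDiffDepthHead(nn.Module):
    """Assumed mechanism: the frame-to-frame feature difference predicts the
    sampling offsets of a 3x3 deformable convolution, and the resampled
    features feed a categorical (binned) depth head."""
    def __init__(self, c=256, n_depth_bins=64):
        super().__init__()
        self.offset = nn.Conv2d(c, 18, 3, padding=1)           # 2 * 3 * 3 offset channels
        self.weight = nn.Parameter(torch.randn(c, c, 3, 3) * 0.01)
        self.depth = nn.Conv2d(c, n_depth_bins, 1)

    def forward(self, feat_t, feat_prev):                      # (B, C, H, W) each
        offsets = self.offset(feat_t - feat_prev)              # temporal difference
        sampled = deform_conv2d(feat_t, offsets, self.weight, padding=1)
        return self.depth(sampled).softmax(dim=1)              # per-pixel depth bins
```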
ISBN: (Print) 9789819784950; 9789819784967
Accurate polyp segmentation is crucial for the early detection of colorectal cancer. However, existing polyp detection methods sometimes ignore multi-directional features and the drastic scale changes of concealed targets. To address these challenges, we design an Orthogonal Direction Enhancement and Scale-Aware Network (ODC-SA Net) for polyp segmentation. The Orthogonal Direction Convolutional (ODC) block extracts multi-directional features using transposed rectangular convolution kernels that form sets of orthogonal feature vector bases, which resolves the issue of random feature direction changes. Additionally, a Multi-scale Fusion Attention (MSFA) mechanism is proposed to emphasize scale changes in both spatial and channel dimensions, enhancing segmentation accuracy for polyps of varying sizes. An Extraction with Re-attention (ERA) module is used to recombine effective features, and a Shallow Reverse Attention (SRA) mechanism is used to enhance polyp edges with low-level information. Extensive experiments conducted on public datasets demonstrate that this model outperforms state-of-the-art methods.
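A minimal sketch of the orthogonal-direction idea, assuming paired 1×k and k×1 (transposed rectangular) kernels whose outputs are mixed by a 1×1 convolution; the exact composition used in ODC-SA Net is not given in the abstract:

```python
import torch
import torch.nn as nn

class ODCBlock(nn.Module):
    """Paired 1xk and kx1 (transposed rectangular) kernels act as an orthogonal
    direction basis; a 1x1 convolution mixes the two directional responses.
    The residual connection and mixing rule are assumptions."""
    def __init__(self, c, k=7):
        super().__init__()
        self.horizontal = nn.Conv2d(c, c, (1, k), padding=(0, k // 2))
        self.vertical = nn.Conv2d(c, c, (k, 1), padding=(k // 2, 0))
        self.mix = nn.Conv2d(2 * c, c, 1)

    def forward(self, x):
        directional = torch.cat([self.horizontal(x), self.vertical(x)], dim=1)
        return self.mix(directional) + x
```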
ISBN: (Print) 9789819786190; 9789819786206
Adapting pre-trained models to open classes is a challenging problem in machine learning. Vision-language models fully exploit the knowledge of the text modality and demonstrate strong zero-shot recognition performance, which makes them naturally suited to various open-set problems. More recently, some research has focused on fine-tuning such models for downstream tasks. Prompt tuning methods have achieved huge improvements by learning context vectors on few-shot data. However, by evaluating under an open-set adaptation setting where the test data includes new classes, we find a dilemma: learned prompts generalize worse than hand-crafted prompts. In this paper, we combine the advantages of both and propose a test-time prompt tuning approach that leverages maximum concept matching (MCM) scores as dynamic weights to generate an input-conditioned prompt for each image at test time. Through extensive experiments on 11 different datasets, we show that our proposed method outperforms all comparison methods on average, considering both base and new classes. The code is available at https://***/gaozhengqing/TTPT.
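A minimal sketch of using MCM scores as dynamic weights, assuming MCM is the maximum softmax probability over class-wise cosine similarities and that the learned and hand-crafted context vectors are blended convexly per image (the paper's exact combination rule is not stated in the abstract):

```python
import torch
import torch.nn.functional as F

def mcm_weight(img_emb, class_text_embs, temperature=0.01):
    """MCM score: maximum softmax probability over class-wise cosine similarities."""
    # img_emb: (B, D); class_text_embs: (C, D)
    sims = F.cosine_similarity(img_emb.unsqueeze(1), class_text_embs, dim=-1)  # (B, C)
    return sims.div(temperature).softmax(dim=-1).max(dim=-1).values            # (B,)

def blended_prompt(learned_ctx, handcrafted_ctx, w):
    """Assumed combination rule: per-image convex blend of learned and
    hand-crafted context vectors, weighted by MCM confidence w."""
    # learned_ctx, handcrafted_ctx: (n_ctx, D); w: (B,)
    w = w.view(-1, 1, 1)                          # broadcast to (B, n_ctx, D)
    return w * learned_ctx + (1 - w) * handcrafted_ctx
```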
ISBN: (Print) 9789819784981; 9789819784998
Local feature matching, which identifies correspondences between image pairs, remains a fundamental challenge in computer vision. Current methods usually utilize multi-scale feature fusion to refine reference areas and filter out irrelevant features. However, relying solely on an agent loss to supervise upper-level features can reduce refinement accuracy. In addition, the variance in significance among features within the reference region is often overlooked. In this paper, we propose an approach termed Cascaded Supervision-Neighborhood Consistency Probabilistic Modeling, which generates more accurate reference ranges for feature matching. Specifically, the proposed method first applies cascaded supervision to the matching results at various scales, enabling more precise refinement of regions. Then, it aggregates the matching results at each scale to maintain neighborhood consistency. Finally, probabilistic modeling of the refined reference region is employed, focusing more on relevant features. Extensive experiments conducted on four popular benchmarks demonstrate that our method achieves state-of-the-art or comparable performance.
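A minimal sketch of what cascaded supervision over a matching pyramid could look like: each scale's correspondence scores are supervised against pooled ground truth instead of relying on a single agent loss at the top level. The loss form and pooling below are assumptions:

```python
import torch
import torch.nn.functional as F

def cascaded_matching_loss(match_logits_per_scale, gt_matches):
    """Each scale of the matching pyramid is supervised directly against
    (max-pooled) ground-truth correspondences, rather than propagating one
    agent loss from the finest level. Loss form and pooling are assumptions."""
    # match_logits_per_scale: list of (B, 1, Hi, Wi) correspondence score maps
    # gt_matches: (B, 1, H, W) binary (float) ground-truth correspondence map
    loss = 0.0
    for logits in match_logits_per_scale:
        gt = F.adaptive_max_pool2d(gt_matches, logits.shape[-2:])
        loss = loss + F.binary_cross_entropy_with_logits(logits, gt)
    return loss / len(match_logits_per_scale)
```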