Search results - Inner Mongolia University Library

Search query: "Any field = 7th Chinese Conference on Pattern Recognition and Computer Vision"

2,188 records in total; showing records 61-70

Efficiency-Aware Fine-Grained Vision-Language Retrieval via a Global-Contextual Autoencoder

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Zheng, Min; Wu, Chunpeng; Wang, Yue; Liu, Weiwei; Ye, Qinghe; Chang, Ke; Shi, Cuncun; Zhou, Fei
Affiliations: State Grid Smart Grid Res Inst Co Ltd State Grid Lab Grid Adv Comp & Applicat Beijing 102209 Peoples R China

ISBN: (print) 9789819786190; 9789819786206

Fine-grained vision-language retrieval aims to search for corresponding fine-grained images based on a text query, or vice versa. The challenge lies in how to match cross-modal data by learning an effective alignment. This paper proposes a simple yet effective efficiency-aware method for fine-grained vision-language retrieval based on a global-contextual autoencoder. First, global-contextual features from the images and texts are learned to promote the discriminability of the intra-modality features. Then, to strengthen the semantic relevance among heterogeneous modalities, the method employs a semantic autoencoder. Concretely, the encoder projects the visual features into the semantic space occupied by the textual features. Further, the decoder applies an additional constraint: it should reconstruct the original visual features. Notably, the autoencoder is linear and symmetric, making it reasonable to scale up on large datasets. Comprehensive experiments on two fine-grained tasks illustrate that the proposed method surpasses several state-of-the-art baselines, validating its effectiveness and efficiency.

Keywords: Fine-grained vision-language retrieval; Global-contextual autoencoder; Efficiency-aware optimization algorithm
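The abstract above describes a linear, symmetric autoencoder whose encoder maps visual features into the textual semantic space and whose decoder reconstructs the visual features. The following is a minimal sketch of one objective consistent with that description. The tied-weight form (decoder equal to the transpose of the encoder), the loss terms, and every name (`W`, `X`, `S`, `lam`, `autoencoder_loss`) are assumptions made for illustration; the abstract does not state the paper's actual objective.

```python
# Illustrative sketch only: the objective and all names are assumptions,
# not taken from the paper.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def sq_frobenius(a, b):
    """Squared Frobenius norm of the difference (a - b)."""
    return sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def autoencoder_loss(W, X, S, lam=1.0):
    """Loss of a linear autoencoder with tied (symmetric) weights.

    W: k x d encoder matrix; the decoder is assumed to be its transpose.
    X: d x n visual features, one column per sample.
    S: k x n textual (semantic) features, one column per sample.
    lam: weight of the alignment term (hypothetical hyperparameter).
    """
    encoded = matmul(W, X)                   # visual features projected into semantic space
    decoded = matmul(transpose(W), encoded)  # reconstruction of the visual features
    reconstruction = sq_frobenius(X, decoded)
    alignment = sq_frobenius(encoded, S)
    return reconstruction + lam * alignment
```

Because both maps are linear and share one matrix, the only parameters are the entries of `W`, which is one plausible reading of why the abstract calls the design reasonable to scale.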

VS-LLM: Visual-Semantic Depression Assessment Based on LLM for Drawing Projection Test

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Wu, Meiqi; Kang, Yaxuan; Li, Xuchen; Hu, Shiyu; Chen, Xiaotang; Kang, Yunfeng; Wang, Weiqiang; Huang, Kaiqi
Affiliations: Univ Chinese Acad Sci Sch Comp Sci & Technol Beijing Peoples R China; Chinese Acad Sci Inst Automat Beijing Peoples R China; Univ Chinese Acad Sci Sch Artificial Intelligence Beijing Peoples R China; CAS Ctr Excellence Brain Sci & Intelligence Techn Shanghai Peoples R China

ISBN: (print) 9789819786916; 9789819786923

The Drawing Projection Test (DPT) is an essential tool in art therapy, allowing psychologists to assess participants' mental states through their sketches. Specifically, sketches on the theme of "a person picking an apple from a tree (PPAT)" can reveal whether participants are in mental states such as depression. Compared with scales, the DPT can enrich psychologists' understanding of an individual's mental state. However, the interpretation of the PPAT is laborious and depends on the experience of the psychologists. To address this issue, we propose an effective identification method to support psychologists in conducting a large-scale automatic DPT. Unlike traditional sketch recognition, the DPT focuses more on the overall evaluation of the sketches, such as color usage and space utilization. Moreover, the PPAT imposes a time limit and prohibits verbal reminders, resulting in low drawing accuracy and a lack of detailed depiction. To address these challenges, we make the following contributions: (1) we provide an experimental environment for automated analysis of PPAT sketches for depression assessment; (2) we offer a Visual-Semantic depression assessment method based on LLM (VS-LLM); (3) experimental results demonstrate that our method improves by 17.6% compared to the psychologist assessment method. We anticipate that this work will contribute to research on mental state assessment based on the recognition of elements in PPAT sketches. Our datasets and codes are available at: https://***/wmeiqi/VS-LLM.

Keywords: Drawing projection test; Art therapy; LLM; Multimodal depression assessment

Single Model Learns Multiple Styles of Chinese Calligraphy via Style Collection Mechanism

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Dong, Zhiqiang; Xiao, Yun; Duan, JiaShun; Wang, Xuanhong; Xu, Pengfei; Zheng, Xia
Affiliations: Northwest Univ Sch Informat Sci & Technol Xian Peoples R China; Xian Univ Posts & Telecommun Xian Peoples R China; Zhejiang Univ Hangzhou Zhejiang Peoples R China

ISBN: (print) 9789819784899; 9789819784905

The automatic generation of Chinese calligraphy images is a very challenging task because the structure of Chinese characters is very complex. At present, most methods learn the style of the images one by one, meaning that they lack the ability to model the style of the calligraphy from a more macro perspective. To solve these problems, this paper proposes a one-to-many style transfer model, SCGAN, based on a style collection mechanism. Our model can gather information at the collection level to complete the task of Chinese character image generation. The main features of our model are as follows: first, based on the proposed style collection mechanism, our model can collect and transform style features at the collection level; second, we redesigned the structure of the generative adversarial network. Our model can complete the one-to-many style transfer task, which can greatly reduce the workload associated with multi-target style transfer. Compared with other deep learning methods, the results obtained by our method are of higher quality and closer to reality. Experimental results show that our method achieves better performance than other methods in one-to-one and one-to-many Chinese character generation tasks.

Keywords: Style transfer; Chinese character generation; Virtual restoration of calligraphy

AtomTool: Empowering Large Language Models with Tool Utilization Skills

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Li, Yongle; Zhang, Zheng; Zhang, Junqi; Hu, Wenbo; Wu, Yongyu; Hong, Richang
Affiliations: Hefei Univ Technol Hefei Anhui Peoples R China; AtonEcho Beijing Peoples R China

ISBN: (print) 9789819784868; 9789819784875

In recent years, significant strides have been made in harnessing large language models (LLMs) to leverage various tools across different fields, which largely expands the application scope of LLMs. However, current research predominantly focuses on LLMs' inherent tool exploitation skills from their training data, leading to higher costs when integrating new tools. Additionally, most studies concentrate on English models, leaving a scarcity of open-source resources for other languages. This study investigates the zero-shot generalization of LLMs in tool usage, with a focus on Chinese models. We introduce AtomTool, an open-source framework for tool acquisition in LLMs, along with a dataset of 16,000 Chinese entries. This work marks the first effort to evaluate zero-shot generalization in Chinese models and provides the initial open-source framework and dataset dedicated to tool acquisition in Chinese LLMs. Our experiments show that AtomTool outperforms closed-source models such as ChatGPT in zero-shot generalization in most cases. We also propose a novel dataset construction method and evaluation framework, examining the effects of prompt design and tool quantity on model performance. Overall, our work establishes a solid foundation for advancing tool acquisition in Chinese LLMs.

Keywords: Large Language Models; Tool Learning; Zero-Shot Generalization; Open-Source Framework; Chinese Dataset

RefineStyle: Dynamic Convolution Refinement for StyleGAN

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Xia, Siwei; Hu, Xueqi; Sun, Li; Li, Qingli
Affiliations: East China Normal Univ Shanghai Key Lab Multidimens Informat Proc Shanghai Peoples R China

ISBN: (print) 9789819786916; 9789819786923

In StyleGAN, convolution kernels are shaped by both static parameters shared across images and dynamic modulation factors w+ ∈ W+ specific to each image. Therefore, the W+ space is often used for image inversion and editing. However, the pre-trained model struggles with synthesizing out-of-domain images due to the limited capabilities of W+ and its resultant kernels, necessitating full fine-tuning or adaptation through a complex hypernetwork. This paper proposes an efficient refining strategy for dynamic kernels. The key idea is to modify kernels by low-rank residuals, learned from the input image or domain guidance. These residuals are generated by matrix multiplication between two sets of tokens with the same number, which controls the complexity. We validate the refining scheme in image inversion and domain adaptation. In the former task, we design grouped transformer blocks to learn these token sets by one- or two-stage training. In the latter task, token sets are directly optimized to support synthesis in the target domain while preserving original content. Extensive experiments show that our method achieves low distortions for image inversion and high quality for out-of-domain editing.

Keywords: Computer vision; Generative models; GAN inversion; Domain adaptation
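The abstract above says the kernel residuals come from a matrix multiplication between two token sets of equal size, and that this size controls the complexity. The sketch below illustrates that general idea only: with r token pairs, the product is a sum of r outer products, so the residual has rank at most r. Treating the kernel as a flat 2-D matrix, and the names `low_rank_residual` and `refine_kernel`, are assumptions made for illustration rather than details from the paper.

```python
# Illustrative sketch only: kernel layout and function names are assumptions.

def low_rank_residual(tokens_a, tokens_b):
    """Return the sum of outer products a_i b_i^T over all token pairs.

    tokens_a: r tokens of length m; tokens_b: r tokens of length n.
    The result is an m x n matrix whose rank is at most r.
    """
    if len(tokens_a) != len(tokens_b):
        raise ValueError("both token sets must contain the same number of tokens")
    m, n = len(tokens_a[0]), len(tokens_b[0])
    residual = [[0.0] * n for _ in range(m)]
    for a, b in zip(tokens_a, tokens_b):
        for i in range(m):
            for j in range(n):
                residual[i][j] += a[i] * b[j]
    return residual


def refine_kernel(kernel, tokens_a, tokens_b):
    """Add the low-rank residual to an m x n kernel matrix, returning a new matrix."""
    delta = low_rank_residual(tokens_a, tokens_b)
    return [[k + d for k, d in zip(k_row, d_row)] for k_row, d_row in zip(kernel, delta)]
```

For an m x n kernel, this form stores r(m + n) numbers instead of mn, which is why the number of tokens acts as the complexity knob.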

An Asymmetric Game Theoretic Learning Model

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Yin, Qiyue; Yu, Tongtong; Feng, Xueou; Yang, Jun; Huang, Kaiqi
Affiliations: Chinese Acad Sci Inst Automat Beijing 100190 Peoples R China; Jianghuai Adv Technol Ctr Hefei 230000 Peoples R China; Tsinghua Univ Dept Automat Beijing 100084 Peoples R China

ISBN: (print) 9789819785018; 9789819785025

Recent successes of game AIs such as AlphaGo and AlphaStar, which beat professional human players in the games Go and StarCraft, respectively, mark breakthroughs in intelligent decision-making techniques for complex games. Generally, the games studied previously are mostly symmetric in the game-theoretic sense due to their sports or e-sports characteristics. However, games in reality are usually asymmetric because of position-dependent resource imbalance, and they are rarely studied. In this paper, we propose a novel asymmetric game model based on the framework of game theoretic learning. Specifically, we develop an agent training method with three steps: game model formulation, solution concept definition and game solution computation. To verify our model, a mini-Wargame is used in our experiment, where the initial number and visual scope are set to be unbalanced. Experiments show that the proposed method is better than popular self-play based methods such as naive self-play and prioritized fictitious self-play. The work provides a game-theoretic view of asymmetric games, and it may attract more interest in the rarely studied asymmetric games.

Keywords: Game theoretic learning; Self-play; Deep reinforcement learning

MedPrompt: Cross-modal Prompting for Multi-task Medical Image Translation

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Chen, Xuhang; Luo, Shenghong; Pun, Chi-Man; Wang, Shuqiang
Affiliations: Chinese Acad Sci Shenzhen Inst Adv Technol Shenzhen Peoples R China; Univ Macau Taipa Macao Peoples R China; Huizhou Univ Huizhou Peoples R China

ISBN: (print) 9789819784950; 9789819784967

The ability to translate medical images across different modalities is crucial for synthesizing missing data and aiding in clinical diagnosis. However, existing learning-based techniques have limitations when it comes to capturing cross-modal and global features. These techniques are often tailored to specific pairs of modalities, limiting their practical utility, especially considering the variability of missing modalities in different cases. In this study, we introduce MedPrompt, a multi-task framework designed to efficiently translate diverse modalities. Our framework incorporates the Self-adaptive Prompt Block, which dynamically guides the translation network to handle different modalities effectively. To encode the cross-modal prompt efficiently, we introduce the Prompt Extraction Block and the Prompt Fusion Block. Additionally, we leverage the Transformer model to enhance the extraction of global features across various modalities. Through extensive experimentation involving five datasets and four pairs of modalities, we demonstrate that our proposed model achieves state-of-the-art visual quality and exhibits excellent generalization capability. The results highlight the effectiveness and versatility of MedPrompt in addressing the challenges associated with cross-modal medical image translation.

Keywords: Medical image translation; Visual prompting; Vision transformer

HHATP: A Lightweight Heterogeneous Hierarchical Attention Model for Trajectory Prediction

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Lai, Zeyu; Zhu, Xingliang; Yang, Chunmeng; Kong, Bin
Affiliations: Chinese Acad Sci Hefei Inst Phys Sci Hefei 230031 Peoples R China; Univ Sci & Technol China Hefei 230026 Peoples R China; Anhui Engn Lab Intelligent Driving Technol & Appl Hefei 230031 Peoples R China

ISBN: (print) 9789819787913; 9789819787920

Predicting the future trajectories of agents in complex traffic scenes is one of the key issues in autonomous driving, requiring reliable and effective predictions for all agents in the scene. Existing trajectory prediction models have achieved high performance on public datasets, but deploying models on vehicles requires both high accuracy and fast computation. It is necessary to balance the complexity of computation and the effectiveness of the structure when designing a model. To address the above problem, we propose a lightweight trajectory prediction model, HHATP. Our method is scene-centric, with all elements located in the same coordinate system. We use different encoders for the heterogeneous scene objects, and the encoded results are then fed into a hierarchical attention module, which considers both global and local interaction to model the relationships between elements. Subsequently, a dynamic weight decoder is used to obtain the trajectories of all agents. Our method achieves good accuracy on the Argoverse dataset and enables fast inference.

Keywords: Attention mechanism; hierarchical structure; heterogeneous information; trajectory prediction

Semi-Supervised Camouflaged Object Detection: Multi Information Fusion Combined with Adaptive Receptive Field Selection Network

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Yang, Guang; Xiao, Feng; Liu, Ruyu; Zhang, Jiawei; Zhang, Jianhua; Chen, Shengyong
Affiliations: Tianjin Univ Technol Tianjin 300380 Peoples R China; Hangzhou Normal Univ Hangzhou 311121 Peoples R China

ISBN: (print) 9789819788576; 9789819788583

Camouflaged object detection is focused on segmenting objects concealed within their surroundings. This technology can be applied in various fields such as medical image analysis, wildlife conservation, autonomous driving, and others. Existing semi-supervised camouflaged object detection methods often suffer from poor network performance due to the accumulation of incorrect pseudo labels, and they fail to fully utilize multi-scale features or account for the diverse scale contexts necessary for camouflaged objects of various sizes. In this paper, we propose an innovative semi-supervised learning strategy. We employ a dual-branch network named CAMNet, utilizing salient maps corresponding to camouflaged objects to aid detection. We also introduce a Multi-Information Fusion Feature Perception module (MIF) and an Adaptive Receptive Field Selection module (ARFS), which are integrated into the network. Finally, we perform thorough comparative experiments on the R2C7K, COD-Water, and COD-Jungle datasets, showing superior performance compared with current state-of-the-art methods. We also conduct ablation experiments, further confirming the effectiveness of the proposed modules.

Keywords: Camouflaged object detection; Semi-supervised learning; Computer vision; Deep learning

Accelerating Domain Adaptation with Cascaded Adaptive Vision Transformer

7th Chinese Conference on Pattern Recognition and Computer Vision

Authors: Jiang, Qilin; Cui, Chaoran; Zhang, Chunyun; Zhen, Yongrui; Gong, Shuai; Liu, Ziyi; Meng, Fan'an; Zhao, Hongyan
Affiliations: Shandong Univ Finance & Econ Jinan Peoples R China

ISBN: (print) 9789819784868; 9789819784875

Domain adaptation (DA) aims to transfer knowledge from labeled source domains to unlabeled target domains, addressing the challenge of model generalization when there is a distribution mismatch between training and testing data. While many Vision Transformer (ViT)-based methods have been developed for DA, they focus primarily on improving accuracy, with less emphasis on accelerating inference on unlabeled target domains. In this paper, we propose a novel method named Cascaded Adaptive Vision Transformer (CAViT), which dynamically adjusts token counts for each input image by cascading multiple transformers with increasing tokens. During testing, "easier" images exit early, while "harder" images are processed further until confident predictions are achieved. We further enhance domain adversarial learning by incorporating a token-level domain discriminator in the attention layer, which assigns distinct weights to different patch tokens. This enables the network to learn features with cross-domain transferability and discriminative capabilities, achieving effective feature alignment. Experimental results demonstrate that our method not only improves accuracy but also significantly reduces computational costs, as evidenced by results on three benchmark datasets.

Keywords: Domain adaptation; Transfer learning; Adaptive inference; Resource efficiency
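The early-exit behaviour described in the abstract above (easier images stop at a cheap stage, harder ones continue to stages with more tokens) can be sketched as a simple control loop. This is an illustration of generic confidence-thresholded early exit, not the CAViT implementation: the stage interface, the `threshold` value, and the name `cascaded_predict` are all hypothetical.

```python
# Illustrative sketch only: the stage interface and threshold are assumptions.

def cascaded_predict(image, stages, threshold=0.9):
    """Run stages in order and stop at the first confident prediction.

    stages: non-empty list of callables, ordered from cheapest (fewest
            tokens) to most expensive; each maps an image to a
            (label, confidence) pair.
    Returns (label, index of the stage that produced it). If no stage is
    confident enough, the last stage's prediction is returned.
    """
    if not stages:
        raise ValueError("at least one stage is required")
    for index, stage in enumerate(stages):
        label, confidence = stage(image)
        if confidence >= threshold:
            break  # confident enough: skip the remaining, costlier stages
    return label, index
```

A usage example with stub stages standing in for real models:

```python
def cheap_stage(image):
    return ("cat", 0.95) if image == "easy" else ("cat", 0.40)

def costly_stage(image):
    return ("dog", 0.99)

print(cascaded_predict("easy", [cheap_stage, costly_stage]))
print(cascaded_predict("hard", [cheap_stage, costly_stage]))
```

The first call exits at stage 0, while the second falls through to stage 1. Average cost then depends on how many inputs exit early.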

Address: 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021