ISBN (Print): 9789819785070; 9789819785087
Learning-based point cloud registration methods can handle clean point clouds well, but they still struggle to generalize to noisy, partial, and density-varying point clouds. To this end, we propose a novel point cloud registration framework for these imperfect point clouds. By introducing a neural implicit representation, we replace the problem of rigid registration between point clouds with a registration problem between the point cloud and the neural implicit function. We then propose to alternately optimize the implicit function and the registration between the implicit function and the point cloud. In this way, point cloud registration can be performed in a coarse-to-fine manner. By fully capitalizing on the capabilities of the neural implicit function without computing point correspondences, our method shows remarkable robustness to noise, incompleteness, and density changes in point clouds.
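As a rough illustration of the alternating scheme this abstract describes, the hedged sketch below fits a small implicit distance network to the target cloud and then optimizes a rigid transform that pulls the source cloud onto its zero level set. The network size, distance supervision, and optimization schedule are illustrative assumptions, not the authors' actual implementation.

```python
import torch
import torch.nn as nn

class ImplicitSDF(nn.Module):
    """Small MLP mapping a 3D point to an (unsigned) distance value."""
    def __init__(self, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(3, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, x):
        return self.net(x).squeeze(-1)

def skew(w):
    # Skew-symmetric matrix of a 3-vector, used in the SO(3) exponential map.
    zero = torch.zeros((), dtype=w.dtype, device=w.device)
    return torch.stack([
        torch.stack([zero, -w[2],  w[1]]),
        torch.stack([w[2],  zero, -w[0]]),
        torch.stack([-w[1], w[0],  zero]),
    ])

def register(source, target, rounds=5, fit_iters=200, reg_iters=200):
    """Alternately (1) fit the implicit function to the target cloud and
    (2) optimize a rigid transform pulling the source onto its zero level set."""
    sdf = ImplicitSDF()
    w = torch.zeros(3, requires_grad=True)  # rotation (axis-angle)
    t = torch.zeros(3, requires_grad=True)  # translation
    opt_f = torch.optim.Adam(sdf.parameters(), lr=1e-3)
    opt_T = torch.optim.Adam([w, t], lr=1e-2)
    for _ in range(rounds):
        # Step 1: supervise the network with approximate distances to the target cloud.
        for _ in range(fit_iters):
            opt_f.zero_grad()
            q = torch.rand(512, 3) * 2 - 1                 # random queries in [-1, 1]^3
            d = torch.cdist(q, target).min(dim=1).values   # distance to nearest target point
            loss_f = (sdf(q) - d).pow(2).mean() + sdf(target).abs().mean()
            loss_f.backward()
            opt_f.step()
        # Step 2: move the transformed source onto the zero level set of the frozen field.
        for _ in range(reg_iters):
            opt_T.zero_grad()
            R = torch.linalg.matrix_exp(skew(w))
            loss_T = sdf(source @ R.T + t).abs().mean()
            loss_T.backward()
            opt_T.step()
    return torch.linalg.matrix_exp(skew(w)).detach(), t.detach()

if __name__ == "__main__":
    target = torch.rand(1024, 3) * 1.6 - 0.8   # toy target cloud in [-0.8, 0.8]^3
    source = target + 0.2                      # toy misaligned source
    R, t = register(source, target)
    print(R, t)
```

Because the transform is re-estimated after every refinement of the implicit field, each round effectively registers against a progressively better surface, which is one way to realize the coarse-to-fine behavior described above.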
ISBN (Print): 9789819785100; 9789819785117
Modern Large Visual Language Models (LVLMs) can transfer the powerful abilities of Large Language Models (LLMs) to visual domains by combining LLMs with a pre-trained visual encoder, and can also leverage the in-context learning that originates from LLMs to achieve remarkable performance on the Text-based Visual Question Answering (TextVQA) task. However, the alignment process between vision and language requires a significant amount of training resources. This study introduces SETS (short for Show Exemplars and Tell me what you See), a straightforward yet effective in-context learning framework for TextVQA. SETS consists of two components: an LLM for reasoning and decision-making, and a set of external tools that extract visual entities in scene images, including scene text and objects, to assist the LLM. More specifically, SETS selects visual entities relevant to questions, constructs their spatial relationships, and customizes task-specific instructions. Furthermore, given these instructions, a two-round inference strategy is applied to automatically choose the final predicted answer. Extensive experiments on three widely used TextVQA datasets demonstrate that SETS enables frozen LLMs such as Vicuna and LLaMA2 to achieve superior performance compared with their LVLM counterparts.
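A minimal sketch of the two-round inference flow described above: visual entities extracted by external tools (scene text, objects) are serialized into a prompt, a first LLM pass proposes candidate answers, and a second pass commits to a final answer. The `llm_generate` callable is a hypothetical stand-in for whichever frozen LLM is used, and the prompt wording is illustrative, not the paper's exact template.

```python
from typing import Callable, Dict, List

def serialize_entities(entities: List[Dict]) -> str:
    # Each entity: {"label": str, "box": (x1, y1, x2, y2)} from OCR / object detection tools.
    lines = []
    for e in entities:
        x1, y1, x2, y2 = e["box"]
        lines.append(f'- "{e["label"]}" at ({x1}, {y1}, {x2}, {y2})')
    return "\n".join(lines)

def two_round_answer(question: str,
                     entities: List[Dict],
                     exemplars: str,
                     llm_generate: Callable[[str], str]) -> str:
    context = serialize_entities(entities)
    # Round 1: reason over the serialized entities and propose candidate answers.
    prompt1 = (f"{exemplars}\n\nScene entities:\n{context}\n\n"
               f"Question: {question}\nList up to 3 candidate answers, one per line.")
    candidates = llm_generate(prompt1)
    # Round 2: ask the model to choose a single final answer from its own candidates.
    prompt2 = (f"Scene entities:\n{context}\n\nQuestion: {question}\n"
               f"Candidate answers:\n{candidates}\n"
               f"Reply with only the single best answer.")
    return llm_generate(prompt2).strip()
```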
ISBN (Print): 9789819788576; 9789819788583
Head detection is a challenging and widely applied object detection task. Although previous CNN-based head detectors have made good progress, the inherent locality of CNNs restricts the extraction of global contextual information, which leads to low precision and recall in head detection. In this article, we propose an end-to-end high-quality head detector based on the Transformer, which effectively models the contextual relationships between heads, other objects, and the background. To extract and generate discriminative feature maps suitable for detecting small head targets, we incorporate specific CNN-based auxiliary detector heads for joint training. The GIoU-aware classification loss function is improved to generate bounding boxes with high localization quality and high classification confidence, and a feature fusion module is introduced to enhance the feature representation capabilities of the model. We conduct experiments on the COCO 2017 dataset and the Brainwash head dataset, and the results demonstrate that our method outperforms previous CNN-based detectors as well as other current mainstream Transformer-based object detection models on both COCO general object detection and Brainwash head detection.
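For intuition about what a GIoU-aware classification loss can look like, here is a hedged sketch in which the positive-class target is scaled by the GIoU between a prediction and its matched ground-truth box, so well-localized boxes are pushed toward higher classification confidence. This is a soft-target formulation in the spirit of the abstract; the authors' exact weighting is not specified here.

```python
import torch
import torch.nn.functional as F
from torchvision.ops import generalized_box_iou

def giou_aware_cls_loss(cls_logits, pred_boxes, gt_boxes, gt_labels):
    """
    cls_logits: (N, num_classes) raw logits for N matched predictions
    pred_boxes, gt_boxes: (N, 4) boxes in (x1, y1, x2, y2) format
    gt_labels: (N,) class indices of the matched ground truths
    """
    # Element-wise GIoU for matched pairs, mapped from [-1, 1] to [0, 1].
    giou = generalized_box_iou(pred_boxes, gt_boxes).diagonal()
    quality = (giou + 1.0) / 2.0
    # Soft one-hot targets whose positive entry equals the localization quality.
    targets = torch.zeros_like(cls_logits)
    targets[torch.arange(len(gt_labels)), gt_labels] = quality.detach()
    return F.binary_cross_entropy_with_logits(cls_logits, targets, reduction="mean")
```

The design intent is simply to couple the two objectives named in the abstract: a box can only obtain a high classification score if it is also well localized.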
ISBN (Print): 9789819784950; 9789819784967
Deep neural networks have displayed promising performance in various fields, including biometrics, medical image processing and analysis, as well as dental healthcare. However, deep learning solutions have not yet become the norm in routine dental practice. This is mainly due to the scarcity of dental datasets. To address this challenge, we have built a dataset called the Quadruple Dental X-ray Panoramic (Quad-DXP) Dataset, specifically targeted at the recognition of dental disease and treatment. This dataset annotates nine types of dental issues (disease or treatment) and is the dental panorama dataset with the most abundant annotation types to date. We further propose a framework for identifying dental pathological issues on panoramic radiographs. This framework takes a panoramic X-ray image as input, feeds it into a series of neural network modules, and then produces recognition results for dental disease/treatment and enumeration detection. We have achieved satisfactory experimental results under the supervision of dentists and experts, which proves the effectiveness and reliability of our framework in dental diagnosis. This work can assist dentists in formulating treatment plans and improving dental healthcare.
ISBN (Print): 9789819784981; 9789819784998
Face Forgery Detection (FFD) plays a pivotal role in preserving privacy and bolstering information security by identifying counterfeit face images sourced from the internet. However, FFD encounters a significant challenge in terms of its limited capacity to generalize across diverse datasets due to the striking similarities between genuine and forged images. To tackle this issue, this paper introduces a novel approach known as Multi-level Distributional Discrepancy Enhancement (MDDE). The primary objective of MDDE is to discern variations in the distribution patterns of real and fake data at multiple levels of latent representation. To further enhance its generalization capability, we incorporate a deformable convolution module that extracts intricate features from genuine images. The integration of this module equips MDDE with the ability to generalize to a broader range of samples. Extensive experiments conducted on diverse datasets verify the efficacy of our proposed method and its superior performance compared to several state-of-the-art techniques.
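The deformable convolution module mentioned above can be sketched with the standard torchvision operator; the channel widths and the placement of the offset predictor below are illustrative assumptions rather than the authors' exact design.

```python
import torch
import torch.nn as nn
from torchvision.ops import DeformConv2d

class DeformableBlock(nn.Module):
    def __init__(self, in_ch, out_ch, k=3):
        super().__init__()
        # A plain conv predicts per-location sampling offsets (2 values per kernel tap).
        self.offset_pred = nn.Conv2d(in_ch, 2 * k * k, kernel_size=k, padding=k // 2)
        self.deform_conv = DeformConv2d(in_ch, out_ch, kernel_size=k, padding=k // 2)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        offsets = self.offset_pred(x)
        return self.act(self.deform_conv(x, offsets))

if __name__ == "__main__":
    feat = torch.randn(2, 64, 56, 56)   # toy feature map
    block = DeformableBlock(64, 128)
    print(block(feat).shape)            # torch.Size([2, 128, 56, 56])
```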
ISBN (Print): 9798350376371; 9798350376364
Passive acoustic monitoring is essential for monitoring cetaceans in their natural habitats. In this paper, we consider the monitoring of fin whales (Balaenoptera physalus). The deployment of automated tools for this purpose is essential to efficiently handle and analyse the vast amounts of data generated by hundreds of hours of recordings, something humans cannot do. We present two automated detection methods: one based on a convolutional neural network (CNN) classifier and one based on a circle detection technique. Both use spectrograms of the recordings as input, converting the sounds into images. The first method consists of a two-stage R-CNN classifier with 26 layers. The second method is an image-based technique that uses classical computer vision algorithms based on the morphology of the pulses. Both approaches demonstrate good performance, with circle detection showing better results even though it is the simpler method. The results obtained on a large dataset demonstrate that the proposed approach is highly effective in detecting and characterising animals in their habitats, thus offering valuable information to identify seasonal patterns.
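To make the classical image-based pipeline concrete, the sketch below converts a recording into a spectrogram image and searches it for circular pulse-like blobs with a Hough circle transform. The sample rate, frequency band, smoothing, and Hough parameters are illustrative assumptions that would need tuning on real fin whale recordings, and the paper's own circle detection technique may differ.

```python
import numpy as np
import cv2
from scipy.signal import spectrogram

def detect_pulses(audio: np.ndarray, fs: int = 250):
    # Fin whale 20 Hz pulses sit at low frequencies, so a low sample rate suffices.
    f, t, Sxx = spectrogram(audio, fs=fs, nperseg=256, noverlap=192)
    # Convert the spectrogram to an 8-bit image (log scale, normalized to 0..255).
    img = 10 * np.log10(Sxx + 1e-12)
    img = cv2.normalize(img, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)
    img = cv2.GaussianBlur(img, (5, 5), 0)
    circles = cv2.HoughCircles(img, cv2.HOUGH_GRADIENT, dp=1, minDist=10,
                               param1=100, param2=20, minRadius=2, maxRadius=15)
    return [] if circles is None else circles[0]

if __name__ == "__main__":
    noise = np.random.randn(250 * 60).astype(np.float32)  # one minute of toy audio
    print(len(detect_pulses(noise)), "candidate pulse detections")
```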
ISBN (Print): 9789819784929; 9789819784936
Salient object detection (SOD) models based on the UNet or FCN structure have reached a significant milestone, and the addition of edge constraints to SOD models has progressively become common practice in current methods. Although these methods produce excellent results, they still lack sufficient confidence in regions with sharp object edges owing to sample imbalance. In addition, compressing the encoded features to lower dimensions to reduce computational cost, a commonly used practice, unavoidably diminishes the model's precision. To overcome these issues, we propose a feature mutual feedback network (FMFNet) for the SOD task, in which a semantic supplement module (SSM) integrates diverse feature information through different receptive fields to preserve important features. In addition, we provide a novel details map, which can better serve as an edge map to help the model learn the hard edge regions, resulting in more complete saliency maps. Multiple experiments on five benchmark datasets indicate the effectiveness, robustness, and superiority of the proposed model and details map.
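One common way to integrate feature information across different receptive fields, the general idea behind the semantic supplement module, is a set of parallel dilated convolutions whose outputs are concatenated and fused. The layout below is an assumption for illustration, not the paper's exact SSM architecture.

```python
import torch
import torch.nn as nn

class MultiReceptiveField(nn.Module):
    def __init__(self, in_ch, out_ch, dilations=(1, 2, 4)):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(in_ch, out_ch, 3, padding=d, dilation=d, bias=False),
                nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True))
            for d in dilations
        ])
        self.fuse = nn.Conv2d(out_ch * len(dilations), out_ch, kernel_size=1)

    def forward(self, x):
        # Each branch sees the same input at a different receptive field, then fuse.
        return self.fuse(torch.cat([b(x) for b in self.branches], dim=1))

if __name__ == "__main__":
    print(MultiReceptiveField(256, 64)(torch.randn(1, 256, 32, 32)).shape)
```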
ISBN (Print): 9789819785070; 9789819785087
Existing generalizable object pose estimation frameworks utilize a set of reference images to predict the complete pose of the target object in a query scene, which does not require textured CAD models to generate training data and can handle unseen novel objects during inference. However, current methods suffer from insufficient discriminative capability due to the template matching strategy. Both potential distractors and negative samples with similar appearance can be confused with the foreground, which limits performance on precise pose estimation. To address these problems, we propose a novel method called ESD-Pose to enhance the discrimination capacity of the framework. Specifically, a semantic interaction aware (SIA) module is introduced to seek semantic consistency among reference images and discrepancies between reference-query pairs. This module mitigates problems related to model deception caused by distractors. To deal with slender objects robustly, we propose a dynamic scale weight learner that generates adaptive weights for multi-scale feature fusion, enabling reasonable utilization of semantic information at different levels. Finally, an IoU-guided loss is designed to align localization and scale prediction, thus facilitating accurate pose estimation. Comprehensive experiments on the LINEMOD and GenMOP datasets demonstrate that ESD-Pose outperforms existing advanced methods, further validating the effectiveness of our method.
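A dynamic scale weight learner of the kind described above can be sketched as a small network that predicts per-scale fusion weights from globally pooled features. The pooling and MLP design below are illustrative assumptions; the paper's exact learner may differ.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DynamicScaleFusion(nn.Module):
    def __init__(self, channels, num_scales):
        super().__init__()
        self.weight_mlp = nn.Sequential(
            nn.Linear(channels * num_scales, channels), nn.ReLU(inplace=True),
            nn.Linear(channels, num_scales))

    def forward(self, feats):
        # feats: list of (B, C, Hi, Wi) maps; resize all to the finest resolution.
        h, w = feats[0].shape[-2:]
        feats = [F.interpolate(f, size=(h, w), mode="bilinear", align_corners=False)
                 for f in feats]
        pooled = torch.cat([f.mean(dim=(2, 3)) for f in feats], dim=1)  # (B, C*S)
        weights = torch.softmax(self.weight_mlp(pooled), dim=1)         # (B, S)
        stacked = torch.stack(feats, dim=1)                             # (B, S, C, H, W)
        return (weights[:, :, None, None, None] * stacked).sum(dim=1)

if __name__ == "__main__":
    f1, f2, f3 = (torch.randn(2, 64, s, s) for s in (64, 32, 16))
    print(DynamicScaleFusion(64, 3)([f1, f2, f3]).shape)  # torch.Size([2, 64, 64, 64])
```

Because the weights depend on the input features, objects with unusual aspect ratios (e.g. slender objects) can emphasize whichever scale carries the most useful semantic information.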
ISBN (Print): 9789819785049; 9789819785056
In the domain of computer vision, Transformers have shown great promise, yet they face difficulties when trained from scratch on small datasets, often underperforming compared to convolutional neural networks (ConvNets). Our work highlights that Vision Transformers (ViTs) suffer from unfocused attention when trained on limited datasets. This insight has catalyzed the development of our Swelling ViT framework, an adaptive training strategy that initializes the ViT with a local attention window and allows it to expand gradually during training. This approach enables the model to learn local features more easily, thereby mitigating the attention dispersion phenomenon. Our empirical evaluation of Swelling ViT-B on the CIFAR-100 dataset has yielded remarkable results, achieving an accuracy of 82.60% after 300 epochs from scratch and further improving to 83.31% with 900 epochs of training. These outcomes not only signify state-of-the-art performance but also underscore Swelling ViT's capability to effectively address the attention dispersion issue, particularly on small datasets. Moreover, the robustness of our Swelling ViT is affirmed by its consistent performance on the extensive ImageNet dataset, confirming that the strategy does not compromise effectiveness when scaled to larger data regimes. This work, therefore, not only bridges the gap in data efficiency for ViT models but also introduces a versatile solution that can be readily adapted to various domains, regardless of data availability.
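The core idea of an expanding ("swelling") local attention window can be illustrated with a mask that restricts attention to nearby tokens and grows with training progress. The 1D token-distance windowing, the linear growth schedule, and the omission of special handling for the class token are simplifying assumptions, not the paper's exact configuration.

```python
import torch

def local_window_mask(num_tokens: int, window: int) -> torch.Tensor:
    # True where attention is allowed: |i - j| <= window.
    idx = torch.arange(num_tokens)
    return (idx[None, :] - idx[:, None]).abs() <= window

def swelling_window(epoch: int, total_epochs: int, min_win: int = 2, max_win: int = 196) -> int:
    # Linearly expand the window from min_win to max_win over training.
    frac = min(epoch / max(total_epochs - 1, 1), 1.0)
    return int(min_win + frac * (max_win - min_win))

def masked_attention(q, k, v, mask):
    # q, k, v: (B, heads, N, d); mask: (N, N) boolean, True = attend.
    scores = q @ k.transpose(-2, -1) / q.shape[-1] ** 0.5
    scores = scores.masked_fill(~mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

if __name__ == "__main__":
    q = k = v = torch.randn(1, 4, 197, 64)          # ViT-like token sequence
    mask = local_window_mask(197, swelling_window(epoch=0, total_epochs=300))
    print(masked_attention(q, k, v, mask).shape)    # torch.Size([1, 4, 197, 64])
```

Early in training the mask forces attention onto local neighborhoods; by the end the window covers all tokens, recovering standard global self-attention.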
ISBN (Print): 9789819787913; 9789819787920
In the field of autonomous driving, profound scene understanding is crucial, and semantic segmentation of LiDAR point clouds plays a key role in this context. A prevalent issue in point cloud datasets is the imbalance in class distribution. To address this, we introduce the InstanceAug data augmentation pipeline, which balances the class distribution by duplicating instances within scenes. This approach significantly enhances the robustness of our model. Deep learning models for point cloud processing often use sparse convolution for efficiency, but this limits feature transmission and the receptive field. Building on the strengthened dataset, we present KA-Seg, an innovative attention-based framework. KA-Seg refines sparse voxel features to further enhance robustness. Its core feature is an attention mechanism with super-voxel partitioning and key point subsampling, which greatly improves the model's ability to identify complex spatial patterns and focus on important voxel regions. Inspired by the Transformer architecture, KA-Seg utilizes learnable key point sampling for global feature querying, expanding the model's spatial understanding. This method augments spatial information processing across the point cloud and achieves a 1.3% higher mean intersection over union (mIoU) on the test set compared to the baseline model. Our code is publicly available at https://***/cvkdnk/kaseg.
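The class-balancing idea behind InstanceAug, duplicating under-represented instances within a scene, can be sketched as follows. Which classes count as rare, the number of copies, and the translation range are illustrative assumptions; the released pipeline likely handles instance boundaries and collisions more carefully.

```python
import numpy as np

def instance_aug(points: np.ndarray, labels: np.ndarray,
                 rare_classes=(2, 5), copies: int = 1, shift: float = 5.0,
                 rng=np.random.default_rng()):
    """points: (N, 3+) array, labels: (N,) per-point semantic labels."""
    new_pts, new_lbls = [points], [labels]
    for cls in rare_classes:
        mask = labels == cls
        if not mask.any():
            continue
        for _ in range(copies):
            dup = points[mask].copy()
            dup[:, :2] += rng.uniform(-shift, shift, size=2)  # random planar translation
            new_pts.append(dup)
            new_lbls.append(labels[mask])
    return np.concatenate(new_pts), np.concatenate(new_lbls)

if __name__ == "__main__":
    pts = np.random.rand(1000, 3) * 50
    lbl = np.random.randint(0, 10, size=1000)
    aug_pts, aug_lbl = instance_aug(pts, lbl)
    print(aug_pts.shape, np.bincount(aug_lbl))
```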