ISBN (print): 9798350353006
Recent advancements in multimodal pre-training have shown promising efficacy in 3D representation learning by aligning multimodal features across 3D shapes, their 2D counterparts, and language descriptions. However, the methods used by existing frameworks to curate such multimodal data, in particular language descriptions for 3D shapes, are not scalable, and the collected language descriptions are not diverse. To address this, we introduce ULIP-2, a simple yet effective tri-modal pre-training framework that leverages large multimodal models to automatically generate holistic language descriptions for 3D shapes. It only needs 3D data as input, eliminating the need for any manual 3D annotations, and is therefore scalable to large datasets. ULIP-2 is also equipped with scaled-up backbones for better multi-modal representation learning. We conduct experiments on two large-scale 3D datasets, Objaverse and ShapeNet, and augment them with tri-modal datasets of 3D point clouds, images, and language for training ULIP-2. Experiments show that ULIP-2 demonstrates substantial benefits in three downstream tasks: zero-shot 3D classification, standard 3D classification with fine-tuning, and 3D captioning (3D-to-language generation). It achieves a new SOTA of 50.6% (top-1) on Objaverse-LVIS and 84.7% (top-1) on ModelNet40 in zero-shot classification. In the ScanObjectNN benchmark for standard fine-tuning, ULIP-2 reaches an overall accuracy of 91.5% with a compact model of only 1.4 million parameters. ULIP-2 sheds light on a new paradigm for scalable multimodal 3D representation learning without human annotations and shows significant improvements over existing baselines. The code and datasets are released at https://***/salesforce/ULIP.
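For orientation, the core alignment step in this kind of tri-modal pre-training can be sketched as a pair of CLIP-style contrastive losses that pull a point-cloud embedding toward the embeddings of its rendered image and its automatically generated caption. The function names, temperature value, and equal weighting of the two terms below are illustrative assumptions rather than the exact ULIP-2 recipe.

import torch
import torch.nn.functional as F

def contrastive_loss(a, b, temperature=0.07):
    # Symmetric InfoNCE: matching pairs lie on the diagonal of the
    # similarity matrix between the two L2-normalized embedding batches.
    a = F.normalize(a, dim=-1)
    b = F.normalize(b, dim=-1)
    logits = a @ b.t() / temperature
    targets = torch.arange(a.size(0), device=a.device)
    return 0.5 * (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets))

def trimodal_alignment_loss(pc_emb, img_emb, txt_emb):
    # Align the 3D encoder's output to both the image and caption embeddings;
    # the 2D and text encoders are typically kept frozen during pre-training.
    return contrastive_loss(pc_emb, img_emb) + contrastive_loss(pc_emb, txt_emb)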
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Single image denoising (SID) has achieved significant breakthroughs with the development of deep learning. However, the proposed methods often come with a large number of parameters, which greatly limits their application scenarios. Instead of blindly increasing network depth as in previous works, we explore the degradation mechanism of the noisy image and propose a lightweight Multiple Degradation and Reconstruction Network (MDRN) to progressively remove noise. Meanwhile, we propose two novel Heterogeneous Knowledge Distillation Strategies (HMDS) that enable MDRN to learn richer and more accurate features from heterogeneous models, making it possible to reconstruct higher-quality denoised images under extreme conditions. Extensive experiments show that MDRN achieves favorable performance against other SID models with fewer parameters. Moreover, extensive ablation studies demonstrate that the introduced HMDS improve the performance of tiny models and of models operating under high noise levels, which is extremely useful for related applications.
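The abstract does not spell out the distillation objectives, but an output-level sketch of heterogeneous knowledge distillation for a tiny denoiser might look as follows; the loss choices, the alpha weight, and the function name are assumptions for illustration only.

import torch.nn.functional as F

def heterogeneous_distillation_loss(student_out, teacher_out, clean_target, alpha=0.5):
    # Supervised denoising term against the clean target, plus a term that
    # pushes the lightweight student toward the prediction of a larger,
    # heterogeneous teacher model.
    reconstruction = F.l1_loss(student_out, clean_target)
    distillation = F.mse_loss(student_out, teacher_out.detach())
    return (1 - alpha) * reconstruction + alpha * distillation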
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Human trajectory forecasting is a key component of autonomous vehicles, social-aware robots and advanced video-surveillance applications. This challenging task typically requires knowledge about past motion, the environment and likely destination areas. In this context, multimodality is a fundamental aspect and its effective modeling can be beneficial to any architecture. Inferring accurate trajectories is nevertheless challenging, due to the inherently uncertain nature of the future. To overcome these difficulties, recent models use different inputs and propose to model human intentions using complex fusion mechanisms. In this respect, we propose a lightweight attention-based recurrent backbone that acts solely on past observed positions. Although this backbone already provides promising results, we demonstrate that its prediction accuracy can be improved considerably when combined with a scene-aware goal-estimation module. To this end, we employ a common goal module, based on a U-Net architecture, which additionally extracts semantic information to predict scene-compliant destinations. We conduct extensive experiments on publicly-available datasets (i.e. SDD, inD, ETH/UCY) and show that our approach performs on par with state-of-the-art techniques while reducing model complexity.
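As a rough illustration of goal-conditioned decoding (the paper's backbone is attention-based and its goal module is U-Net based, so the GRU, layer sizes, and class name below are simplifying assumptions), a recurrent predictor that consumes past positions and an estimated destination might be sketched like this.

import torch
import torch.nn as nn

class GoalConditionedPredictor(nn.Module):
    # Minimal sketch: encode the observed track, then roll out future
    # displacements while conditioning each step on the estimated goal.
    def __init__(self, hidden=64, pred_len=12):
        super().__init__()
        self.encoder = nn.GRU(input_size=2, hidden_size=hidden, batch_first=True)
        self.decoder = nn.GRU(input_size=4, hidden_size=hidden, batch_first=True)
        self.head = nn.Linear(hidden, 2)
        self.pred_len = pred_len

    def forward(self, past_xy, goal_xy):
        # past_xy: (B, T_obs, 2); goal_xy: (B, 2) from a scene-aware goal module
        _, h = self.encoder(past_xy)
        pos = past_xy[:, -1]
        preds = []
        for _ in range(self.pred_len):
            step_in = torch.cat([pos, goal_xy], dim=-1).unsqueeze(1)
            out, h = self.decoder(step_in, h)
            pos = pos + self.head(out[:, 0])  # predict a displacement per step
            preds.append(pos)
        return torch.stack(preds, dim=1)      # (B, pred_len, 2)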
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Facial expression recognition (FER) is a critical computer vision task for a variety of applications. Despite the widespread use of FER, there is a dearth of racially diverse facial emotion datasets enriched for children, teens, and adults. To bridge this gap, we have built a diverse expression recognition database using publicly available videos from TikTok, a video-focused social networking service. We describe the construction of the TikTok facial expression recognition (TikTokFER) database. The dataset is extracted from 6428 videos scraped from TikTok, featuring 9392 distinct individuals and labels for 15 emotion-related prompts. Using transfer learning, we achieve an F1 score of 0.78 on expression classification for the Ekman emotions. We hope that the scale and diversity of the TikTokFER dataset will be of use to affective computing practitioners.
ISBN (print): 9781665487399
Few-shot object detection (FSOD) seeks to detect novel categories with limited data by leveraging prior knowledge from abundant base data. Generalized few-shot object detection (G-FSOD) aims to tackle FSOD without forgetting previously seen base classes and thus accounts for a more realistic scenario in which both class types are encountered at test time. While current FSOD methods suffer from catastrophic forgetting, G-FSOD addresses this limitation yet exhibits a performance drop on the novel task compared to state-of-the-art FSOD. In this work, we propose a constraint-based finetuning approach (CFA) to alleviate catastrophic forgetting while achieving competitive results on the novel task without increasing the model capacity. CFA adapts a continual learning method, namely Average Gradient Episodic Memory (A-GEM), to G-FSOD. Specifically, we impose additional constraints on the gradient search strategy and derive a new gradient update rule, allowing for better knowledge exchange between base and novel classes. To evaluate our method, we conduct extensive experiments on the MS-COCO and PASCAL-VOC datasets. Our method outperforms current FSOD and G-FSOD approaches on the novel task with only minor degradation on the base task. Moreover, CFA is orthogonal to FSOD approaches and operates as a plug-and-play module without increasing model capacity or inference time.
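For context, the plain A-GEM rule that CFA builds on projects the fine-tuning gradient whenever it conflicts with a reference gradient computed on stored base-class examples; the additional constraints CFA derives are not detailed in the abstract, so the sketch below shows only the standard projection over flattened gradient vectors.

import torch

def agem_project(grad, grad_ref):
    # If the novel-task gradient has a negative dot product with the
    # base-task reference gradient, project it onto the half-space that
    # does not increase the base-task loss.
    dot = torch.dot(grad, grad_ref)
    if dot < 0:
        grad = grad - (dot / torch.dot(grad_ref, grad_ref)) * grad_ref
    return grad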
ISBN (print): 9781665487399
This paper summarizes the top contributions to the first semi-supervised hyperspectral object detection (SSHOD) challenge, which was organized as part of the Perception Beyond the Visible Spectrum (PBVS) 2022 workshop at the Computer Vision and Pattern Recognition (CVPR) conference. The challenge introduces a first-of-its-kind hyperspectral dataset with temporally contiguous frames collected from a university rooftop observing a 4-way vehicle intersection over a period of three days. The dataset contains a total of 2890 frames, captured at an average resolution of 1600 x 192 pixels, with 51 hyperspectral bands from 400nm to 900nm. The challenge uses 989 images as the training set, 605 images as the validation set, and 1296 images as the evaluation (test) set. Each set was acquired on a different day to maximize the variance in weather conditions. Labels are provided for 10% of the annotated data, formulating a semi-supervised learning task for the participants; performance is evaluated in terms of average precision over the entire set of classes as well as over the individual moving-object classes: vehicle, bus, and bike. The challenge received registrations from 38 individuals, with 8 participating in the validation phase and 3 in the test phase. This paper describes the dataset acquisition, the challenge formulation, the proposed methods, and qualitative and quantitative results.
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Emotion Recognition in Conversations (ERC) is crucial for developing sympathetic human-machine interaction. In conversational videos, emotion can be present in multiple modalities, i.e., audio, video, and transcript. However, due to the inherent characteristics of these modalities, multi-modal ERC has always been considered a challenging undertaking. Existing ERC research focuses mainly on using the text information in a discussion, ignoring the other two modalities. We anticipate that emotion recognition accuracy can be improved by employing a multi-modal approach. Thus, in this study, we propose a Multi-modal Fusion Network (M2FNet) that extracts emotion-relevant features from the visual, audio, and text modalities. It employs a multi-head attention-based fusion mechanism to combine emotion-rich latent representations of the input data. We introduce a new feature extractor, trained with a novel adaptive margin-based triplet loss function, to learn emotion-relevant latent features from the audio and visual data. In the domain of ERC, existing methods perform well on one benchmark dataset but not on others. Our results show that the proposed M2FNet architecture outperforms all other methods in terms of weighted average F1 score on the well-known MELD and IEMOCAP datasets and sets a new state-of-the-art performance in ERC.
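The abstract names an adaptive margin-based triplet loss without giving its exact form; one plausible reading, in which the margin grows with the anchor-positive similarity, is sketched below. The function name, margin schedule, and hyperparameters are assumptions.

import torch.nn.functional as F

def adaptive_margin_triplet_loss(anchor, positive, negative, base_margin=0.2, scale=0.5):
    # Enlarge the margin for pairs the model already considers similar,
    # pushing their negatives further away in the embedding space.
    d_ap = F.pairwise_distance(anchor, positive)
    d_an = F.pairwise_distance(anchor, negative)
    margin = base_margin + scale * F.cosine_similarity(anchor, positive).clamp(min=0)
    return F.relu(d_ap - d_an + margin).mean()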
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Image quality assessment (IQA), which aims to assess the perceptual quality of images, has been an essential problem in both human and machine vision. Recently, with the help of deep neural networks (DNNs), IQA algorithms can extract more informative differences between distorted and reference images than traditional algorithms, and DNN-based methods therefore perform more satisfactorily. However, existing DNN-based quality assessment methods become less accurate at rating preferences among distorted images when those images are very similar to each other or to the reference image. To tackle this problem, we propose a focused feature differentiation network (FFDN) that highlights the feature maps with the greatest differentiation between the distorted and reference images. Furthermore, we use a multi-scale feature fusion module to fuse the focused differentiation features across receptive fields of different scales. To further improve accuracy, we predict the mean opinion score and the differentiation score in separate stages and combine them with self-learned weights. Finally, we convert the weighted score into image preference degrees. Experimental results on the validation dataset of CLIC2022 and the test dataset of CLIC2021 show that our FFDN achieves higher accuracy than other strong quality assessment methods.
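One way to picture the focused feature differentiation idea is to keep only the channels whose distorted-versus-reference difference is largest; the selection rule, ratio, and function name below are illustrative assumptions rather than FFDN's actual module.

import torch

def focused_feature_differentiation(feat_dist, feat_ref, top_ratio=0.5):
    # Emphasize the feature channels that best separate the distorted image
    # from the reference, and zero out the rest.
    diff = (feat_dist - feat_ref).abs()              # (B, C, H, W)
    channel_energy = diff.mean(dim=(2, 3))           # (B, C)
    k = max(1, int(top_ratio * diff.size(1)))
    top_idx = channel_energy.topk(k, dim=1).indices  # (B, k)
    mask = torch.zeros_like(channel_energy).scatter_(1, top_idx, 1.0)
    return diff * mask[:, :, None, None]             # focused difference maps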
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Few-Shot Learning (FSL) aims to improve a model's generalization capability in low-data regimes. Recent FSL works have made steady progress via metric learning, meta learning, representation learning, etc. However, FSL remains challenging due to the following longstanding difficulties. 1) The seen and unseen classes are disjoint, resulting in a distribution shift between training and testing. 2) During testing, labeled data of previously unseen classes is sparse, making it difficult to reliably extrapolate from labeled support examples to unlabeled query examples. To tackle the first challenge, we introduce Hybrid Consistency Training to jointly leverage two types of consistency: 1) interpolation consistency, which interpolates hidden features to impose linear behavior locally, and 2) data augmentation consistency, which learns robust embeddings against sample variations. For the second challenge, we use unlabeled examples to iteratively normalize features and adapt prototypes, as opposed to the commonly used one-time update, for more reliable prototype-based transductive inference. We show that our method yields a 2% to 5% improvement over state-of-the-art methods with similar backbones on five FSL datasets and, more notably, a 7% to 8% improvement on the more challenging cross-domain FSL.
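A minimal sketch of the interpolation-consistency term, assuming a generic encoder/classifier split and a Beta-distributed mixing coefficient (both assumptions, since the paper's exact formulation is not given in the abstract):

import torch
import torch.nn.functional as F

def interpolation_consistency_loss(encoder, head, x1, x2, alpha=0.75):
    # The prediction on a mixup of two hidden features should match the same
    # mixup of the individual predictions, encouraging locally linear behavior.
    lam = torch.distributions.Beta(alpha, alpha).sample().to(x1.device)
    h1, h2 = encoder(x1), encoder(x2)
    p_mix = head(lam * h1 + (1 - lam) * h2)
    target = lam * head(h1) + (1 - lam) * head(h2)
    return F.mse_loss(p_mix, target.detach())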
ISBN (print): 9781665487399
The growing use of deep neural networks (DNNs) in safety- and security-critical areas like autonomous driving raises the need for their systematic testing. Coverage-guided testing (CGT) is an approach that applies mutation or fuzzing according to a predefined coverage metric to find inputs that cause misbehavior. With the introduction of neuron coverage metrics, CGT has recently also been applied to DNNs. In this work, we apply CGT to the task of person detection in crowded scenes. The proposed pipeline uses YOLOv3 for person detection and includes finding DNN bugs via sampling and mutation, followed by retraining the DNN on the updated training set. To count as a bug, a mutated image must cause a significant performance drop compared to the clean input; in accordance with CGT, we also consider increased coverage as an additional requirement in the bug definition. To explore several types of robustness, our approach includes natural image transformations, corruptions, and adversarial examples generated with the Daedalus attack. The proposed framework uncovered several thousand cases of incorrect DNN behavior. The relative change in mAP performance of the retrained models reached on average between 26.21% and 64.24% for the different robustness types. However, we found no evidence that the investigated coverage metrics can be advantageously used to improve robustness.
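The bug criterion can be stated compactly; the 30% relative drop threshold below is a hypothetical placeholder, since the abstract does not report the exact value used.

def is_bug(map_clean, map_mutated, coverage_clean, coverage_mutated,
           drop_threshold=0.3, require_coverage_gain=True):
    # A mutated image is flagged as a bug when detection quality drops
    # significantly relative to the clean input and, in line with
    # coverage-guided testing, neuron coverage optionally increases.
    significant_drop = (map_clean - map_mutated) >= drop_threshold * map_clean
    coverage_gain = coverage_mutated > coverage_clean
    return significant_drop and (coverage_gain or not require_coverage_gain)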