Search Results - Inner Mongolia University Library


Search query: Any field = "1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 1992"

6,449 records in total; showing records 161-170


Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024

ISBN: (print) 9798350353006

The proceedings contain 2715 papers. The topics discussed include: revisiting adversarial training at scale; SPIDeRS: structured polarization for invisible depth and reflectance sensing; MA-LMM: memory-augmented large multimodal model for long-term video understanding; geometrically-driven aggregation for zero-shot 3D point cloud understanding; TextCraftor: your text encoder can be image quality controller; ViLa-MIL: dual-scale vision-language multiple instance learning for whole slide image classification; HumanNorm: learning normal diffusion model for high-quality and realistic 3D human generation; an empirical study of scaling law for scene text recognition; improving image restoration through removing degradations in textual representations; and steganographic passport: an owner and user verifiable credential for deep model IP protection without retraining.

The Casual Conversations v2 Dataset: A diverse, large benchmark for measuring fairness and robustness in audio/vision/speech models

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Porgali, Bilal; Albiero, Vitor; Ryda, Jordan; Ferrer, Cristian Canton; Hazirbas, Caner

Affiliations: Meta AI, Menlo Pk, CA 94025, USA

ISBN: (print) 9798350302493

This paper introduces a new large consent-driven dataset aimed at assisting in the evaluation of algorithmic bias and robustness of computer vision and audio speech models with regard to 11 attributes that are self-provided or labeled by trained annotators. The dataset includes 26,467 videos of 5,567 unique paid participants, with an average of almost 5 videos per person, recorded in Brazil, India, Indonesia, Mexico, Vietnam, Philippines, and the USA, representing diverse demographic characteristics. The participants agreed for their data to be used in assessing fairness of AI models and provided self-reported age, gender, language/dialect, disability status, physical adornments, physical attributes and geo-location information, while trained annotators labeled apparent skin tone using the Fitzpatrick Skin Type and Monk Skin Tone scales, and voice timbre. Annotators also labeled the recording setups and provided per-second activity annotations.

Keywords: Large dataset

High-level context representation for emotion recognition in images

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Costa, Willams de Lima; Martinez, Estefania Talavera; Figueiredo, Lucas Silva; Teichrieb, Veronica

Affiliations: Univ Fed Pernambuco, Voxar Labs, Ctr Informat, Ave Jorn Anibal Fernandes, Recife, PE, Brazil; Univ Twente, Drienerlolaan 5, NL-7522 NB Enschede, Netherlands; Univ Fed Rural Pernambuco, Unidade Acad Belo Jardim, PE-166100 Belo Jardim, Brazil

ISBN: (print) 9798350302493

Emotion recognition is the task of classifying perceived emotions in people. Previous works have utilized various nonverbal cues to extract features from images and correlate them to emotions. Of these cues, situational context is particularly crucial in emotion perception since it can directly influence the emotion of a person. In this paper, we propose an approach for high-level context representation extraction from images. The model relies on a single cue and a single encoding stream to correlate this representation with emotions. Our model competes with the state-of-the-art, achieving an mAP of 0.3002 on the EMOTIC dataset while also being capable of execution on consumer-grade hardware at approximately 90 frames per second. Overall, our approach is more efficient than previous models and can be easily deployed to address real-world problems related to emotion recognition.

Keywords: Emotion recognition

A Light Touch Approach to Teaching Transformers Multi-view Geometry

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Bhalgat, Yash; Henriques, Joao F.; Zisserman, Andrew

Affiliations: Univ Oxford, Visual Geometry Grp, Oxford, England

ISBN: (print) 9798350301298

Transformers are powerful visual learners, in large part due to their conspicuous lack of manually-specified priors. This flexibility can be problematic in tasks that involve multiple-view geometry, due to the near-infinite possible variations in 3D shapes and viewpoints (requiring flexibility), and the precise nature of projective geometry (obeying rigid laws). To resolve this conundrum, we propose a "light touch" approach, guiding visual Transformers to learn multiple-view geometry but allowing them to break free when needed. We achieve this by using epipolar lines to guide the Transformer's cross-attention maps during training, penalizing attention values outside the epipolar lines and encouraging higher attention along these lines since they contain geometrically plausible matches. Unlike previous methods, our proposal does not require any camera pose information at test-time. We focus on pose-invariant object instance retrieval, where standard Transformer networks struggle, due to the large differences in viewpoint between query and retrieved images. Experimentally, our method outperforms state-of-the-art approaches at object retrieval, without needing pose information at test-time.

Keywords: detection recognition: Categorization retrieval

Difficulty Estimation with Action Scores for Computer Vision Tasks

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Arriaga, Octavio; Palacio, Sebastian; Valdenegro-Toro, Matias

Affiliations: Univ Bremen, Bremen, Germany; German Res Ctr Artificial Intelligence, Kaiserslautern, Germany; Univ Groningen, Groningen, Netherlands

ISBN: (print) 9798350302493

As more machine learning models are now being applied in real-world scenarios, it has become crucial to evaluate their difficulties and biases. In this paper we present an unsupervised method for calculating a difficulty score based on the accumulated loss per epoch. Our proposed method does not require any modification to the model, nor any external supervision, and it can be easily applied to a wide range of machine learning tasks. We provide results for the tasks of image classification, image segmentation, and object detection. We compare our score against similar metrics and provide theoretical and empirical evidence of their difference. Furthermore, we show applications of our proposed score for detecting incorrect labels and testing for possible biases.

Keywords: Object detection

AI-Synthesized Voice Detection Using Neural Vocoder Artifacts

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Sun, Chengzhe; Jia, Shan; Hou, Shuwei; Lyu, Siwei

Affiliations: SUNY Buffalo, Buffalo, NY 14260, USA

ISBN: (print) 9798350302493

Advancements in AI-synthesized human voices have created a growing threat of impersonation and disinformation, making it crucial to develop methods to detect synthetic human voices. This study proposes a new approach to identifying synthetic human voices by detecting artifacts of vocoders in audio signals. Most DeepFake audio synthesis models use a neural vocoder, a neural network that generates waveforms from temporal-frequency representations like mel-spectrograms. By identifying neural vocoder processing in audio, we can determine if a sample is synthesized. To detect synthetic human voices, we introduce a multi-task learning framework for a binary-class RawNet2 model that shares the feature extractor with a vocoder identification module. By treating vocoder identification as a pretext task, we constrain the feature extractor to focus on vocoder artifacts and provide discriminative features for the final binary classifier. Our experiments show that the improved RawNet2 model based on vocoder identification achieves high classification performance on the binary task overall. Code and data can be found at https://github.com/csun22/SyntheticVoice-Detection-Vocoder-Artifacts.

Keywords: Speech recognition

Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Ren, Bin; Liu, Yahui; Song, Yue; Bi, Wei; Cucchiara, Rita; Sebe, Nicu; Wang, Wei

Affiliations: Univ Pisa, Pisa, Italy; Univ Trento, Trento, TN, Italy; Tencent AI Lab, Shenzhen, Peoples R China; Beijing Jiaotong Univ, Beijing, Peoples R China; Univ Modena & Reggio Emilia, Modena, Italy

ISBN: (print) 9798350301298

Position Embeddings (PEs), an arguably indispensable component in Vision Transformers (ViTs), have been shown to improve the performance of ViTs on many vision tasks. However, PEs have a potentially high risk of privacy leakage since the spatial information of the input patches is exposed. This caveat naturally raises a series of interesting questions about the impact of PEs on accuracy, privacy, prediction consistency, etc. To tackle these issues, we propose a Masked Jigsaw Puzzle (MJP) position embedding method. In particular, MJP first shuffles the selected patches via our block-wise random jigsaw puzzle shuffle algorithm, and their corresponding PEs are occluded. Meanwhile, for the non-occluded patches, the PEs remain the original ones but their spatial relation is strengthened via our dense absolute localization regressor. The experimental results reveal that 1) PEs explicitly encode the 2D spatial relationship and lead to severe privacy leakage problems under gradient inversion attack; 2) training ViTs with the naively shuffled patches can alleviate the problem, but it harms the accuracy; 3) under a certain shuffle ratio, the proposed MJP not only boosts the performance and robustness on large-scale datasets (i.e., ImageNet-1K and ImageNet-C, -A/O) but also improves the privacy preservation ability under typical gradient attacks by a large margin. The source code and trained models are available at https://***/yhlleo/MJP.

Keywords: Deep learning architectures and techniques

Quantifying Extrinsic Curvature in Neural Manifolds

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Acosta, Francisco; Sanborn, Sophia; Duc, Khanh Dao; Madhav, Manu; Miolane, Nina

Affiliations: UC Santa Barbara, Phys, Santa Barbara, CA 93106, USA; UC Santa Barbara, Elect & Comp Engn, Santa Barbara, CA, USA; UC Santa Barbara, Math, Santa Barbara, CA, USA; UC Santa Barbara, Santa Barbara, CA, USA

ISBN: (print) 9798350302493

The neural manifold hypothesis postulates that the activity of a neural population forms a low-dimensional manifold whose structure reflects that of the encoded task variables. In this work, we combine topological deep generative models and extrinsic Riemannian geometry to introduce a novel approach for studying the structure of neural manifolds. This approach (i) computes an explicit parameterization of the manifolds and (ii) estimates their local extrinsic curvature, hence quantifying their shape within the neural state space. Importantly, we prove that our methodology is invariant with respect to transformations that do not bear meaningful neuroscience information, such as permutation of the order in which neurons are recorded. We show empirically that we correctly estimate the geometry of synthetic manifolds generated from smooth deformations of circles, spheres, and tori, using realistic noise levels. We additionally validate our methodology on simulated and real neural data, and show that we recover geometric structure known to exist in hippocampal place cells. We expect this approach to open new avenues of inquiry into geometric neural correlates of perception and behavior.

Keywords: computer vision

Gradient Attention Balance Network: Mitigating Face Recognition Racial Bias via Gradient Attention

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Huang, Linzhi; Wang, Mei; Liang, Jiahao; Deng, Weihong; Shi, Hongzhi; Wen, Dongchao; Zhang, Yingjie; Zhao, Jian

Affiliations: Inspur Elect Informat Ind Co Ltd, Jinan, Peoples R China; Shandong Mass Informat Technol Res Inst, Qingdao, Peoples R China

ISBN: (print) 9798350302493

Although face recognition has made impressive progress in recent years, we ignore the racial bias of the recognition system when we pursue a high level of accuracy. Previous work found that for different races, face recognition networks focus on different facial regions, and the sensitive regions of darker-skinned people are much smaller. Based on this discovery, we propose a new de-bias method based on gradient attention, called Gradient Attention Balance Network (GABN). Specifically, we use the gradient attention map (GAM) of the face recognition network to track the sensitive facial regions and make the GAMs of different races tend to be consistent through adversarial learning. This method mitigates the bias by making the network focus on similar facial regions. In addition, we also use masks to erase the Top-N sensitive facial regions, forcing the network to allocate its attention to a larger facial region. This method expands the sensitive region of darker-skinned people and further reduces the gap between GAM of darker-skinned people and GAM of Caucasians. Extensive experiments show that GABN successfully mitigates racial bias in face recognition and learns more balanced performance for people of different races.

Keywords: Face recognition

Learning unbiased classifiers from biased data with meta-learning

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Ragonesi, Ruggero; Morerio, Pietro; Murino, Vittorio

Affiliations: Ist Italiano Tecnol, Pattern Anal & Comp Vis PAVIS, Genoa, Italy; Univ Verona, Dept Comp Sci, Verona, Italy

ISBN: (print) 9798350302493

It is well known that large deep architectures are powerful models when adequately trained, but may exhibit undesirable behavior leading to confident incorrect predictions, even when evaluated on slightly different test examples. Test data characterized by distribution shifts (from training data distribution), outliers, and adversarial samples are among the types of data affected by this problem. This situation worsens whenever data are biased, meaning that predictions are mostly based on spurious correlations present in the data. Unfortunately, since such correlations occur in most of the data, a model is prevented from correctly generalizing the considered classes. In this work, we tackle this problem from a meta-learning perspective. Considering the dataset as composed of unknown biased and unbiased samples, we first identify these two subsets by a pseudo-labeling algorithm, even if coarsely. Subsequently, we apply a bi-level optimization algorithm in which, in the inner loop, we look for the best parameters guiding the training of the two subsets, while in the outer loop, we train the final model, benefiting from augmented data generated using Mixup. Properly tuning the contributions of biased and unbiased data, together with the regularization introduced by the mixed data, has proved to be an effective training strategy to learn unbiased models, showing superior generalization capabilities. Experimental results on synthetically and realistically biased datasets surpass state-of-the-art performance, as compared to existing methods.

Keywords: computer vision