ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
AI-based image generation has continued to improve rapidly, producing increasingly realistic images with fewer obvious visual flaws. AI-generated images are being used to create fake online profiles, which in turn are being used for spam, fraud, and disinformation campaigns. As the general problem of detecting any type of manipulated or synthesized content is receiving increasing attention, here we focus on the narrower task of distinguishing a real face from an AI-generated face. This is particularly applicable when tackling inauthentic online accounts with a fake user profile photo. We show that by focusing only on faces, a more resilient and general-purpose artifact can be detected, allowing the detection of AI-generated faces from a variety of GAN- and diffusion-based synthesis engines, and across image resolutions (as low as 128 × 128 pixels) and qualities.
ISBN (Print): 9781665445092
Spatio-temporal, channel-wise, and motion patterns are three complementary and crucial types of information for video action recognition. Conventional 2D CNNs are computationally cheap but cannot capture temporal relationships; 3D CNNs can achieve good performance but are computationally intensive. In this work, we tackle this dilemma by designing a generic and effective module that can be embedded into 2D CNNs. To this end, we propose a spAtio-temporal, Channel and moTion excitatION (ACTION) module consisting of three paths: a Spatio-Temporal Excitation (STE) path, a Channel Excitation (CE) path, and a Motion Excitation (ME) path. The STE path employs a single-channel 3D convolution to characterize the spatio-temporal representation. The CE path adaptively recalibrates channel-wise feature responses by explicitly modeling interdependencies between channels along the temporal dimension. The ME path calculates feature-level temporal differences, which are then utilized to excite motion-sensitive channels. We equip 2D CNNs with the proposed ACTION module to form a simple yet effective ACTION-Net with very limited extra computational cost. ACTION-Net consistently outperforms its 2D CNN counterparts on three backbones (i.e., ResNet-50, MobileNet V2, and BNInception) across three datasets (i.e., Something-Something V2, Jester, and EgoGesture). Code is provided at https://***/V-Sense/ACTION-Net.
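To make the excitation idea concrete, the following is a minimal sketch of a Motion Excitation-style path in PyTorch: feature-level temporal differences are pooled into per-channel attention that excites motion-sensitive channels. The layer sizes, reduction ratio, and zero-padding of the last time step are illustrative assumptions, not the authors' exact configuration.

import torch
import torch.nn as nn

class MotionExcitationSketch(nn.Module):
    """Excites motion-sensitive channels from feature-level temporal differences."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        mid = channels // reduction
        self.squeeze = nn.Conv2d(channels, mid, kernel_size=1)
        self.transform = nn.Conv2d(mid, mid, kernel_size=3, padding=1)
        self.expand = nn.Conv2d(mid, channels, kernel_size=1)
        self.pool = nn.AdaptiveAvgPool2d(1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (N, T, C, H, W) -- a clip of T frames per sample
        n, t, c, h, w = x.shape
        feat = self.squeeze(x.reshape(n * t, c, h, w)).reshape(n, t, -1, h, w)
        # Difference between consecutive frames approximates motion
        diff = feat[:, 1:] - feat[:, :-1]
        diff = self.transform(diff.reshape(n * (t - 1), -1, h, w)).reshape(n, t - 1, -1, h, w)
        diff = torch.cat([diff, torch.zeros_like(diff[:, :1])], dim=1)  # pad last step
        # Spatial pooling + expansion back to C channels gives per-channel attention
        attn = torch.sigmoid(self.expand(self.pool(diff.reshape(n * t, -1, h, w))))
        attn = attn.reshape(n, t, c, 1, 1)
        return x + x * attn  # residual excitation of motion-sensitive channels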
ISBN (Print): 9781665445092
We present a method that takes as input a set of images of a scene illuminated by unconstrained known lighting, and produces as output a 3D representation that can be rendered from novel viewpoints under arbitrary lighting conditions. Our method represents the scene as a continuous volumetric function parameterized as MLPs whose inputs are a 3D location and whose outputs are the following scene properties at that input location: volume density, surface normal, material parameters, distance to the first surface intersection in any direction, and visibility of the external environment in any direction. Together, these allow us to render novel views of the object under arbitrary lighting, including indirect illumination effects. The predicted visibility and surface intersection fields are critical to our model's ability to simulate direct and indirect illumination during training, because the brute-force techniques used by prior work are intractable for lighting conditions outside of controlled setups with a single light. Our method outperforms alternative approaches for recovering relightable 3D scene representations, and performs well in complex lighting settings that have posed a significant challenge to prior work.
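As a rough illustration of the continuous volumetric function described above, the sketch below shows an MLP mapping a 3D location to per-point scene properties. The hidden width, the absence of positional encoding, and the particular material parameterization are assumptions made for brevity; the direction-dependent visibility and surface-intersection distance fields would be additional heads taking a direction as extra input.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ScenePropertyMLP(nn.Module):
    """Maps a 3D location to per-point scene properties (hypothetical sizes)."""
    def __init__(self, hidden: int = 256):
        super().__init__()
        self.trunk = nn.Sequential(
            nn.Linear(3, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.density = nn.Linear(hidden, 1)    # volume density (sigma)
        self.normal = nn.Linear(hidden, 3)     # surface normal
        self.material = nn.Linear(hidden, 4)   # e.g. albedo (3) + roughness (1), assumed

    def forward(self, xyz: torch.Tensor):
        # xyz: (N, 3) query locations
        h = self.trunk(xyz)
        sigma = F.relu(self.density(h))             # non-negative density
        normal = F.normalize(self.normal(h), dim=-1)
        material = torch.sigmoid(self.material(h))  # bounded material parameters
        return sigma, normal, material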
ISBN (Print): 9781665445092
Weakly supervised temporal action detection aims to localize the temporal boundaries of actions and identify their categories simultaneously, using only video-level category labels during training. Among existing methods, attention-based methods have achieved superior performance by separating action and non-action segments. However, without segment-level ground-truth supervision, the quality of the attention weights limits the performance of these methods. To alleviate this problem, we propose a novel Uncertainty Guided Collaborative Training (UGCT) strategy, which mainly includes two key designs: (1) an online pseudo label generation module, in which the RGB and FLOW streams work collaboratively to learn from each other; and (2) an uncertainty aware learning module, which can mitigate the noise in the generated pseudo labels. These two designs work together to promote model performance effectively and efficiently by imposing pseudo label supervision on attention weight learning. Experimental results on three state-of-the-art attention-based methods demonstrate that the proposed training strategy can significantly improve their performance, e.g., by more than 4% in terms of mAP@IoU=0.5 on the THUMOS14 dataset for all three methods.
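The uncertainty-aware idea can be sketched as a loss in which pseudo labels from one stream supervise the attention weights of the other, down-weighted by a predicted uncertainty. The heteroscedastic weighting below is a common formulation used here as an assumption, not necessarily the exact one in UGCT.

import torch
import torch.nn.functional as F

def uncertainty_weighted_bce(attn_logits: torch.Tensor,
                             pseudo_labels: torch.Tensor,
                             log_var: torch.Tensor) -> torch.Tensor:
    """attn_logits, pseudo_labels, log_var: (N, T) per-segment tensors."""
    per_segment = F.binary_cross_entropy_with_logits(
        attn_logits, pseudo_labels, reduction="none")
    # High predicted uncertainty down-weights the (possibly noisy) pseudo label;
    # the additive log-variance term discourages declaring everything uncertain.
    return (per_segment * torch.exp(-log_var) + 0.5 * log_var).mean()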
ISBN (Print): 9781665445092
We propose a novel spatially-correlative loss that is simple, efficient, and yet effective for preserving scene structure consistency while supporting large appearance changes during unpaired image-to-image (I2I) translation. Previous methods attempt this by using pixel-level cycle-consistency or feature-level matching losses, but the domain-specific nature of these losses hinders translation across large domain gaps. To address this, we exploit the spatial patterns of self-similarity as a means of defining scene structure. Our spatially-correlative loss is geared towards capturing only spatial relationships within an image rather than domain appearance. We also introduce a new self-supervised learning method to explicitly learn spatially-correlative maps for each specific translation task. We show distinct improvements over baseline models in all three modes of unpaired I2I translation: single-modal, multi-modal, and even single-image translation. This new loss can easily be integrated into existing network architectures and thus allows wide applicability. The code is available at https://***/lyndonzheng/F-LSeSim.
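A minimal sketch of a self-similarity-based spatially-correlative loss, assuming generic deep features: each image is summarized by how its feature locations relate to one another, and structure is preserved by matching these maps rather than the features themselves.

import torch
import torch.nn.functional as F

def self_similarity_map(feat: torch.Tensor) -> torch.Tensor:
    """feat: (N, C, H, W) -> (N, H*W, H*W) cosine self-similarity."""
    n, c, h, w = feat.shape
    f = F.normalize(feat.reshape(n, c, h * w), dim=1)  # unit-norm per location
    return torch.bmm(f.transpose(1, 2), f)             # pairwise cosine similarities

def spatially_correlative_loss(feat_src: torch.Tensor,
                               feat_tgt: torch.Tensor) -> torch.Tensor:
    # Structure is preserved when the two self-similarity maps agree, regardless
    # of how different the feature (appearance) values themselves are.
    return F.l1_loss(self_similarity_map(feat_src), self_similarity_map(feat_tgt))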
ISBN (Print): 9781665448994
Vehicle re-identification has the objective of finding a specific vehicle among different vehicle crops captured by multiple cameras placed at multiple intersections. Among the different difficulties, high intra-class variability and high inter-class similarity stand out. Moreover, the resolution of the images can differ, which also poses a challenge for the re-identification task. To face these problems, we use as a baseline our previous work, which obtains different deep learning features and ensembles them into a single, stable, and robust feature vector. It also includes post-processing techniques that exploit all the information provided by the CityFlowV2-ReID dataset, including a re-ranking step. In this paper, several newly included improvements are described. Background and orientation similarity matrices are added to the system to reduce bias towards these characteristics. Furthermore, we take into account the camera labels to penalize gallery images that share a camera with the query image. Additionally, to improve the training step, a synthetic dataset is added to the original one.
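The camera-label penalty can be illustrated with a small sketch that simply increases the distance of gallery images captured by the same camera as the query; the additive form and penalty value are assumptions for illustration.

import numpy as np

def penalize_same_camera(dist: np.ndarray,
                         query_cams: np.ndarray,
                         gallery_cams: np.ndarray,
                         penalty: float = 1.0) -> np.ndarray:
    """dist: (num_queries, num_gallery) distance matrix; cams: integer camera ids."""
    same_cam = query_cams[:, None] == gallery_cams[None, :]
    # Gallery crops from the query's own camera are pushed down the ranking.
    return dist + penalty * same_cam.astype(dist.dtype)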
ISBN (Print): 9781665445092
Building instance segmentation models that are data-efficient and can handle rare object categories is an important challenge in computer vision. Leveraging data augmentations is a promising direction towards addressing this challenge. Here, we perform a systematic study of the Copy-Paste augmentation (e.g., [13, 12]) for instance segmentation, where we randomly paste objects onto an image. Prior studies on Copy-Paste relied on modeling the surrounding visual context for pasting the objects. However, we find that the simple mechanism of pasting objects randomly is good enough and can provide solid gains on top of strong baselines. Furthermore, we show Copy-Paste is additive with semi-supervised methods that leverage extra data through pseudo labeling (e.g., self-training). On COCO instance segmentation, we achieve 49.1 mask AP and 57.3 box AP, an improvement of +0.6 mask AP and +1.5 box AP over the previous state-of-the-art. We further demonstrate that Copy-Paste can lead to significant improvements on the LVIS benchmark. Our baseline model outperforms the LVIS 2020 Challenge winning entry by +3.6 mask AP on rare categories.
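A minimal sketch of the random Copy-Paste mechanism, under simplifying assumptions (matching image sizes, no blending, and omitted box/mask bookkeeping for occluded destination objects):

import numpy as np

def copy_paste(dst_img: np.ndarray, src_img: np.ndarray,
               src_masks: list, rng: np.random.Generator):
    """dst_img, src_img: (H, W, 3) arrays of equal size; src_masks: (H, W) bool masks."""
    out = dst_img.copy()
    pasted = []
    for mask in src_masks:
        if rng.random() < 0.5:       # paste a random subset of source objects
            continue
        out[mask] = src_img[mask]    # overwrite destination pixels with the object
        pasted.append(mask)
    # Destination masks/boxes occluded by the pasted objects must also be
    # updated in a full pipeline; that bookkeeping is omitted here.
    return out, pasted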
ISBN (Digital): 9798350353006
ISBN (Print): 9798350353013
Two-view homography estimation is a classic and fundamental problem in computer vision. While conceptually simple, the problem quickly becomes challenging when multiple planes are visible in the image pair. Even with correct matches, each individual plane (homography) might have a very low number of inliers compared to the set of all correspondences. In practice, this requires a large number of RANSAC iterations to generate a good model hypothesis. The current state-of-the-art methods therefore seek to reduce the sample size, from the original four point correspondences, by including additional information such as keypoint orientations/angles or local affine information. In this work, we continue in this direction and propose a novel one-point solver that leverages different approximate constraints derived from the same auxiliary information. In experiments we obtain state-of-the-art results, with execution-time speed-ups, on large benchmark datasets, and show that it is more beneficial for the solver to be sample-efficient than to generate more accurate homographies.
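The motivation for reducing the sample size can be made concrete with the standard RANSAC iteration bound N = log(1 - p) / log(1 - w^s), where p is the desired confidence, w the inlier ratio, and s the minimal sample size; the numbers below are illustrative, not taken from the paper.

import math

def ransac_iterations(inlier_ratio: float, sample_size: int,
                      confidence: float = 0.99) -> int:
    """Iterations needed to draw at least one all-inlier minimal sample."""
    return math.ceil(math.log(1 - confidence) /
                     math.log(1 - inlier_ratio ** sample_size))

# A plane supported by only 10% of all correspondences:
print(ransac_iterations(0.1, 4))  # 4-point solver: 46050 iterations
print(ransac_iterations(0.1, 1))  # 1-point solver: 44 iterations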
ISBN (Digital): 9798350353006
ISBN (Print): 9798350353013
Due to the resource-intensive nature of training vision-language models on expansive video data, a majority of studies have centered on adapting pre-trained image-language models to the video domain. Dominant pipelines propose to tackle the visual discrepancies with additional temporal learners while overlooking the substantial discrepancy between web-scale descriptive narratives and concise action category names, leading to a less distinct semantic space and potential performance limitations. In this work, we prioritize the refinement of text knowledge to facilitate generalizable video recognition. To address the limitations of the less distinct semantic space of category names, we prompt a large language model (LLM) to augment action class names into Spatio-Temporal Descriptors, thus bridging the textual discrepancy and serving as a knowledge base for general recognition. Moreover, to assign the best descriptors to different video instances, we propose an Optimal Descriptor Solver, forming the video recognition problem as solving the optimal matching flow across frame-level representations and descriptors. Comprehensive evaluations in zero-shot, few-shot, and fully supervised video recognition highlight the effectiveness of our approach. Our best model achieves a state-of-the-art zero-shot accuracy of 75.1% on Kinetics-600.
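In the spirit of the Optimal Descriptor Solver described above, matching frame-level features to descriptors can be sketched as an entropic optimal-transport problem solved with plain Sinkhorn iterations and uniform marginals; this formulation is an assumption, not necessarily the paper's exact solver.

import torch
import torch.nn.functional as F

def sinkhorn_match(frame_feats: torch.Tensor, descriptors: torch.Tensor,
                   eps: float = 0.05, iters: int = 50) -> torch.Tensor:
    """frame_feats: (T, D), descriptors: (K, D); returns a (T, K) soft matching."""
    cost = 1 - F.normalize(frame_feats, dim=-1) @ F.normalize(descriptors, dim=-1).T
    kernel = torch.exp(-cost / eps)
    r = torch.ones(cost.shape[0]) / cost.shape[0]   # uniform marginal over frames
    c = torch.ones(cost.shape[1]) / cost.shape[1]   # uniform marginal over descriptors
    u, v = torch.ones_like(r), torch.ones_like(c)
    for _ in range(iters):
        u = r / (kernel @ v)
        v = c / (kernel.T @ u)
    return u[:, None] * kernel * v[None, :]         # transport plan (soft matching)

# The video-level score can then be an OT-weighted sum of frame-descriptor similarities.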
ISBN (Print): 9781665445092
Face forgery detection is attracting ever-increasing interest in computer vision since facial manipulation technologies raise serious concerns. Though recent works have reached sound achievements, there are still unignorable problems: (a) features learned under softmax supervision are separable but not discriminative enough, since the softmax loss does not explicitly encourage intra-class compactness and inter-class separability; and (b) fixed filter banks and hand-crafted features are insufficient to capture forgery patterns in the frequency domain from diverse inputs. To compensate for such limitations, a novel frequency-aware discriminative feature learning framework is proposed in this paper. Specifically, we design a novel single-center loss (SCL) that only compresses the intra-class variations of natural faces while boosting inter-class differences in the embedding space. In such a case, the network can learn more discriminative features with less optimization difficulty. Besides, an adaptive frequency feature generation module is developed to mine frequency clues in a completely data-driven fashion. With the above two modules, the whole framework can learn more discriminative features in an end-to-end manner. Extensive experiments demonstrate the effectiveness and superiority of our framework on three versions of the FF++ dataset.
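A hedged sketch of a single-center loss in the spirit described above: only natural-face embeddings are pulled toward a learned center, while forged faces are pushed at least a margin farther away on average. The margin value, Euclidean distance, and mixed-batch assumption are illustrative choices, not the paper's exact formulation.

import torch
import torch.nn as nn

class SingleCenterLossSketch(nn.Module):
    """Pulls natural faces toward one center; pushes forged faces a margin away."""
    def __init__(self, feat_dim: int, margin: float = 0.3):
        super().__init__()
        self.center = nn.Parameter(torch.zeros(feat_dim))
        self.margin = margin

    def forward(self, embeddings: torch.Tensor, is_real: torch.Tensor) -> torch.Tensor:
        # embeddings: (N, D); is_real: (N,) boolean mask; the batch is assumed
        # to contain both natural and forged faces.
        dist = (embeddings - self.center).norm(dim=1)
        mean_real = dist[is_real].mean()
        mean_fake = dist[~is_real].mean()
        # Compress natural faces around the center and require forged faces to
        # lie at least `margin` farther from it on average.
        return mean_real + torch.relu(mean_real - mean_fake + self.margin)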