ISBN (digital): 9798350353006
ISBN (print): 9798350353013
Recent advancements in machine learning have spotlighted the potential of hyperbolic spaces, as they effectively learn hierarchical feature representations. While there has been progress in leveraging hyperbolic spaces in single-modality contexts, their use in multimodal settings remains underexplored. A recent work sought to transpose Euclidean multimodal learning techniques to hyperbolic spaces by adopting a geodesic-distance-based contrastive loss. However, we show both theoretically and empirically that such a spatial-proximity-based contrastive loss significantly disrupts hierarchies in the latent space. To remedy this, we advocate that cross-modal representations should accept the inherent modality gap between text and images, and we introduce a novel approach to measure cross-modal similarity that does not enforce spatial proximity. Our approach shows remarkable capability in preserving unimodal hierarchies while aligning the two modalities, and our experiments on a series of downstream tasks demonstrate that a better latent structure emerges with our objective function while remaining superior in text-to-image and image-to-text retrieval tasks.
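For concreteness, here is a minimal sketch (not the paper's code) of the geodesic-distance-based contrastive objective that the paper criticizes, written on the Lorentz model of hyperbolic space; the curvature `k`, temperature `tau`, and all names are illustrative assumptions, and the paper's own proximity-free similarity measure is not detailed in this abstract.

```python
import torch

def lorentz_inner(x, y):
    # Lorentzian inner product <x, y>_L = -x_0 * y_0 + sum_i x_i * y_i
    return -x[..., 0] * y[..., 0] + (x[..., 1:] * y[..., 1:]).sum(-1)

def geodesic_dist(x, y, k=1.0):
    # Geodesic distance on the hyperboloid of curvature -k:
    # d(x, y) = arccosh(-k * <x, y>_L) / sqrt(k)
    inner = torch.clamp(-k * lorentz_inner(x, y), min=1.0 + 1e-6)
    return torch.acosh(inner) / k ** 0.5

def proximity_contrastive(img_emb, txt_emb, tau=0.07):
    # The baseline objective the paper criticizes: matched image-text pairs
    # are pulled toward the same point by using the negative geodesic
    # distance as the similarity logit, which flattens unimodal hierarchies.
    sim = -geodesic_dist(img_emb[:, None, :], txt_emb[None, :, :])
    labels = torch.arange(img_emb.shape[0])
    return torch.nn.functional.cross_entropy(sim / tau, labels)
```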
ISBN (digital): 9798350353006
ISBN (print): 9798350353013
The boundless space of neural networks that could be used to solve a problem, each with different performance, leads to a situation where a Deep Learning expert is required to identify the best one. This runs counter to the hope of removing the need for experts. Neural Architecture Search (NAS) offers a solution by automatically identifying the best architecture. However, to date, NAS work has focused on a small set of datasets which we argue are not representative of real-world problems. We introduce eight new datasets created for a series of NAS Challenges: AddNIST, Language, MultNIST, CIFAR-Tile, Gutenberg, Isabella, GeoClassing, and Chesseract. These datasets and challenges were developed to direct attention to issues in NAS development and to encourage authors to consider how their models will perform on datasets unknown to them at development time. We present experimentation using standard Deep Learning methods as well as the best results from challenge participants.
ISBN (digital): 9781665445092
ISBN (print): 9781665445092
This paper addresses the challenging unsupervised scene flow estimation problem by jointly learning four low-level vision sub-tasks: optical flow F, stereo-depth D, camera pose P, and motion segmentation S. Our key insight is that the rigidity of the scene shares the same inherent geometrical structure as object movements and scene depth. Hence, rigidity from S can be inferred by jointly coupling F, D, and P to achieve more robust estimation. To this end, we propose a novel scene flow framework named EffiScene with efficient joint rigidity learning, going beyond the existing pipeline with independent auxiliary structures. In EffiScene, we first estimate optical flow and depth at the coarse level and then compute camera pose by the Perspective-n-Point (PnP) method. To jointly learn local rigidity, we design a novel Rigidity From Motion (RfM) layer with three principal components: (i) correlation extraction; (ii) boundary learning; and (iii) outlier exclusion. Final outputs are fused based on the rigid map M_R from RfM at finer levels. To efficiently train EffiScene, two new losses L_bnd and L_unc are designed to prevent trivial solutions and to regularize the flow boundary discontinuity. Extensive experiments on the scene flow benchmark KITTI show that our method is effective and significantly improves the state-of-the-art approaches for all sub-tasks, i.e., optical flow (5.19 -> 4.20), depth estimation (3.78 -> 3.46), visual odometry (0.012 -> 0.011), and motion segmentation (0.57 -> 0.62).
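A hedged sketch of the pose step described above: back-project frame-t pixels into 3D using the predicted depth, pair them with their optical-flow correspondences in frame t+1, and solve Perspective-n-Point. The OpenCV RANSAC variant and all variable names are assumptions, not EffiScene's actual code.

```python
import cv2
import numpy as np

def pose_from_flow_and_depth(depth, flow, K):
    """depth: (H, W) predicted depth; flow: (H, W, 2) optical flow; K: (3, 3) intrinsics."""
    h, w = depth.shape
    ys, xs = np.mgrid[0:h, 0:w]
    # Back-project frame-t pixels to 3D camera coordinates via the pinhole model.
    z = depth.reshape(-1)
    x3 = (xs.reshape(-1) - K[0, 2]) * z / K[0, 0]
    y3 = (ys.reshape(-1) - K[1, 2]) * z / K[1, 1]
    pts3d = np.stack([x3, y3, z], axis=1).astype(np.float64)
    # Their 2D correspondences in frame t+1 are given by the optical flow.
    pts2d = np.stack([(xs + flow[..., 0]).reshape(-1),
                      (ys + flow[..., 1]).reshape(-1)], axis=1).astype(np.float64)
    # RANSAC keeps the estimate robust to non-rigid (moving-object) pixels;
    # in practice one would subsample points and mask out low-rigidity regions.
    ok, rvec, tvec, inliers = cv2.solvePnPRansac(pts3d, pts2d,
                                                 K.astype(np.float64), None)
    return rvec, tvec  # camera rotation (Rodrigues vector) and translation
```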
ISBN (print): 9781665445092
Hand gesture-to-gesture translation is a significant and interesting problem that plays a key role in many applications, such as sign language production. This task requires fine-grained understanding of the structural mapping between the source and target gestures. Current works follow a data-driven paradigm based on sparse 2D joint representations. However, given the insufficient representation capability of 2D joints, this paradigm easily leads to blurry results with incorrect structure. In this paper, we propose a novel model-aware gesture-to-gesture translation framework that introduces a hand prior, with hand meshes as the intermediate representation. To take full advantage of the structured hand model, we first build a dense topology map aligning the image plane with the encoded embedding of the visible hand mesh. Then, a transformation flow is calculated based on the correspondence between the source and target topology maps. During the generation stage, we inject the topology information into the generation streams by modulating the activations in a spatially-adaptive manner. Further, we incorporate the source's local characteristics to enhance the translated gesture image according to the transformation flow. Extensive experiments on two benchmark datasets demonstrate that our method achieves new state-of-the-art performance.
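The spatially-adaptive modulation mentioned above can be sketched in the spirit of SPADE-style conditional normalization, shown below; the layer widths, the `InstanceNorm2d` choice, and the class name are illustrative assumptions rather than the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopologyModulation(nn.Module):
    """Modulates generator activations with per-pixel scale/shift
    predicted from the dense topology map."""
    def __init__(self, feat_ch, topo_ch, hidden=128):
        super().__init__()
        self.norm = nn.InstanceNorm2d(feat_ch, affine=False)
        self.shared = nn.Sequential(
            nn.Conv2d(topo_ch, hidden, 3, padding=1), nn.ReLU(inplace=True))
        self.gamma = nn.Conv2d(hidden, feat_ch, 3, padding=1)  # spatial scale
        self.beta = nn.Conv2d(hidden, feat_ch, 3, padding=1)   # spatial shift

    def forward(self, feat, topo_map):
        # Resize the topology map to the feature resolution, then predict
        # spatially varying modulation parameters from it.
        topo = F.interpolate(topo_map, size=feat.shape[-2:], mode='nearest')
        h = self.shared(topo)
        return self.norm(feat) * (1 + self.gamma(h)) + self.beta(h)
```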
ISBN (print): 9781665445092
Transferring makeup from a misaligned reference image is challenging. Previous methods overcome this barrier by computing pixel-wise correspondences between two images, which is inaccurate and computationally expensive. In this paper, we take a different perspective and break down the makeup transfer problem into a two-step extraction-assignment process. To this end, we propose a Style-based Controllable GAN model consisting of three components corresponding to target style-code encoding, face identity feature extraction, and makeup fusion, respectively. In particular, a Part-specific Style Encoder encodes the component-wise makeup style of the reference image into a style-code in an intermediate latent space W. The style-code discards spatial information and is therefore invariant to spatial misalignment. On the other hand, the style-code embeds component-wise information, enabling flexible partial makeup editing from multiple references. This style-code, together with source identity features, is integrated into a Makeup Fusion Decoder equipped with multiple AdaIN layers to generate the final result. Our proposed method demonstrates great flexibility in makeup transfer by supporting makeup removal, shade-controllable makeup transfer, and part-specific makeup transfer, even under large spatial misalignment. Extensive experiments demonstrate the superiority of our approach over state-of-the-art methods.
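Since the abstract explicitly names AdaIN layers in the fusion decoder, a minimal sketch of that building block follows; the affine mapping from the style-code and the class layout are assumptions, not the paper's released code.

```python
import torch
import torch.nn as nn

class AdaIN(nn.Module):
    """Adaptive Instance Normalization: the style-code sets per-channel
    scale and shift for the normalized identity features."""
    def __init__(self, style_dim, num_ch):
        super().__init__()
        self.norm = nn.InstanceNorm2d(num_ch, affine=False)
        self.affine = nn.Linear(style_dim, num_ch * 2)

    def forward(self, feat, style_code):
        scale, shift = self.affine(style_code).chunk(2, dim=1)
        # Broadcast the per-channel parameters over spatial dimensions.
        return (self.norm(feat) * (1 + scale[..., None, None])
                + shift[..., None, None])
```

Because the style-code carries no spatial information, this modulation is unaffected by misalignment between the source and reference images, which matches the invariance claimed in the abstract.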
ISBN (print): 9781665445092
Batch Normalization (BN) and its variants have delivered tremendous success in combating the covariate shift induced by the training step of deep learning methods. While these techniques normalize the feature distribution by standardizing with batch statistics, they do not correct the influence on features of extraneous variables or multiple distributions. Such extra variables, referred to here as metadata, may create bias or confounding effects (e.g., race when classifying gender from face images). We introduce the Metadata Normalization (MDN) layer, a new batch-level operation that can be used end-to-end within the training framework to correct the influence of metadata on the feature distribution. MDN adopts a regression analysis technique traditionally used for preprocessing to remove (regress out) the metadata effects on model features during training. We use a metric based on distance correlation to quantify the distribution bias from the metadata and demonstrate that our method successfully removes metadata effects in four diverse settings: one synthetic, one 2D image, one video, and one 3D medical image dataset.
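The "regress out" step can be sketched as batch-level least-squares residualization, the textbook technique the abstract refers to; this is an illustration under that reading, not the exact MDN layer (whose handling of statistics across batches is not described here).

```python
import torch

def metadata_normalize(features, metadata):
    """features: (B, D) activations; metadata: (B, K) extraneous variables."""
    B = metadata.shape[0]
    # Include an intercept column so that only the metadata-explained
    # variance, not the feature mean, is removed.
    M = torch.cat([torch.ones(B, 1, device=features.device), metadata], dim=1)
    beta = torch.linalg.lstsq(M, features).solution  # (K + 1, D) GLM coefficients
    explained = metadata @ beta[1:]                  # metadata contribution only
    return features - explained                      # residual: metadata regressed out
```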
ISBN (print): 9781665445092
Deep-learning-based methods have shown dramatic improvements in image rain removal by using large-scale paired data from synthetic datasets. However, because the varied appearances of real rain streaks may differ from those in the synthetic training data, it is challenging to directly extend existing methods to real-world scenes. To address this issue, we propose a memory-oriented semi-supervised (MOSS) method that enables the network to explore and exploit the properties of rain streaks from both synthetic and real data. The key aspect of our method is an encoder-decoder neural network augmented with a self-supervised memory module, where items in the memory record the prototypical patterns of rain degradations and are updated in a self-supervised way. Consequently, rainy styles can be comprehensively derived from synthetic or real-world degraded images without the need for clean labels. Furthermore, we present a self-training mechanism that attempts to transfer deraining knowledge from supervised rain removal to unsupervised cases. An additional target network, updated with an exponential moving average of the online deraining network, is utilized to produce pseudo-labels for unlabeled rainy images. Meanwhile, the deraining network is optimized with supervised objectives on both synthetic paired data and pseudo-paired noisy data. Extensive experiments show that the proposed method achieves more appealing results than recent state-of-the-art methods, not only on limited labeled data but also on unlabeled real-world images.
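The pseudo-labeling teacher described above follows the familiar mean-teacher/EMA recipe; a minimal sketch, with the decay value as an assumed hyperparameter:

```python
import torch

@torch.no_grad()
def ema_update(online_net, target_net, decay=0.999):
    # target <- decay * target + (1 - decay) * online, parameter-wise.
    for p_t, p_o in zip(target_net.parameters(), online_net.parameters()):
        p_t.mul_(decay).add_(p_o, alpha=1 - decay)

# Assumed usage: the target network starts as a deep copy of the online
# deraining network; after each optimizer step, call
#   ema_update(deraining_net, target_net)
# and use target_net(real_rainy_batch) to produce pseudo-labels.
```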
ISBN (digital): 9798350353006
ISBN (print): 9798350353013
The widespread adoption of face recognition has led to increasing privacy concerns, as unauthorized access to face images can expose sensitive personal information. This paper explores face image protection against viewing and recovery attacks. Inspired by image compression, we propose creating a visually uninformative face image through feature subtraction between an original face and its model-produced regeneration. Recognizable identity features within the image are encouraged by co-training a recognition model on its high-dimensional feature representation. To enhance privacy, the high-dimensional representation is crafted through random channel shuffling, resulting in randomized recognizable images devoid of attacker-leverageable texture details. We distill our methodologies into a novel privacy-preserving face recognition method, MinusFace. Experiments demonstrate its high recognition accuracy and effective privacy protection. Its code is available at https://***/Tencent/TFace.
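The two operations the abstract names, feature subtraction against a model-produced regeneration and random channel shuffling, can be sketched as below; the function names, shapes, and the stand-in regeneration model are hypothetical, not MinusFace's released code.

```python
import torch

def protective_residual(face, regen_model):
    # Visually uninformative residual: the original face minus its
    # model-produced regeneration (regen_model is a hypothetical stand-in).
    with torch.no_grad():
        regen = regen_model(face)
    return face - regen

def random_channel_shuffle(features, generator=None):
    # Permute channels of the high-dimensional representation with a
    # private random order, removing attacker-leverageable texture
    # structure while keeping the content recognizable to the co-trained
    # recognition model.
    perm = torch.randperm(features.shape[1], generator=generator)
    return features[:, perm], perm
```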
ISBN (digital): 9798350353006
ISBN (print): 9798350353013
We present an approach to accelerate Neural Field training by efficiently selecting sampling locations. While Neural Fields have recently become popular, they are often trained by uniformly sampling the training domain or through handcrafted heuristics. We show that improved convergence and final training quality can be achieved by a soft mining technique based on importance sampling: rather than either considering or ignoring a pixel completely, we weigh the corresponding loss by a scalar. To implement our idea, we use Langevin Monte-Carlo sampling. We show that, by doing so, regions with higher error are selected more frequently, leading to a more than 2x improvement in convergence speed. The code and related resources for this study are publicly available at the project page.
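One Langevin Monte-Carlo step of the kind the abstract describes can be sketched as follows: sampling coordinates drift up the error field with added Gaussian noise, so high-error regions are visited more often, while each sample's loss is still weighted by a scalar rather than hard-selected. The step/noise scales and the [0, 1]^2 domain are illustrative assumptions, not the paper's exact settings.

```python
import torch

def langevin_step(coords, error_fn, step=1e-2, noise=1e-2):
    """coords: (N, 2) sampling locations in [0, 1]^2 with requires_grad=True;
    error_fn: differentiable map from coordinates to per-sample error."""
    err = error_fn(coords).sum()
    (grad,) = torch.autograd.grad(err, coords)
    # Gradient ascent on the error field plus Gaussian exploration noise,
    # then clamp back into the training domain.
    new = coords + step * grad + noise * torch.randn_like(coords)
    return new.clamp(0.0, 1.0).detach().requires_grad_(True)
```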
ISBN (digital): 9798350353006
ISBN (print): 9798350353013
Generative object compositing is emerging as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting the practical usage of most existing methods. In response, this paper introduces IMPRINT, a novel diffusion-based generative model trained with a two-stage learning framework that decouples learning of identity preservation from that of compositing. The first stage targets context-agnostic, identity-preserving pretraining of the object encoder, enabling the encoder to learn an embedding that is both view-invariant and conducive to enhanced detail preservation. The subsequent stage leverages this representation to learn seamless harmonization of the object composited onto the background. In addition, IMPRINT incorporates a shape-guidance mechanism offering user-directed control over the compositing process. Extensive experiments demonstrate that IMPRINT significantly outperforms existing methods and various baselines in identity preservation and composition quality. Project page: https://***/IMPRINT-Project-Page/