Search Results - Inner Mongolia University Library

Search query: any field = "1994 IEEE Computer-Society Conference on Computer Vision and Pattern Recognition"

22,906 records in total; showing records 4881-4890.

M³P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Ni, Minheng; Huang, Haoyang; Su, Lin; Cui, Edward; Bharti, Taroon; Wang, Lijuan; Zhang, Dongdong; Duan, Nan

Affiliations: Harbin Inst Technol Res Ctr Social Comp & Informat Retrieval Harbin Peoples R China; Microsoft Res Asia Nat Language Comp Shanghai Peoples R China; Microsoft Bing Multimedia Team Shanghai Peoples R China; Microsoft Cloud AI Redmond WA USA

ISBN: (print) 9781665445092

We present M³P, a Multitask Multilingual Multimodal Pre-trained model that combines multilingual pre-training and multimodal pre-training into a unified framework via multitask pre-training. Our goal is to learn universal representations that can map objects occurred in different modalities or texts expressed in different languages into a common semantic space. In addition, to explicitly encourage fine-grained alignment between images and non-English languages, we also propose Multimodal Code-switched Training (MCT) to combine monolingual pre-training and multimodal pre-training via a code-switch strategy. Experiments are performed on the multilingual image retrieval task across two benchmark datasets, including MSCOCO and Multi30K. M³P can achieve comparable results for English and new state-of-the-art results for non-English languages.

Keywords: Training; Computer vision; Computational modeling; Semantics; Image retrieval; Benchmark testing; Data models

Coarse-to-Fine Feature Mining for Video Semantic Segmentation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Sun, Guolei; Liu, Yun; Ding, Henghui; Probst, Thomas; Van Gool, Luc

Affiliations: Swiss Fed Inst Technol Comp Vis Lab Zurich Switzerland; Katholieke Univ Leuven VISICS Leuven Belgium

ISBN: (digital) 9781665469463

ISBN: (print) 9781665469463

The contextual information plays a core role in semantic segmentation. As for video semantic segmentation, the contexts include static contexts and motional contexts, corresponding to static content and moving content in a video clip, respectively. The static contexts are well exploited in image semantic segmentation by learning multiscale and global/long-range features. The motional contexts are studied in previous video semantic segmentation. However, there is no research about how to simultaneously learn static and motional contexts which are highly correlated and complementary to each other. To address this problem, we propose a Coarse-to-Fine Feature Mining (CFFM) technique to learn a unified presentation of static contexts and motional contexts. This technique consists of two parts: coarse-to-fine feature assembling and cross frame feature mining. The former operation prepares data for further processing, enabling the subsequent joint learning of static and motional contexts. The latter operation mines useful information/contexts from the sequential frames to enhance the video contexts of the features of the target frame. The enhanced features can be directly applied for the final prediction. Experimental results on popular benchmarks demonstrate that the proposed CFFM performs favorably against state-of-the-art methods for video semantic segmentation. Our implementation is available at https://***/GuoleiSun/VSS-CFFM.

Keywords: Image segmentation; Computer vision; Shape; Motion segmentation; Semantics; Benchmark testing; Data mining

Hybrid Message Passing with Performance-Driven Structures for Facial Action Unit Detection

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Song, Tengfei; Cui, Zijun; Zheng, Wenming; Ji, Qiang

Affiliations: Southeast Univ Nanjing Peoples R China; Rensselaer Polytech Inst Troy NY 12181 USA

ISBN: (print) 9781665445092

Message passing neural network has been an effective method to represent dependencies among nodes by propagating messages. However, most of message passing algorithms focus on one structure and messages are estimated by one single approach. For real-world data, like facial action units (AUs), the dependencies may vary in terms of different expressions and individuals. In this paper, we propose a novel hybrid message passing neural network with performance-driven structures (HMP-PS), which combines complementary message passing methods and captures more possible structures in a Bayesian manner. Particularly, a performance-driven Monte Carlo Markov Chain sampling method is proposed for generating high performance graph structures. Besides, hybrid message passing is proposed to combine different types of messages, which provide the complementary information. The contribution of each type of message is adaptively adjusted along with different inputs. The experiments on two widely used benchmark datasets, i.e., BP4D and DISFA, validate that our proposed method can achieve the state-of-the-art performance.

Keywords: Gold; Computer vision; Monte Carlo methods; Databases; Message passing; Neural networks; Markov processes

Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liu, Shaowei; Jiang, Hanwen; Xu, Jiarui; Liu, Sifei; Wang, Xiaolong

Affiliations: Univ Calif San Diego La Jolla CA 92093 USA; NVIDIA Santa Clara CA USA

ISBN: (print) 9781665445092

Estimating 3D hand and object pose from a single image is an extremely challenging problem: hands and objects are often self-occluded during interactions, and the 3D annotations are scarce as even humans cannot directly label the ground-truths from a single image perfectly. To tackle these challenges, we propose a unified framework for estimating the 3D hand and object poses with semi-supervised learning. We build a joint learning framework where we perform explicit contextual reasoning between hand and object representations. Going beyond limited 3D annotations in a single image, we leverage the spatial-temporal consistency in large-scale hand-object videos as a constraint for generating pseudo labels in semi-supervised learning. Our method not only improves hand pose estimation in challenging real-world dataset, but also substantially improve the object pose which has fewer ground-truths per instance. By training with large-scale diverse videos, our model also generalizes better across multiple out-of-domain datasets. Project page and code: https://***/Semi-Hand-Object.

Keywords: Training; Computer vision; Three-dimensional displays; Annotations; Filtering; Pose estimation; Semisupervised learning

DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wu, Xinyi; Wu, Zhenyao; Guo, Hao; Ju, Lili; Wang, Song

Affiliations: Univ South Carolina Columbia SC 29208 USA; Farsee2 Technol Ltd Shenzhen Guangdong Peoples R China

ISBN: (print) 9781665445092

Semantic segmentation of nighttime images plays an equally important role as that of daytime images in autonomous driving, but the former is much more challenging due to poor illuminations and arduous human annotations. In this paper, we propose a novel domain adaptation network (DANNet) for nighttime semantic segmentation without using labeled nighttime image data. It employs an adversarial training with a labeled daytime dataset and an unlabeled dataset that contains coarsely aligned day-night image pairs. Specifically, for the unlabeled day-night image pairs, we use the pixel-level predictions of static object categories on a daytime image as a pseudo supervision to segment its counterpart nighttime image. We further design a re-weighting strategy to handle the inaccuracy caused by misalignment between day-night image pairs and wrong predictions of daytime images, as well as boost the prediction accuracy of small objects. The proposed DANNet is the first one-stage adaptation framework for nighttime semantic segmentation, which does not train additional day-night image transfer models as a separate pre-processing stage. Extensive experiments on Dark Zurich and Nighttime Driving datasets show that our method achieves state-of-the-art performance for nighttime semantic segmentation.

Keywords: Training; Bridges; Image segmentation; Computer vision; Annotations; Semantics; Neural networks

Unbalanced Optimal Transport: A Unified Framework for Object Detection

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: De Plaen, Henri; De Plaen, Pierre-Francois; Suykens, Johan A. K.; Proesmans, Marc; Tuytelaars, Tinne; Van Gool, Luc

Affiliations: Katholieke Univ Leuven ESAT STADIUS Leuven Belgium; Katholieke Univ Leuven ESAT PSI Leuven Belgium; Swiss Fed Inst Technol Comp Vis Lab Zurich Switzerland

ISBN: (print) 9798350301298

During training, supervised object detection tries to correctly match the predicted bounding boxes and associated classification scores to the ground truth. This is essential to determine which predictions are to be pushed towards which solutions, or to be discarded. Popular matching strategies include matching to the closest ground truth box (mostly used in combination with anchors), or matching via the Hungarian algorithm (mostly used in anchor-free methods). Each of these strategies comes with its own properties, underlying losses, and heuristics. We show how Unbalanced Optimal Transport unifies these different approaches and opens a whole continuum of methods in between. This allows for a finer selection of the desired properties. Experimentally, we show that training an object detection model with Unbalanced Optimal Transport is able to reach the state-of-the-art both in terms of Average Precision and Average Recall as well as to provide a faster initial convergence. The approach is well suited for GPU implementation, which proves to be an advantage for large-scale models.

Keywords: Computer vision theory

Tracking People by Predicting 3D Appearance, Location and Pose

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Rajasegaran, Jathushan; Pavlakos, Georgios; Kanazawa, Angjoo; Malik, Jitendra

Affiliations: Univ Calif Berkeley Berkeley CA 94720 USA

ISBN: (digital) 9781665469463

ISBN: (print) 9781665469463

We present an approach for tracking people in monocular videos by predicting their future 3D representations. To achieve this, we first lift people to 3D from a single frame in a robust manner. This lifting includes information about the 3D pose of the person, their location in the 3D space, and the 3D appearance. As we track a person, we collect 3D observations over time in a tracklet representation. Given the 3D nature of our observations, we build temporal models for each one of the previous attributes. We use these models to predict the future state of the tracklet, including 3D appearance, 3D location, and 3D pose. For a future frame, we compute the similarity between the predicted state of a tracklet and the single frame observations in a probabilistic manner. Association is solved with simple Hungarian matching, and the matches are used to update the respective tracklets. We evaluate our approach on various benchmarks and report state-of-the-art results. Code and models are available at: https://***/PHALP.

Keywords: Solid modeling; Computer vision; Three-dimensional displays; Codes; Computational modeling; Predictive models; Benchmark testing

Coarse-to-Fine Domain Adaptive Semantic Segmentation with Photometric Alignment and Category-Center Regularization

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Ma, Haoyu; Lin, Xiangru; Wu, Zifeng; Yu, Yizhou

Affiliations: Univ Hong Kong Hong Kong Peoples R China; Deepwise AI Lab Beijing Peoples R China

ISBN: (print) 9781665445092

Unsupervised domain adaptation (UDA) in semantic segmentation is a fundamental yet promising task relieving the need for laborious annotation works. However, the domain shifts/discrepancies problem in this task compromise the final segmentation performance. Based on our observation, the main causes of the domain shifts are differences in imaging conditions, called image-level domain shifts, and differences in object category configurations called category-level domain shifts. In this paper, we propose a novel UDA pipeline that unifies image-level alignment and category-level feature distribution regularization in a coarse-to-fine manner. Specifically, on the coarse side, we propose a photometric alignment module that aligns an image in the source domain with a reference image from the target domain using a set of image-level operators; on the fine side, we propose a category-oriented triplet loss that imposes a soft constraint to regularize category centers in the source domain and a self-supervised consistency regularization method in the target domain. Experimental results show that our proposed pipeline improves the generalization capability of the final segmentation model and significantly outperforms all previous state-of-the-arts.

Keywords: Image segmentation; Adaptation models; Computer vision; Annotations; Computational modeling; Semantics; Pipelines

High Quality Segmentation for Ultra High-resolution Images

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Shen, Tiancheng; Zhang, Yuechen; Qi, Lu; Kuen, Jason; Xie, Xingyu; Wu, Jianlong; Lin, Zhe; Jia, Jiaya

Affiliations: Chinese Univ Hong Kong Hong Kong Peoples R China; Adobe Res Beijing Peoples R China; Peking Univ Beijing Peoples R China; Shandong Univ Jinan Peoples R China; SmartMore Beijing Peoples R China

ISBN: (digital) 9781665469463

ISBN: (print) 9781665469463

To segment 4K or 6K ultra high-resolution images needs extra computation consideration in image segmentation. Common strategies, such as down-sampling, patch cropping, and cascade model, cannot address well the balance issue between accuracy and computation cost. Motivated by the fact that humans distinguish among objects continuously from coarse to precise levels, we propose the Continuous Refinement Model (CRM) for the ultra high-resolution segmentation refinement task. CRM continuously aligns the feature map with the refinement target and aggregates features to reconstruct these image details. Besides, our CRM shows its significant generalization ability to fill the resolution gap between low-resolution training images and ultra high-resolution testing ones. We present quantitative performance evaluation and visualization to show that our proposed method is fast and effective on image segmentation refinement. Code is available at https://***/dvlab-research/Entity/tree/main/CRM.

Keywords: Training; Image segmentation; Computer vision; Visualization; Computational modeling; Aggregates; Customer relationship management

Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Geng, Zigang; Sun, Ke; Xiao, Bin; Zhang, Zhaoxiang; Wang, Jingdong

Affiliations: Univ Sci & Technol China Beijing Peoples R China; Chinese Acad Sci Univ Chinese Acad Sci Inst Automat Ctr Artificial Intelligence & RobotHKISI Beijing Peoples R China; Microsoft Corp Redmond WA 98052 USA

ISBN: (print) 9781665445092

In this paper, we are interested in the bottom-up paradigm of estimating human poses from an image. We study the dense keypoint regression framework that is previously inferior to the keypoint detection and grouping framework. Our motivation is that regressing keypoint positions accurately needs to learn representations that focus on the keypoint regions. We present a simple yet effective approach, named disentangled keypoint regression (DEKR). We adopt adaptive convolutions through pixel-wise spatial transformer to activate the pixels in the keypoint regions and accordingly learn representations from them. We use a multi-branch structure for separate regression: each branch learns a representation with dedicated adaptive convolutions and regresses one keypoint. The resulting disentangled representations are able to attend to the keypoint regions, respectively, and thus the keypoint regression is spatially more accurate. We empirically show that the proposed direct regression method outperforms keypoint detection and grouping methods and achieves superior bottom-up pose estimation results on two benchmark datasets, COCO and Crowd-Pose. The code and models are available at https://***/HRNet/DEKR.

Keywords: Convolutional codes; Location awareness; Computer vision; Pose estimation; Focusing; Object detection; Benchmark testing

Address: No. 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021