Details
ISBN:
(print) 9781665487399
Neural Architecture Search (NAS) defines the design of neural networks as a search problem. Unfortunately, NAS is computationally intensive because the search space grows with the number of elements in the design and the possible connections between them. In this work, we extensively analyze the role of dataset size, based on several sampling approaches for reducing it (unsupervised and supervised cases), as an agnostic way to reduce search time. We compared these techniques with four common NAS approaches in NAS-Bench-201 in roughly 1,400 experiments on CIFAR-100. One of our surprising findings is that in most cases we can reduce the amount of training data to 25%, consequently also reducing search time to 25%, while maintaining the same accuracy as training on the full dataset. In addition, some designs derived from subsets outperform designs derived from the full dataset by up to 22 p.p. in accuracy.
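The subset idea described above can be made concrete with a minimal sketch: sample the same fraction from every class so the label distribution survives the reduction. The helper below is a hypothetical illustration, not the authors' code.

```python
import random

def stratified_subset(labels, fraction=0.25, seed=0):
    """Return indices of a class-balanced subset of a dataset.

    Keeping the per-class proportions intact while dropping
    (1 - fraction) of the samples cuts training time, and hence
    NAS search time, roughly in proportion to the fraction.
    """
    rng = random.Random(seed)
    by_class = {}
    for idx, y in enumerate(labels):
        by_class.setdefault(y, []).append(idx)
    subset = []
    for idxs in by_class.values():
        k = max(1, int(len(idxs) * fraction))
        subset.extend(rng.sample(idxs, k))
    return sorted(subset)

# Toy labels: 100 samples over 4 classes -> 24 kept (6 per class)
labels = [i % 4 for i in range(100)]
subset = stratified_subset(labels)
print(len(subset))  # 24
```

The supervised case above uses labels for stratification; the unsupervised variants in the paper would replace the per-class grouping with, e.g., clustering in feature space.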
Details
ISBN:
(digital) 9781665487399
ISBN:
(print) 9781665487399
Vessels move 90% of international cargo by volume, and the marine economy contributes 5.1% of global GDP. As one of the oldest industries, the marine industry has yet to embrace innovations in modern technology to safeguard the blue economy. Situational awareness from intelligent vessel systems can enable enhanced safety and decision-making for mariners. As the foundation for these intelligent systems, advanced perception technology requires sufficient real-world operational data to leverage recent AI technologies. In this work, we introduce the Sea Situational Awareness (SeaSAw) dataset, a novel dataset comprising 1.9 million images with 14.6 million objects associated with 20.4 million attributes from 12 object classes, making it the largest maritime dataset for object detection, fine-grained classification and tracking. Furthermore, this dataset draws on 9 sources in combination with various RGB cameras, mounted on different moving vessels, operating in different geographic locations globally, with variations in scenario, weather and illumination conditions. This data collection took place across 4 years with rigorous efforts on data selection, annotation, management and analysis to advance marine perception technology.
Details
ISBN:
(print) 9781665487399
Beyond possessing a size large enough to feed data-hungry machines (e.g., transformers), what attributes measure the quality of a dataset? Assuming that definitions of such attributes exist, how do we quantify their relative presence? Our work explores these questions for video action detection. The task aims to spatio-temporally localize an actor and assign a relevant action class. We first analyze the existing datasets on video action detection and discuss their limitations. Next, we propose a new dataset, Multi Actor Multi Action (MAMA), which overcomes these limitations and is more suitable for real-world applications. In addition, we perform a bias study which analyzes a key property differentiating videos from static images: the temporal aspect. This reveals whether the actions in these datasets really need the motion information of an actor, or whether an action's occurrence can be predicted even from a single frame. Finally, we investigate the widely held assumptions on the importance of temporal ordering: is temporal ordering important for detecting these actions? Such extreme experiments show the existence of biases which have managed to creep into existing methods in spite of careful modeling.
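The temporal-ordering probe described above can be sketched generically: feed a model the clip and a shuffled copy of its frames, and check whether the prediction changes. Everything below (the probe, the toy model) is a hypothetical illustration, not the paper's experimental code.

```python
import random

def temporal_order_probe(predict, clip, seed=0):
    """Probe temporal-ordering bias: does a model's prediction
    survive a random shuffle of the clip's frames?

    `predict` is any clip -> label function. If predictions agree
    on most clips, the model is not using temporal order.
    """
    rng = random.Random(seed)
    shuffled = clip[:]
    rng.shuffle(shuffled)
    return predict(clip) == predict(shuffled)

# A toy order-blind model: it only looks at the multiset of frames,
# so shuffling cannot change its answer.
biased_model = lambda frames: max(set(frames), key=frames.count)
print(temporal_order_probe(biased_model, ["run", "run", "jump"]))  # True
```

A model that genuinely exploits motion would fail this probe on a meaningful fraction of clips, which is exactly the signal the bias study looks for.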
Details
ISBN:
(digital) 9781665487399
ISBN:
(print) 9781665487399
Multiple datasets and open challenges for object detection have been introduced in recent years. To build more general and powerful object detection systems, in this paper we construct a new large-scale benchmark termed BigDetection. Our goal is simply to leverage the training data from existing datasets (LVIS, OpenImages and Objects365) with carefully designed principles, and curate a larger dataset for improved detector pre-training. Specifically, we generate a new taxonomy which unifies the heterogeneous label spaces from different sources. Our BigDetection dataset has 600 object categories and contains over 3.4M training images with 36M bounding boxes. It is much larger in multiple dimensions than previous benchmarks, which offers both opportunities and challenges. Extensive experiments demonstrate its validity as a new benchmark for evaluating different object detection methods and its effectiveness as a pre-training dataset. The code and models are available at https://***/amazonresearch/bigdetection.
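The label-space unification step can be pictured as a per-source mapping into one shared taxonomy. The class names and mapping below are a toy example, not BigDetection's actual taxonomy.

```python
# Hypothetical per-source label mappings into a unified taxonomy.
SOURCE_TO_UNIFIED = {
    "lvis": {"sedan": "car", "taxi": "car", "puppy": "dog"},
    "openimages": {"Car": "car", "Dog": "dog"},
    "objects365": {"car": "car", "dog": "dog"},
}

def unify(source, label):
    """Map a source-specific label into the unified taxonomy,
    or None if the class is not covered by the taxonomy."""
    return SOURCE_TO_UNIFIED.get(source, {}).get(label)

print(unify("lvis", "taxi"))       # car
print(unify("openimages", "Dog"))  # dog
```

In practice the hard part is deciding which source classes are synonyms, hypernyms, or genuinely distinct; the lookup itself stays this simple once the taxonomy is fixed.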
Details
ISBN:
(digital) 9781665487399
ISBN:
(print) 9781665487399
Image-based virtual try-on strives to transfer the appearance of a clothing item onto the image of a target person. Existing literature focuses mainly on upper-body clothes (e.g. t-shirts, shirts, and tops) and neglects full-body or lower-body items. This shortcoming arises from one main factor: current publicly available datasets for image-based virtual try-on do not account for this variety, thus limiting progress in the field. In this work, we introduce Dress Code, a novel dataset which contains images of multi-category clothes. Dress Code is more than 3x larger than publicly available datasets for image-based virtual try-on and features high-resolution paired images (1024 x 768) with front-view, full-body reference models. To generate HD try-on images with high visual quality and rich detail, we propose to learn fine-grained discriminating features. Specifically, we leverage a semantic-aware discriminator that makes predictions at pixel level instead of image or patch level. The Dress Code dataset is publicly available at https://***/aimagelab/dress-code.
Details
ISBN:
(digital) 9781665487399
ISBN:
(print) 9781665487399
This paper reports our approach for the 2022 AI City Challenge - Naturalistic Driving Action Recognition (Track 3), where the objective is to detect when and what kinds of actions a driver performs in a long, untrimmed video. Our solution is built upon the single-stage ActionFormer detector, in which temporal location and classification are predicted simultaneously for efficiency. The input feature for the detector is extracted offline using our proposed backbone, which we named "ConvNeXt-Video". However, due to the small size of the dataset, training the model while avoiding over-fitting is challenging. To address this problem, we focus on training techniques that can improve the generalization of the underlying features. Specifically, we utilize two methods: "learning without forgetting" and semi-weakly supervised learning on the unlabeled data A2. Finally, we also add a second-stage classifier (SSC) using our ConvNeXt-Video backbone. The SSC is designed to combine information from multiple clips and multi-view cameras to improve prediction precision. Our best result achieves a 29.1 F1 score on the public test set. Our source code is released at link.
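The multi-clip, multi-view fusion in the second stage can be sketched as simple late fusion: average class probabilities across views (or clips) and take the argmax. This is a minimal stand-in, not the authors' SSC implementation, and the camera names are assumptions.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    s = sum(exps)
    return [e / s for e in exps]

def fuse_predictions(logits_per_view):
    """Average class probabilities over clips/camera views and
    return the index of the winning action class."""
    probs = [softmax(l) for l in logits_per_view]
    n_views, n_classes = len(probs), len(probs[0])
    fused = [sum(p[c] for p in probs) / n_views for c in range(n_classes)]
    return max(range(n_classes), key=fused.__getitem__)

views = [[2.0, 0.5, 0.1],   # dashboard camera (hypothetical)
         [1.5, 0.2, 0.0],   # right-side camera
         [0.3, 1.9, 0.2]]   # rear-view camera
print(fuse_predictions(views))  # 0
```

Averaging probabilities rather than raw logits keeps one over-confident view from dominating the fused decision.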
Details
ISBN:
(digital) 9781665487399
ISBN:
(print) 9781665487399
Semi-supervised object detection methods are widely used in autonomous driving systems, where only a fraction of objects are labeled. To propagate information from the labeled objects to the unlabeled ones, pseudo-labels for unlabeled objects must be generated. Although pseudo-labels have proven to improve the performance of semi-supervised object detection significantly, applying image-based methods to video frames results in numerous missed or false detections using such generated pseudo-labels. In this paper, we propose a new approach, PseudoProp, to generate robust pseudo-labels by leveraging motion continuity in video frames. Specifically, PseudoProp uses a novel bidirectional pseudo-label propagation approach to compensate for misdetection. A feature-based fusion technique is also used to suppress inference noise. Extensive experiments on the large-scale Cityscapes dataset demonstrate that our method outperforms state-of-the-art semi-supervised object detection methods by 7.4% on mAP(75).
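The motion-continuity intuition behind the propagation can be sketched crudely: a detection missed at frame t can be filled in from the detections at t-1 and t+1. The helper below is a toy linear interpolation, not PseudoProp's bidirectional, feature-fused method.

```python
def propagate_pseudo_labels(tracks):
    """Fill single-frame detection gaps by linear box interpolation.

    `tracks` maps frame index -> box (x, y, w, h), or None where the
    detector missed the object. A gap at frame t is filled from the
    boxes at t-1 and t+1, exploiting motion continuity between
    consecutive video frames.
    """
    for t in sorted(tracks):
        if tracks[t] is None and tracks.get(t - 1) and tracks.get(t + 1):
            prev, nxt = tracks[t - 1], tracks[t + 1]
            tracks[t] = tuple((a + b) / 2 for a, b in zip(prev, nxt))
    return tracks

dets = {0: (10, 10, 20, 20), 1: None, 2: (14, 12, 20, 20)}
print(propagate_pseudo_labels(dets)[1])  # (12.0, 11.0, 20.0, 20.0)
```

The real method additionally propagates in both temporal directions over longer gaps and fuses feature maps to suppress noisy pseudo-boxes; the sketch only shows why neighboring frames carry usable signal.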
Details
ISBN:
(digital) 9781665487399
ISBN:
(print) 9781665487399
Current neural networks are designed for high-performance GPUs/CPUs. However, implementing neural networks on emerging embedded sensors for inference is challenging due to such a sensor's unique hardware architecture and stringent computing resources. With this in mind, this work presents new methods to implement fully convolutional neural networks (FCNs) on Pixel Processor Array (PPA) sensors, with many techniques to fully use the limited resources on the sensor. Specifically, we, for the first time, design and train a binarized FCN with both binary weights and activations, using batch norm, group convolution, and a learnable threshold for binarization, producing networks small enough to be embedded on the focal plane of the PPA, with limited local memory resources, and using parallel elementary add/subtract, shifting, and bit operations only. We demonstrate the first implementation of an FCN on a PPA device, performing three convolution layers entirely in the pixel-level processors. We use this architecture to demonstrate inference generating heat maps for object segmentation and localisation at over 280 FPS using the SCAMP-5 PPA vision chip.
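Why binarization fits add/subtract-only hardware can be shown in a few lines: once weights and activations are in {+1, -1}, a dot product is just sums and differences. This is an illustrative sketch of the arithmetic, not the paper's trained network; the threshold stands in for the learned per-layer binarization threshold.

```python
def binarize(values, threshold=0.0):
    """Binarize values against a (learnable) threshold into {+1, -1}."""
    return [1 if v >= threshold else -1 for v in values]

def binary_dot(w_bits, x_bits):
    """Dot product of binarized weights and activations.

    With +/-1 operands every product is +1 or -1, so the whole
    reduction is adds and subtracts only - exactly the elementary
    operations a PPA's pixel-level processors provide.
    """
    return sum(w * x for w, x in zip(w_bits, x_bits))

acts = binarize([0.7, -0.2, 0.1, -0.9])
wts = binarize([0.3, 0.4, -0.5, -0.1])
print(acts)                   # [1, -1, 1, -1]
print(binary_dot(wts, acts))  # 0
```

On real hardware the +/-1 values are packed as bits and the products become XNOR-and-popcount; the arithmetic identity is the same.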
Details
ISBN:
(digital) 9781665487399
ISBN:
(print) 9781665487399
Neural Radiance Fields (NeRF) has emerged as the state-of-the-art method for novel view generation of complex scenes, but is very slow during inference. Recently, there have been multiple works on speeding up NeRF inference, but the state-of-the-art methods for real-time NeRF inference rely on caching the neural network output, which occupies several gigabytes of disk space and limits their real-world applicability. As caching the output of the original NeRF network is not feasible, Garbin et al. proposed "FastNeRF", which factorizes the problem into 2 subnetworks - one which depends only on the 3D coordinate of a sample point and one which depends only on the 2D camera viewing direction. Although this factorization enables them to reduce the cache size and perform inference at over 200 frames per second, the memory overhead is still substantial. In this work, we propose SqueezeNeRF, which is more than 60 times more memory-efficient than the sparse cache of FastNeRF and is still able to render at more than 190 frames per second on a high-spec GPU during inference.
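The memory argument behind the factorization can be made concrete with back-of-the-envelope arithmetic: caching a joint function of position and direction needs a 5D grid, while the factorized form needs one 3D grid plus one 2D grid. The resolutions below are toy assumptions, not the settings used by either paper.

```python
def grid_entries(resolutions):
    """Number of cache entries in a dense grid of the given shape."""
    n = 1
    for r in resolutions:
        n *= r
    return n

R = 256  # hypothetical samples per spatial axis (x, y, z)
D = 64   # hypothetical samples per viewing-direction axis

# Joint cache: one entry per (x, y, z, theta, phi) combination.
joint = grid_entries([R, R, R, D, D])
# Factorized cache: a 3D position grid plus a 2D direction grid.
factored = grid_entries([R, R, R]) + grid_entries([D, D])
print(joint // factored)  # 4095 (~ D*D reduction factor)
```

The reduction factor is roughly D*D because the direction grid's cost becomes negligible next to the position grid's; SqueezeNeRF's further factorization pushes the same idea another step.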
Details
ISBN:
(digital) 9781665487399
ISBN:
(print) 9781665487399
Existing continual learning techniques focus on either the task incremental learning (TIL) or the class incremental learning (CIL) problem, but not both. CIL and TIL differ mainly in that the task-id is provided for each test sample in TIL, but not in CIL. Continual learning methods intended for one problem have limitations on the other. This paper proposes a novel unified approach based on out-of-distribution (OOD) detection and task masking, called CLOM, to solve both problems. The key novelty is that each task is trained as an OOD detection model rather than a traditional supervised learning model, and a task mask is trained to protect each task from forgetting. Our evaluation shows that CLOM outperforms existing state-of-the-art baselines by large margins. The average TIL/CIL accuracy of CLOM over six experiments is 87.6/67.9%, while that of the best baselines is only 84.4/55.0%.
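How one set of per-task OOD models serves both settings can be sketched at inference time: with a task-id (TIL), classify within that task's model; without one (CIL), take the highest-scoring class across all task models. The score values and class names below are hypothetical, and this is a stand-in for the idea, not CLOM's implementation.

```python
def til_predict(ood_scores, task_id):
    """TIL inference: the task-id is given, so classify
    within that task's model only."""
    scores = ood_scores[task_id]
    return task_id, max(scores, key=scores.get)

def cil_predict(ood_scores):
    """CIL inference: no task-id, so pick the class with the
    highest score across all per-task OOD models. In-distribution
    samples score high only under their own task's model."""
    best = None
    for task_id, scores in ood_scores.items():
        for cls, s in scores.items():
            if best is None or s > best[2]:
                best = (task_id, cls, s)
    return best[0], best[1]

# Hypothetical per-task class scores from two task models.
scores = {0: {"cat": 0.9, "dog": 0.4},
          1: {"car": 0.7, "bus": 0.95}}
print(til_predict(scores, 0))  # (0, 'cat')
print(cil_predict(scores))     # (1, 'bus')
```

The OOD training objective is what makes the CIL branch work: a conventionally trained task model would assign confident scores to samples from other tasks, breaking the cross-task argmax.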