检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

12,844 篇 会议
13 篇 期刊文献
2 册 图书

馆藏范围

12,859 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

7,573 篇 工学
- 6,863 篇 计算机科学与技术...
- 880 篇 机械工程
- 814 篇 软件工程
- 435 篇 控制科学与工程
- 360 篇 光学工程
- 306 篇 电气工程
- 209 篇 仪器科学与技术
- 124 篇 信息与通信工程
- 91 篇 生物工程
- 62 篇 生物医学工程（可授...
- 39 篇 电子科学与技术（可...
- 34 篇 安全科学与工程
- 26 篇 化学工程与技术
- 21 篇 交通运输工程
- 20 篇 建筑学
- 18 篇 土木工程
2,957 篇 医学
- 2,956 篇 临床医学
- 15 篇 基础医学(可授医学...
- 12 篇 药学(可授医学、理...
700 篇 理学
- 359 篇 物理学
- 225 篇 数学
- 175 篇 系统科学
- 95 篇 统计学（可授理学、...
- 93 篇 生物学
- 22 篇 化学
201 篇 艺术学
- 201 篇 设计学（可授艺术学...
84 篇 管理学
- 59 篇 图书情报与档案管...
- 25 篇 管理科学与工程(可...
- 14 篇 工商管理
23 篇 法学
- 21 篇 社会学
5 篇 农学
4 篇 教育学
2 篇 经济学
1 篇 军事学

主题

6,464 篇 computer vision
2,688 篇 training
2,437 篇 pattern recognit...
1,780 篇 computational mo...
1,522 篇 visualization
1,348 篇 three-dimensiona...
1,091 篇 computer archite...
1,063 篇 semantics
997 篇 benchmark testin...
976 篇 codes
970 篇 conferences
854 篇 feature extracti...
830 篇 cameras
771 篇 task analysis
707 篇 deep learning
646 篇 image segmentati...
611 篇 object detection
595 篇 shape
554 篇 transformers
538 篇 neural networks

机构

132 篇 univ sci & techn...
122 篇 carnegie mellon ...
120 篇 tsinghua univ pe...
114 篇 univ chinese aca...
113 篇 chinese univ hon...
94 篇 tsinghua univers...
91 篇 zhejiang univ pe...
91 篇 swiss fed inst t...
85 篇 peng cheng lab p...
81 篇 university of ch...
80 篇 zhejiang univers...
77 篇 shanghai ai lab ...
77 篇 peng cheng labor...
75 篇 university of sc...
69 篇 shanghai jiao to...
68 篇 shanghai jiao to...
67 篇 alibaba grp peop...
67 篇 stanford univ st...
66 篇 univ hong kong p...
64 篇 sensetime res pe...

作者

77 篇 timofte radu
63 篇 van gool luc
45 篇 zhang lei
36 篇 yang yi
36 篇 luc van gool
34 篇 tao dacheng
31 篇 loy chen change
29 篇 chen chen
28 篇 sun jian
28 篇 qi tian
25 篇 li xin
24 篇 liu yang
24 篇 tian qi
24 篇 ying shan
23 篇 wang xinchao
23 篇 zha zheng-jun
23 篇 boxin shi
21 篇 zhou jie
21 篇 vasconcelos nuno
20 篇 luo ping

语言

12,851 篇 英文
7 篇 其他
1 篇 中文

检索条件"任意字段=IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"

共 12859 条记录，以下是4861-4870 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond

BasicVSR: The Search for Essential Components in Video Super...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Chan, Kelvin C. K. Wang, Xintao Yu, Ke Dong, Chao Loy, Chen Change Nanyang Technol Univ S Lab Singapore Singapore Tencent PCG Appl Res Ctr Shenzhen Peoples R China Chinese Univ Hong Kong CUHK SenseTime Joint Lab Hong Kong Peoples R China Chinese Acad Sci SIAT SenseTime Joint Lab Shenzhen Key Lab Comp Vis & Pattern Recognit Shenzhen Inst Adv Technol Beijing Peoples R China Shenzhen Inst Artificial Intelligence & Robot Soc SIAT Branch Shenzhen Peoples R China

ISBN: (纸本)9781665445092

Video super-resolution (VSR) approaches tend to have more components than the image counterparts as they need to exploit the additional temporal dimension. Complex designs are not uncommon. In this study, we wish to untangle the knots and reconsider some most essential components for VSR guided by four basic functionalities, i.e., Propagation, Alignment, Aggregation, and Upsampling. By reusing some existing components added with minimal redesigns, we show a succinct pipeline, BasicVSR, that achieves appealing improvements in terms of speed and restoration quality in comparison to many state-of-the-art algorithms. We conduct systematic analysis to explain how such gain can be obtained and discuss the pitfalls. We further show the extensibility of BasicVSR by presenting an information-refill mechanism and a coupled propagation scheme to facilitate information aggregation. The BasicVSR and its extension, IconVSR, can serve as strong baselines for future VSR approaches.

关键词： computer vision Systematics Superresolution Pipelines Noise reduction computer architecture pattern recognition

来源：评论

学校读者我要写书评

暂无评论

BABEL: Bodies, Action and Behavior with English Labels

BABEL: Bodies, Action and Behavior with English Labels

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Punnakkal, Abhinanda R. Chandrasekaran, Arjun Athanasiou, Nikos Quiros-Ramirez, Alejandra Black, Michael J. Max Planck Inst Intelligent Syst Tubingen Germany Univ Konstanz Constance Germany

ISBN: (数字)9781665445092

ISBN: (纸本)9781665445092

Understanding the semantics of human movement - the what, how and why of the movement - is an important problem that requires datasets of human actions with semantic labels. Existing datasets take one of two approaches. Large-scale video datasets contain many action labels but do not contain ground-truth 3D human motion. Alternatively, motion-capture (mocap) datasets have precise body motions but are limited to a small number of actions. To address this, we present BABEL, a large dataset with language labels describing the actions being performed in mocap sequences. BABEL consists of language labels for over 43 hours of mocap sequences from AMASS, containing over 250 unique actions. Each action label in BABEL is precisely aligned with the duration of the corresponding action in the mocap sequence. BABELalso allows overlap of multiple actions, that may each span different durations. This results in a total of over 66000 action segments. The dense annotations can be leveraged for tasks like action recognition, temporal localization, motion synthesis, etc. To demonstrate the value of BABEL as a benchmark, we evaluate the performance of models on 3D action recognition. We demonstrate that BABEL poses interesting learning challenges that are applicable to real-world scenarios, and can serve as a useful benchmark for progress in 3D action recognition.

关键词： Location awareness Solid modeling computer vision Three-dimensional displays Motion segmentation Semantics Benchmark testing

来源：评论

学校读者我要写书评

暂无评论

Debiased Subjective Assessment of Real-World Image Enhancement

Debiased Subjective Assessment of Real-World Image Enhanceme...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Cao, Peibei Wang, Zhangyang Ma, Kede City Univ Hong Kong Hong Kong Peoples R China Univ Texas Austin Austin TX 78712 USA

ISBN: (纸本)9781665445092

In real-world image enhancement, it is often challenging (if not impossible) to acquire ground-truth data, preventing the adoption of distance metrics for objective quality assessment. As a result, one often resorts to subjective quality assessment, the most straightforward and reliable means of evaluating image enhancement. Conventional subjective testing requires manually pre-selecting a small set of visual examples, which may suffer from three sources of biases: 1) sampling bias due to the extremely sparse distribution of the selected samples in the image space;2) algorithmic bias due to potential overfitting the selected samples;3) subjective bias due to further potential cherry-picking test results. This eventually makes the field of real-world image enhancement more of an art than a science. Here we take steps towards debiasing conventional subjective assessment by automatically sampling a set of adaptive and diverse images for subsequent testing. This is achieved by casting sample selection into a joint maximization of the discrepancy between the enhancers and the diversity among the selected input images. Careful visual inspection on the resulting enhanced images provides a debiased ranking of the enhancement algorithms. We demonstrate our subjective assessment method using three popular and practically demanding image enhancement tasks: dehazing, super-resolution, and low-light enhancement.

关键词： Photography Measurement Visualization Machine vision Superresolution Quality assessment pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Simple In-place Data Augmentation for Surveillance Object Detection

Simple In-place Data Augmentation for Surveillance Object De...

引用

ieee computer Society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Munkh-Erdene Otgonbold Ganzorig Batnasan Munkhjargal Gochoo Department of Computer Science and Software Engineering United Arab Emirates University UAE Department of Electronics Mongolian University of Science and Technology Mongolia Emirates Center for Mobility Research United Arab Emirates University UAE

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Motivated by the need to improve model performance in traffic monitoring tasks with limited labeled samples, we propose a straightforward augmentation technique tailored for object detection datasets, specifically designed for stationary camera-based applications. Our approach focuses on placing objects in the same positions as the originals to ensure its effectiveness. By applying in-place augmentation on objects from the same camera input image, we address the challenge of overlapping with original and previously selected objects. Through extensive testing on two traffic monitoring datasets, we illustrate the efficacy of our augmentation strategy in improving model performance, particularly in scenarios with limited labeled samples and imbalanced class distributions. Notably, our method achieves comparable performance to models trained on the entire dataset while utilizing only 8.5 percent of the original data. Moreover, we report significant improvements, with mAP@.5 increasing from 0.4798 to 0.5025, and the mAP@.5:.95 rising from 0.29 to 0.3138 on the FishEye8K dataset. These results highlight the potential of our augmentation approach in enhancing object detection models for traffic monitoring applications.

关键词： Image segmentation Surveillance Scalability conferences Object detection Data augmentation Cameras

来源：评论

学校读者我要写书评

暂无评论

TAME: Task Agnostic Continual Learning using Multiple Experts

TAME: Task Agnostic Continual Learning using Multiple Expert...

引用

ieee computer Society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Haoran Zhu Maryam Majzoubi Arihant Jain Anna Choromanska New York University Google

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

The goal of lifelong learning is to continuously learn from non-stationary distributions, where the non-stationarity is typically imposed by a sequence of distinct tasks. Prior works have mostly considered idealistic settings, where the identity of tasks is known at least at training. In this paper we focus on a fundamentally harder, so-called task-agnostic, setting where the task identities are not known and the learning machine needs to infer them from the observations. Our algorithm, which we call TAME (Task-Agnostic continual learning using Multiple Experts), automatically detects the shift in data distributions and switches between task expert networks in an online manner. At training, the strategy for switching between tasks hinges on an extremely simple observation that for each new coming task there occurs a statistically-significant deviation in the value of the loss function that marks the onset of this new task. At inference, the switching between experts is governed by the selector network that forwards the test sample to its relevant expert network. The selector network is trained on a small subset of data drawn uniformly at random. We control the growth of the task expert networks as well as selector network by employing pruning. Our experimental results show the efficacy of our approach on benchmark continual learning data sets, outperforming the previous task-agnostic methods and even the techniques that admit task identities at both training and testing, while at the same time using a comparable model size.

关键词： Training Continuing education computer vision conferences Computational modeling Switches Fasteners

来源：评论

学校读者我要写书评

暂无评论

ACRE: Abstract Causal REasoning Beyond Covariation

ACRE: Abstract Causal REasoning Beyond Covariation

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Zhang, Chi Jia, Baoxiong Edmonds, Mark Zhu, Song-Chun Zhu, Yixin UCLA Ctr Vis Cognit Learning & Auton Los Angeles CA 90095 USA

ISBN: (纸本)9781665445092

Causal induction, i.e., identifying unobservable mechanisms that lead to the observable relations among variables, has played a pivotal role in modern scientific discovery, especially in scenarios with only sparse and limited data. Humans, even young toddlers, can induce causal relationships surprisingly well in various settings despite its notorious difficulty. However, in contrast to the commonplace trait of human cognition is the lack of a diagnostic benchmark to measure causal induction for modern Artificial Intelligence (AI) systems. Therefore, in this work, we introduce the Abstract Causal REasoning (ACRE) dataset for systematic evaluation of current vision systems in causal induction. Motivated by the stream of research on causal discovery in Blicket experiments, we query a visual reasoning system with the following four types of questions in either an independent scenario or an interventional scenario: direct, indirect, screening-off, and backward-blocking, intentionally going beyond the simple strategy of inducing causal relationships by covariation. By analyzing visual reasoning architectures on this testbed, we notice that pure neural models tend towards an associative strategy under their chance-level performance, whereas neuro-symbolic combinations struggle in backward-blocking reasoning. These deficiencies call for future research in models with a more comprehensive capability of causal induction.

关键词： Visualization Pediatrics Systematics Benchmark testing Particle measurements Cognition pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Unsupervised Human Pose Estimation through Transforming Shape Templates

Unsupervised Human Pose Estimation through Transforming Shap...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Schmidtke, Luca Vlontzos, Athanasios Ellershaw, Simon Lukens, Anna Arichi, Tomoki Kainz, Bernhard Imperial Coll London London England Kings Coll London London England Evelina London Childrens Hosp London England

ISBN: (纸本)9781665445092

Human pose estimation is a major computer vision problem with applications ranging from augmented reality and video capture to surveillance and movement tracking. In the medical context, the latter may be an important biomarker for neurological impairments in infants. Whilst many methods exist, their application has been limited by the need for well annotated large datasets and the inability to generalize to humans of different shapes and body compositions, e.g. children and infants. In this paper we present a novel method for learning pose estimators for human adults and infants in an unsupervised fashion. We approach this as a learnable template matching problem facilitated by deep feature extractors. Human-interpretable landmarks are estimated by transforming a template consisting of predefined body parts that are characterized by 2D Gaussian distributions. Enforcing a connectivity prior guides our model to meaningful human shape representations. We demonstrate the effectiveness of our approach on two different datasets including adults and infants.

关键词： Pediatrics computer vision Three-dimensional displays Shape Tracking Surveillance Pose estimation

来源：评论

学校读者我要写书评

暂无评论

Probing Conceptual Understanding of Large Visual-Language Models

Probing Conceptual Understanding of Large Visual-Language Mo...

引用

ieee computer Society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Madeline Schiappa Raiyaan Abdullah Shehreen Azad Jared Claypoole Michael Cogswell Ajay Divakaran Yogesh Rawat Center for Research in Computer Vision University of Central Florida SRI International

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

In recent years large visual-language (V+L) models have achieved great success in various downstream tasks. However, it is not well studied whether these models have a conceptual grasp of the visual content. In this work we focus on conceptual understanding of these large V+L models. To facilitate this study, we propose novel benchmarking datasets for probing three different aspects of content understanding, 1) relations, 2) composition, and 3) context. Our probes are grounded in cognitive science and help determine if a V+L model can, for example, determine if snow garnished with a man is implausible, or if it can identify beach furniture by knowing it is located on a beach. We experimented with many recent state-of-the-art V+L models and observe that these models mostly fail to demonstrate a conceptual understanding. This study reveals several interesting insights such as that cross-attention helps learning conceptual understanding, and that CNNs are better with texture and patterns, while Transformers are better at color and shape. We further utilize some of these insights and investigate a simple finetuning technique that rewards the three conceptual understanding measures with promising initial results. The proposed benchmarks will drive the community to delve deeper into conceptual understanding and foster advancements in the capabilities of large V+L models. The code and dataset is available at: https://***/vlm-robustness

关键词： Training Visualization Shape Snow Color Benchmark testing Transformers

来源：评论

学校读者我要写书评

暂无评论

Discrimination-Aware Mechanism for Fine-grained Representation Learning

Discrimination-Aware Mechanism for Fine-grained Representati...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Xu, Furong Wang, Meng Zhang, Wei Cheng, Yuan Chu, Wei Ant Financial Serv Grp Hangzhou Peoples R China

ISBN: (纸本)9781665445092

Recently, with the emergence of retrieval requirements for certain individual in the same superclass, e.g., birds, persons, cars, fine-grained recognition task has attracted a significant amount of attention from academia and industry. In fine-grained recognition scenario, the inter-class differences are quite diverse and subtle, which makes it challenging to extract all the discriminative cues. Traditional training mechanism optimizes the overall discriminativeness of the whole feature. It may stop early when some feature elements has been trained to distinguish training samples well, leaving other elements insufficiently trained for a feature. This would result in a less generalizable feature extractor that only captures major discriminative cues and ignores subtle ones. Therefore, there is a need for a training mechanism that enforces the discriminativeness of all the elements in the feature to capture more the subtle visual cues. In this paper, we propose a Discrimination-Aware Mechanism (DAM) that iteratively identifies insufficiently trained elements and improves them. DAM is able to increase the number of well learned elements, which captures more visual cues by the feature extractor. In this way, a more informative representation is learned, which brings better generalization performance. We show that DAM can be easily applied to both proxy-based and pair-based loss functions, and thus can be used in most existing fine-grained recognition paradigms. Comprehensive experiments on CUB200-2011, Cars196, Market-1501, and MSMT17 datasets demonstrate the advantages of our DAM based loss over the related state-of-the-art approaches.

关键词： Training Industries Visualization computer vision Dams Feature extraction Birds

来源：评论

学校读者我要写书评

暂无评论

Lens-to-Lens Bokeh Effect Transformation. NTIRE 2023 Challenge Report

Lens-to-Lens Bokeh Effect Transformation. NTIRE 2023 Challen...

引用

ieee computer Society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Marcos V. Conde Manuel Kolmet Tim Seizinger Tom E. Bishop Radu Timofte Xiangyu Kong Dafeng Zhang Jinlong Wu Fan Wang Juewen Peng Zhiyu Pan Chengxin Liu Xianrui Luo Huiqiang Sun Liao Shen Zhiguo Cao Ke Xian Chaowei Liu Zigeng Chen Xingyi Yang Songhua Liu Yongcheng Jing Michael Bi Mi Xinchao Wang Zhihao Yang Wenyi Lian Siyuan Lai Haichuan Zhang Trung Hoang Amirsaeed Yazdani Vishal Monga Ziwei Luo Fredrik K. Gustafsson Zheng Zhao Jens Sjölund Thomas B. Schön Yuxuan Zhao Baoliang Chen Yiqing Xu JiXiangNiu Computer Vision Lab University of Würzburg Germany Glass Imaging Inc CA

We present the new Bokeh Effect Transformation Dataset (BETD), and review the proposed solutions for this novel task at the NTIRE 2023 Bokeh Effect Transformation Challenge. Recent advancements of mobile photography aim to reach the visual quality of full-frame cameras. Now, a goal in computational photography is to optimize the Bokeh effect itself, which is the aesthetic quality of the blur in out-of-focus areas of an image. Photographers create this aesthetic effect by benefiting from the lens optical *** aim of this work is to design a neural network capable of converting the the Bokeh effect of one lens to the effect of another lens without harming the sharp foreground regions in the image. For a given input image, knowing the target lens type, we render or transform the Bokeh effect accordingly to the lens properties. We build the BETD using two full-frame Sony cameras, and diverse lens *** the best of our knowledge, we are the first attempt to solve this novel task, and we provide the first BETD dataset and benchmark for it. The challenge had 99 registered participants. The submitted methods gauge the state-of-the-art in Bokeh effect rendering and transformation.

关键词：

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 483 484 485 486 487 488 489 490 491 492 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：