Optimal transport (OT) has become exceedingly popular in machine learning, data science, and computer vision. The core assumption in the OT problem is that the source and target measures have equal total mass, which limits its applicability. Optimal Partial Transport (OPT) is a recently proposed solution to this limitation. Similar to the OT problem, computing OPT relies on solving a linear program (often in high dimensions), which can become computationally prohibitive. In this paper, we propose an efficient algorithm for solving the OPT problem between two non-negative measures in one dimension. Next, following the idea of sliced OT distances, we utilize slicing to define the Sliced OPT distance. Finally, we demonstrate the computational and accuracy benefits of the Sliced OPT-based method in various numerical experiments. In particular, we show an application of our proposed Sliced OPT problem to noisy point cloud registration and color adaptation. Our code is available at Github Link.
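To make the slicing idea concrete, the following illustrative NumPy sketch estimates a sliced transport distance between two equal-size point clouds by projecting onto random directions and sorting each 1D problem; it shows only the standard balanced 1D case, and the function name and defaults are ours, not the paper's OPT solver, which replaces the sort-based step when the masses differ.

```python
# A minimal sketch of the slicing idea behind sliced transport distances:
# project both point clouds onto random 1D directions and solve each
# one-dimensional problem by sorting (the monotone matching is optimal
# for equal-mass empirical measures).  Illustrative only.
import numpy as np

def sliced_ot_distance(X, Y, n_projections=50, p=2, seed=0):
    """Monte-Carlo estimate of the sliced p-Wasserstein distance
    between two equal-size point clouds X, Y of shape (n, d)."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    total = 0.0
    for _ in range(n_projections):
        theta = rng.normal(size=d)
        theta /= np.linalg.norm(theta)          # random unit direction
        x_proj = np.sort(X @ theta)             # sorted 1D projections
        y_proj = np.sort(Y @ theta)
        total += np.mean(np.abs(x_proj - y_proj) ** p)
    return (total / n_projections) ** (1.0 / p)

# toy usage
X = np.random.randn(200, 3)
Y = np.random.randn(200, 3) + 1.0
print(sliced_ot_distance(X, Y))
```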
Visible-infrared recognition (VI recognition) is a challenging task due to the enormous visual difference across heterogeneous images. Most existing works achieve promising results via transfer learning, such as pretraining on ImageNet with advanced neural architectures like ResNet and ViT. However, such methods ignore the negative influence of the pretrained colour prior knowledge, and their heavy computational burden makes them hard to deploy in real-world scenarios with limited resources. In this paper, we propose a novel task-oriented pretrained lightweight neural network (TOPLight) for VI recognition. Specifically, TOPLight simulates domain conflict and sample variations with the proposed fake domain loss in the pretraining stage, which guides the network to learn how to handle these difficulties, so that a more general modality-shared feature representation is learned for the heterogeneous images. Moreover, an effective fine-grained dependency reconstruction module (FDR) is developed to discover substantial pattern dependencies shared between the two modalities. Extensive experiments on VI person re-identification and VI face recognition datasets demonstrate the superiority of the proposed TOPLight, which significantly outperforms the current state of the art while demanding fewer computational resources.
The ability to recognize, localize and track dynamic objects in a scene is fundamental to many real-world applications, such as self-driving and robotic systems. Yet, traditional multiple object tracking (MOT) benchmarks rely only on a few object categories that hardly represent the multitude of possible objects that are encountered in the real world. This leaves contemporary MOT methods limited to a small set of pre-defined object categories. In this paper, we address this limitation by tackling a novel task, open-vocabulary MOT, which aims to evaluate tracking beyond pre-defined training categories. We further develop OVTrack, an open-vocabulary tracker that is capable of tracking arbitrary object classes. Its design is based on two key ingredients: first, it leverages vision-language models for both classification and association via knowledge distillation; second, it employs a data hallucination strategy for robust appearance feature learning based on denoising diffusion probabilistic models. The result is an extremely data-efficient open-vocabulary tracker that sets a new state of the art on the large-scale, large-vocabulary TAO benchmark, while being trained solely on static images.
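For intuition about the vision-language ingredient, here is a hedged sketch of zero-shot classification against an arbitrary vocabulary using the open-source CLIP package; the file name, prompt template, and vocabulary are illustrative, and this is not OVTrack's distilled detection head or its association module.

```python
# Classify a detected object crop against a user-supplied vocabulary by
# comparing its CLIP image embedding with CLIP text embeddings.
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

vocabulary = ["dog", "skateboard", "traffic cone"]   # arbitrary, not fixed at training time
text_tokens = clip.tokenize([f"a photo of a {c}" for c in vocabulary]).to(device)

image = preprocess(Image.open("crop.jpg")).unsqueeze(0).to(device)  # hypothetical object crop

with torch.no_grad():
    img_emb = model.encode_image(image)
    txt_emb = model.encode_text(text_tokens)
    img_emb = img_emb / img_emb.norm(dim=-1, keepdim=True)
    txt_emb = txt_emb / txt_emb.norm(dim=-1, keepdim=True)
    probs = (100.0 * img_emb @ txt_emb.T).softmax(dim=-1)   # cosine-similarity logits

print(dict(zip(vocabulary, probs[0].tolist())))
```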
Current Dynamic Texture Synthesis (DyTS) models can synthesize realistic videos. However, they require a slow iterative optimization process to synthesize a single fixed-size short video, and they do not offer any post-training control over the synthesis process. We propose Dynamic Neural Cellular Automata (DyNCA), a framework for real-time and controllable dynamic texture synthesis. Our method is built upon the recently introduced NCA models and can synthesize infinitely long and arbitrary-sized realistic video textures in real time. We quantitatively and qualitatively evaluate our model and show that our synthesized videos appear more realistic than the existing results. We improve the SOTA DyTS performance by 2 ~ 4 orders of magnitude. Moreover, our model offers several real-time video controls including motion speed, motion direction, and an editing brush tool. We exhibit our trained models in an online interactive demo that runs on local hardware and is accessible on personal computers and smartphones.
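As a rough illustration of the NCA machinery the method builds on, the following PyTorch sketch implements one generic cellular-automaton update step (fixed identity/Sobel perception filters plus a small learned per-pixel update MLP with a stochastic update mask); the class, its sizes, and the rollout are our assumptions and omit DyNCA's motion objectives and multi-scale design.

```python
# One Neural Cellular Automaton update step in the spirit of NCA texture models.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleNCA(nn.Module):
    def __init__(self, channels=12, hidden=96):
        super().__init__()
        ident = torch.tensor([[0., 0., 0.], [0., 1., 0.], [0., 0., 0.]])
        sobel_x = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]]) / 8.0
        kernels = torch.stack([ident, sobel_x, sobel_x.t()])              # 3 fixed filters
        self.register_buffer("filters",
                             kernels.repeat(channels, 1, 1).unsqueeze(1))  # depthwise weights
        self.channels = channels
        self.update = nn.Sequential(
            nn.Conv2d(channels * 3, hidden, 1), nn.ReLU(),
            nn.Conv2d(hidden, channels, 1, bias=False),
        )

    def forward(self, state, fire_rate=0.5):
        # each cell perceives its 3x3 neighbourhood through the fixed filters
        perception = F.conv2d(state, self.filters, padding=1, groups=self.channels)
        delta = self.update(perception)
        # stochastic update mask keeps the dynamics asynchronous
        mask = (torch.rand_like(state[:, :1]) < fire_rate).float()
        return state + delta * mask

# toy rollout: RGB is read from the first 3 channels of the evolving state
nca = SimpleNCA()
state = torch.rand(1, 12, 64, 64)
for _ in range(32):
    state = nca(state)
rgb = state[:, :3].clamp(0, 1)
```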
We present an end-to-end trainable framework for P-frame compression in this paper. A joint motion vector (MV) and residual prediction network, MV-Residual, is designed to extract the ensembled features of motion representations and residual information by taking the two successive frames as inputs. The prior probability of the latent representations is modeled by a hyperprior auto-encoder and trained jointly with the MV-Residual network. Specifically, a spatially-displaced convolution is applied for video frame prediction, in which a motion kernel is learned for each pixel to generate the predicted pixel by applying the kernel at a displaced location in the source image. Finally, novel rate allocation and post-processing strategies are used to produce the final compressed bits, taking into account the bit constraint of the challenge. Experimental results on the validation set show that the proposed optimized framework achieves the highest MS-SSIM in the P-frame compression competition.
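For intuition, the sketch below shows a plain flow-based backward warp as a simplified stand-in for the motion-compensated prediction step; the actual spatially-displaced convolution additionally applies a learned per-pixel kernel at the displaced location, and all names and shapes here are illustrative.

```python
# Backward-warp a reference frame with a dense per-pixel displacement field,
# then add a decoded residual to form the predicted frame.  Simplified sketch.
import torch
import torch.nn.functional as F

def warp(reference, flow):
    """Warp `reference` (B, C, H, W) with `flow` (B, 2, H, W) given in pixels."""
    b, _, h, w = reference.shape
    ys, xs = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
    base = torch.stack((xs, ys), dim=0).float().to(reference.device)   # (2, H, W)
    coords = base.unsqueeze(0) + flow                                  # sampling locations
    # normalise to [-1, 1] for grid_sample (x first, then y)
    coords_x = 2.0 * coords[:, 0] / (w - 1) - 1.0
    coords_y = 2.0 * coords[:, 1] / (h - 1) - 1.0
    grid = torch.stack((coords_x, coords_y), dim=-1)                   # (B, H, W, 2)
    return F.grid_sample(reference, grid, align_corners=True)

# toy usage: predicted frame = warped reference + decoded residual
ref = torch.rand(1, 3, 64, 64)
flow = torch.zeros(1, 2, 64, 64)        # would come from the MV branch
residual = torch.zeros(1, 3, 64, 64)    # would come from the residual branch
predicted = warp(ref, flow) + residual
```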
This paper introduces the Neurodata Lab's approach presented at the 1st Challenge on Remote Physiological Signal Sensing (RePSS) organized within CVPR 2020. The RePSS challenge focused on measuring the average heart rate from color facial videos, which is one of the most fundamental problems in the field of computer vision. Our deep learning-based approach includes a 3D spatio-temporal attention convolutional neural network for photoplethysmogram extraction and a 1D convolutional neural network pretrained on synthetic data for time-series analysis. It provides state-of-the-art results, outperforming those of other participants on a mixture of the VIPL and OBF databases: MAE = 6.94 (12.3% improvement compared to the top-2 result), RMSE = 10.68 (24.6% improvement), Pearson R = 0.755 (28.2% improvement).
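As a minimal illustration of the final time-series stage, the following sketch estimates average heart rate from an extracted photoplethysmogram via a band-limited spectral peak; this simple baseline stands in for the paper's pretrained 1D CNN, and the function name and band limits are our assumptions.

```python
# Estimate average heart rate (bpm) from a PPG signal via its dominant
# frequency inside a plausible heart-rate band.
import numpy as np

def average_heart_rate(ppg, fs, lo_bpm=40.0, hi_bpm=180.0):
    """Average HR in beats per minute from a PPG signal sampled at fs Hz."""
    ppg = ppg - np.mean(ppg)
    spectrum = np.abs(np.fft.rfft(ppg))
    freqs = np.fft.rfftfreq(len(ppg), d=1.0 / fs)
    band = (freqs >= lo_bpm / 60.0) & (freqs <= hi_bpm / 60.0)
    peak_freq = freqs[band][np.argmax(spectrum[band])]
    return 60.0 * peak_freq

# toy usage: a noisy 1.25 Hz (75 bpm) pulse sampled at 30 fps for 20 s
fs = 30.0
t = np.arange(0, 20, 1 / fs)
ppg = np.sin(2 * np.pi * 1.25 * t) + 0.3 * np.random.randn(t.size)
print(average_heart_rate(ppg, fs))   # ~75
```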
Image-to-image translation is an important and challenging problem in computer vision and image processing. Diffusion models (DMs) have shown great potential for high-quality image synthesis and have achieved competitive performance on the task of image-to-image translation. However, most existing diffusion models treat image-to-image translation as a conditional generation process and suffer heavily from the gap between distinct domains. In this paper, a novel image-to-image translation method based on the Brownian Bridge Diffusion Model (BBDM) is proposed, which models image-to-image translation as a stochastic Brownian bridge process and learns the translation between two domains directly through the bidirectional diffusion process rather than a conditional generation process. To the best of our knowledge, this is the first work to propose a Brownian bridge diffusion process for image-to-image translation. Experimental results on various benchmarks demonstrate that the proposed BBDM model achieves competitive performance in terms of both visual inspection and measurable metrics.
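To make the bridge process concrete, here is a hedged sketch of sampling from a forward Brownian bridge between a source image and its target-domain counterpart: the marginal at step t interpolates the two endpoints and adds noise whose variance vanishes at both ends. The variance scale and discretisation below are illustrative assumptions rather than the paper's exact schedule.

```python
# Sample x_t from a discrete Brownian bridge pinned at x0 (t=0) and y (t=T).
import torch

def brownian_bridge_sample(x0, y, t, T, s=1.0):
    """Forward bridge sample between source x0 and target-domain image y."""
    m = t / T                                   # interpolation coefficient in [0, 1]
    var = s * 2.0 * m * (1.0 - m)               # noise variance, zero at both endpoints
    noise = torch.randn_like(x0)
    return (1.0 - m) * x0 + m * y + var ** 0.5 * noise

# toy usage on image-shaped tensors
x0 = torch.rand(1, 3, 32, 32)   # source-domain image
y = torch.rand(1, 3, 32, 32)    # paired target-domain image
xt = brownian_bridge_sample(x0, y, t=500, T=1000)
```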
We propose a visual-linguistic representation learning approach within a self-supervised learning framework by introducing a new operation, loss, and data augmentation strategy. First, we generate diverse features for the image-text matching (ITM) task by soft-masking the regions in an image that are most relevant to a certain word in the corresponding caption, instead of completely removing them. Since our framework relies only on image-caption pairs with no fine-grained annotations, we identify the regions relevant to each word by computing word-conditional visual attention with a multi-modal encoder. Second, we encourage the model to focus more on hard but diverse examples by proposing a focal loss for the image-text contrastive learning (ITC) objective, which alleviates the inherent limitations of overfitting and bias. Last, we perform multi-modal data augmentation for self-supervised learning by mining various examples through masking texts and rendering distortions on images. We show that the combination of these three innovations is effective for learning a pretrained model, leading to outstanding performance on multiple vision-language downstream tasks.
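As a rough sketch of what a focal ITC objective can look like, the following PyTorch snippet scales each matched pair's InfoNCE cross-entropy term by (1 - p)^gamma so that easy pairs contribute less and hard pairs dominate; the exact formulation in the paper may differ, and the function name and hyperparameters are assumptions.

```python
# Focal-weighted image-text contrastive loss over a batch of matched pairs.
import torch
import torch.nn.functional as F

def focal_itc_loss(img_emb, txt_emb, temperature=0.07, gamma=2.0):
    """img_emb, txt_emb: (B, D) L2-normalised embeddings of matched pairs."""
    logits = img_emb @ txt_emb.t() / temperature        # (B, B) similarity matrix
    targets = torch.arange(logits.size(0), device=logits.device)

    def one_direction(lg):
        log_p = F.log_softmax(lg, dim=-1)
        log_p_pos = log_p.gather(1, targets[:, None]).squeeze(1)   # log-prob of the true pair
        p_pos = log_p_pos.exp()
        return ((1.0 - p_pos) ** gamma * (-log_p_pos)).mean()      # focal modulation

    # symmetric image-to-text and text-to-image terms
    return 0.5 * (one_direction(logits) + one_direction(logits.t()))

# toy usage
img = F.normalize(torch.randn(8, 256), dim=-1)
txt = F.normalize(torch.randn(8, 256), dim=-1)
print(focal_itc_loss(img, txt))
```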
We present Iterative Vision-and-Language Navigation (IVLN), a paradigm for evaluating language-guided agents navigating in a persistent environment over time. Existing Vision-and-Language Navigation (VLN) benchmarks erase the agent's memory at the beginning of every episode, testing the ability to perform cold-start navigation with no prior information. However, deployed robots occupy the same environment for long periods of time. The IVLN paradigm addresses this disparity by training and evaluating VLN agents that maintain memory across tours of scenes that consist of up to 100 ordered instruction-following Room-to-Room (R2R) episodes, each defined by an individual language instruction and a target path. We present discrete and continuous Iterative Room-to-Room (IR2R) benchmarks comprising about 400 tours each in 80 indoor scenes. We find that extending the implicit memory of high-performing transformer VLN agents is not sufficient for IVLN, but agents that build maps can benefit from environment persistence, motivating a renewed focus on map-building agents in VLN.
Straight-through estimator (STE), which enables the gradient flow over the non-differentiable function via approximation, has been favored in studies related to quantization-aware training (QAT). However, STE incurs unstable convergence during QAT, resulting in notable quality degradation in low precision. Recently, pseudo-quantization training has been proposed as an alternative approach to updating the learnable parameters using the pseudo-quantization noise instead of STE. In this study, we propose a novel noise proxy-based integrated pseudo-quantization (NIPQ) that enables unified support of pseudo-quantization for both activation and weight by integrating the idea of truncation on the pseudo-quantization framework. NIPQ updates all of the quantization parameters (e.g., bit-width and truncation boundary) as well as the network parameters via gradient descent without STE instability. According to our extensive experiments, NIPQ outperforms existing quantization algorithms in various vision and language applications by a large margin.
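For intuition about pseudo-quantization training without STE, the sketch below perturbs clipped weights with uniform noise of the step size so that the step and truncation bound remain differentiable; it illustrates the general mechanism only, not NIPQ's exact parameterisation of bit-width and truncation, and all names are ours.

```python
# Pseudo-quantization: replace hard rounding (and its STE gradient) with
# uniform noise of the quantization step during training.
import torch

def pseudo_quantize(w, step, clip_val, training=True):
    """Simulate quantization of `w` with a learnable step size and truncation bound."""
    w_c = torch.minimum(torch.maximum(w, -clip_val), clip_val)   # differentiable truncation
    if training:
        noise = (torch.rand_like(w_c) - 0.5) * step              # U(-step/2, step/2)
        return w_c + noise                                       # no round(), no STE needed
    return torch.round(w_c / step) * step                        # hard quantization at inference

# toy usage with learnable quantization parameters
w = torch.randn(64, 64, requires_grad=True)
step = torch.tensor(0.05, requires_grad=True)
clip_val = torch.tensor(1.0, requires_grad=True)
loss = pseudo_quantize(w, step, clip_val).pow(2).mean()
loss.backward()                                                  # gradients reach w, step, clip_val
print(step.grad, clip_val.grad)
```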