Search Results - Inner Mongolia University Library


Search query: "Any field = IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"

12,859 records in total; showing records 4411-4420

Unsupervised Visual Representation Learning by Tracking Patches in Video

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wang, Guangting; Zhou, Yizhou; Luo, Chong; Xie, Wenxuan; Zeng, Wenjun; Xiong, Zhiwei

Affiliations: Univ Sci & Technol China, Hefei, Anhui, Peoples R China; Microsoft Res Asia, Beijing, Peoples R China

ISBN: (print) 9781665445092

Inspired by the fact that human eyes continue to develop tracking ability in early and middle childhood, we propose to use tracking as a proxy task for a computer vision system to learn the visual representations. Modelled on the Catch game played by the children, we design a Catch-the-Patch (CtP) game for a 3D-CNN model to learn visual representations that would help with video-related tasks. In the proposed pretraining framework, we cut an image patch from a given video and let it scale and move according to a pre-set trajectory. The proxy task is to estimate the position and size of the image patch in a sequence of video frames, given only the target bounding box in the first frame. We discover that using multiple image patches simultaneously brings clear benefits. We further increase the difficulty of the game by randomly making patches invisible. Extensive experiments on mainstream benchmarks demonstrate the superior performance of CtP against other video pretraining methods. In addition, CtP-pretrained features are less sensitive to domain gaps than those trained by a supervised action recognition task. When both trained on Kinetics-400, we are pleasantly surprised to find that CtP-pretrained representation achieves much higher action classification accuracy than its fully supervised counterpart on Something-Something dataset.

Keywords: Visualization; Computer vision; Computational modeling; Training data; Games; Trajectory; Pattern recognition

Learning Graphs for Knowledge Transfer with Limited Labels

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Ghosh, Pallabi; Saini, Nirat; Davis, Larry S.; Shrivastava, Abhinav

Affiliations: Univ Maryland, College Pk, MD 20742, USA

ISBN: (print) 9781665445092

Fixed input graphs are a mainstay in approaches that utilize Graph Convolution Networks (GCNs) for knowledge transfer. The standard paradigm is to utilize relationships in the input graph to transfer information using GCNs from training to testing nodes in the graph; for example, the semi-supervised, zero-shot, and few-shot learning setups. We propose a generalized framework for learning and improving the input graph as part of the standard GCN-based learning setup. Moreover, we use additional constraints between similar and dissimilar neighbors for each node in the graph by applying triplet loss on the intermediate layer output. We present results of semi-supervised learning on Citeseer, Cora, and Pubmed benchmarking datasets, and zero/few-shot action recognition on UCF101 and HMDB51 datasets, significantly outperforming current approaches. We also present qualitative results visualizing the graph connections that our approach learns to update.

Keywords: Training; Visualization; Computer vision; Convolution; Semisupervised learning; Benchmark testing; Pattern recognition

Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Chen, Shaoxiang; Jiang, Yu-Gang

Affiliations: Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China

ISBN: (print) 9781665445092

Dense Event Captioning (DEC) aims to jointly localize and describe multiple events of interest in untrimmed videos, which is an advancement of the conventional video captioning task (generating a single sentence description for a trimmed video). Weakly Supervised Dense Event Captioning (WS-DEC) goes one step further by not relying on human-annotated temporal event boundaries. However, there are few methods trying to tackle this task, and how to connect localization and description remains an open problem. In this paper, we demonstrate that under weak supervision, the event captioning module and localization module should be more closely bridged in order to improve description performance. Different from previous approaches, in our method, the event captioner generates a sentence from a video segment and feeds it to the sentence localizer to reconstruct the segment, and the localizer produces word importance weights as a guidance for the captioner to improve event description. To further bridge the sentence localizer and event captioner, a concept learner is adopted as the basis of the sentence localizer, which can be utilized to construct an induced set of concept features to enhance video features and improve the event captioner. Finally, our proposed method outperforms state-of-the-art WS-DEC methods on the ActivityNet Captions dataset.

Keywords: Location awareness; Bridges; Computer vision; Training data; Pattern recognition; Feeds; Task analysis

Single Image Depth Prediction with Wavelet Decomposition

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Ramamonjisoa, Michael; Firman, Michael; Watson, Jamie; Lepetit, Vincent; Turmukhambetov, Daniyar

Affiliations: Univ Gustave Eiffel, CNRS, Ecole Ponts, IMAGINELIGM, Champs Sur Marne, Marne La Vallee, France; Niantic, San Francisco, CA, USA

ISBN: (print) 9781665445092

We present a novel method for predicting accurate depths from monocular images with high efficiency. This optimal efficiency is achieved by exploiting wavelet decomposition, which is integrated in a fully differentiable encoder-decoder architecture. We demonstrate that we can reconstruct high-fidelity depth maps by predicting sparse wavelet coefficients. In contrast with previous works, we show that wavelet coefficients can be learned without direct supervision on coefficients. Instead, we supervise only the final depth image that is reconstructed through the inverse wavelet transform. We additionally show that wavelet coefficients can be learned in fully self-supervised scenarios, without access to ground-truth depth. Finally, we apply our method to different state-of-the-art monocular depth estimation models, in each case giving similar or better results compared to the original model, while requiring less than half the multiply-adds in the decoder network.

Keywords: Analytical models; Neural networks; Estimation; Computer architecture; Wavelet analysis; Decoding; Pattern recognition

DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Johan Edstedt; Georg Bökman; Zhenjun Zhao

Affiliations: Linköping University; Chalmers University of Technology; The Chinese University of Hong Kong; Texas A&M University

ISBN: (digital) 9798350365474

ISBN: (print) 9798350365481

In this paper, we analyze and improve the recently proposed DeDoDe keypoint detector. We focus our analysis on some key issues. First, we find that DeDoDe keypoints tend to cluster together, which we fix by performing nonmax suppression on the target distribution of the detector during training. Second, we address issues related to data augmentation. In particular, the DeDoDe detector is sensitive to large rotations. We fix this by including 90-degree rotations as well as horizontal flips. Finally, the decoupled nature of the DeDoDe detector makes evaluation of downstream usefulness problematic. We fix this by matching the keypoints with a pretrained dense matcher (RoMa) and evaluating two-view pose estimates. We find that the original long training is detrimental to performance, and therefore propose a much shorter training schedule. We integrate all these improvements into our proposed detector DeDoDe v2 and evaluate it with the original DeDoDe descriptor on the MegaDepth-1500 and IMC2022 benchmarks. Our proposed detector significantly increases pose estimation results, notably from 75.9 to 78.3 mAA on the IMC2022 challenge. Code and weights are available at ***/Parskatt/DeDoDe.

Keywords: Training; Schedules; Computer vision; Codes; Conferences; Pose estimation; Detectors

iEdit: Localised Text-guided Image Editing with Weak Supervision

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Rumeysa Bodur; Erhan Gundogdu; Binod Bhattarai; Tae-Kyun Kim; Michael Donoser; Loris Bazzani

Affiliations: Imperial College London, UK; Amazon; University of Aberdeen, UK; KAIST, South Korea

ISBN: (digital) 9798350365474

ISBN: (print) 9798350365481

Diffusion models (DMs) can generate realistic images with text guidance using large-scale datasets. However, they demonstrate limited controllability on the generated images. We introduce iEdit, a novel method for text-guided image editing conditioned on a source image and textual prompt. As a fully-annotated dataset with target images does not exist, previous approaches perform subject-specific fine-tuning at test time or adopt contrastive learning without a target image, leading to issues on preserving source image fidelity. We propose to automatically construct a dataset derived from LAION-5B, containing pseudo-target images and descriptive edit prompts. The dataset allows us to incorporate a weakly-supervised loss function, generating the pseudo-target image from the source image’s latent noise conditioned on the edit prompt. To encourage localised editing we propose a loss function that uses segmentation masks to guide the editing during training and optionally at inference. Trained with limited GPU resources on the constructed dataset, our model outperforms counterparts in image fidelity, CLIP alignment score, and qualitatively for both generated and real images.

Keywords: Training; Image segmentation; Computer vision; Conferences; Noise; Graphics processing units; Contrastive learning

Semi-Supervised Action Recognition with Temporal Contrastive Learning

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Singh, Ankit; Chakraborty, Omprakash; Varshney, Ashutosh; Panda, Rameswar; Feris, Rogerio; Saenko, Kate; Das, Abir

Affiliations: IIT Madras, Chennai, Tamil Nadu, India; IIT Kharagpur, Kharagpur, W Bengal, India; MIT IBM Watson AI Lab, Cambridge, MA, USA; Boston Univ, Boston, MA 02215, USA

ISBN: (print) 9781665445092

Learning to recognize actions from only a handful of labeled videos is a challenging problem due to the scarcity of tediously collected activity labels. We approach this problem by learning a two-pathway temporal contrastive model using unlabeled videos at two different speeds, leveraging the fact that changing video speed does not change an action. Specifically, we propose to maximize the similarity between encoded representations of the same video at two different speeds as well as minimize the similarity between different videos played at different speeds. This way we use the rich supervisory information in terms of 'time' that is present in an otherwise unsupervised pool of videos. With this simple yet effective strategy of manipulating video playback rates, we considerably outperform video extensions of sophisticated state-of-the-art semi-supervised image recognition methods across multiple diverse benchmark datasets and network architectures. Interestingly, our proposed approach benefits from out-of-domain unlabeled videos, showing generalization and robustness. We also perform rigorous ablations and analysis to validate our approach.

Keywords: Computer vision; Image recognition; Semantics; Network architecture; Benchmark testing; Robustness; Pattern recognition

MTLSegFormer: Multi-task Learning with Transformers for Semantic Segmentation in Precision Agriculture

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Diogo Nunes Goncalves; Jose Marcato; Pedro Zamboni; Hemerson Pistori; Jonathan Li; Keiller Nogueira; Wesley Nunes Goncalves

Affiliations: Faculty of Computer Science, Federal University of Mato Grosso do Sul, MS, Brazil; Faculty of Engineering, Architecture and Urbanism and Geography, Federal University of Mato Grosso do Sul, MS, Brazil; INOVISAO, Dom Bosco Catholic University, MS, Brazil; Department of Geography and Environmental Management, University of Waterloo, Waterloo, Ontario, Canada; University of Stirling, Stirling, Scotland, UK

Multi-task learning has proven to be effective in improving the performance of correlated tasks. Most of the existing methods use a backbone to extract initial features with independent branches for each task, and the exchange of information between the branches usually occurs through the concatenation or sum of the feature maps of the branches. However, this type of information exchange does not directly consider the local characteristics of the image nor the level of importance or correlation between the tasks. In this paper, we propose a semantic segmentation method, MTLSegFormer, which combines multi-task learning and attention mechanisms. After the backbone feature extraction, two feature maps are learned for each task. The first map is proposed to learn features related to its task, while the second map is obtained by applying learned visual attention to locally re-weigh the feature maps of the other tasks. In this way, weights are assigned to local regions of the image of other tasks that have greater importance for the specific task. Finally, the two maps are combined and used to solve a task. We tested the performance in two challenging problems with correlated tasks and observed a significant improvement in accuracy, mainly in tasks with high dependence on the others.


Large-capacity Image Steganography Based on Invertible Neural Networks

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Lu, Shao-Ping; Wang, Rong; Zhong, Tao; Rosin, Paul L.

Affiliations: Nankai Univ, CS, TKLNDST, Tianjin, Peoples R China; Cardiff Univ, Sch Comp Sci & Informat, Cardiff, Wales

ISBN: (print) 9781665445092

Many attempts have been made to hide information in images, where one main challenge is how to increase the payload capacity without the container image being detected as containing a message. In this paper, we propose a large-capacity Invertible Steganography Network (ISN) for image steganography. We take steganography and the recovery of hidden images as a pair of inverse problems on image domain transformation, and then introduce the forward and backward propagation operations of a single invertible network to leverage the image embedding and extracting problems. Sharing all parameters of our single ISN architecture enables us to efficiently generate both the container image and the revealed hidden image(s) with high quality. Moreover, in our architecture the capacity of image steganography is significantly improved by naturally increasing the number of channels of the hidden image branch. Comprehensive experiments demonstrate that with this significant improvement of the steganography payload capacity, our ISN achieves state-of-the-art in both visual and quantitative comparisons.

Keywords: Backpropagation; Steganography; Visualization; Computer vision; Inverse problems; Neural networks; Computer architecture

Knowledge Distillation for Efficient Instance Semantic Segmentation with Transformers

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Maohui Li; Michael Halstead; Chris McCool

Affiliations: University of Bonn; Lamarr Institute for Machine Learning and Artificial Intelligence

ISBN: (digital) 9798350365474

ISBN: (print) 9798350365481

Instance-based semantic segmentation provides detailed per-pixel scene understanding information crucial for both computer vision and robotics applications. However, state-of-the-art approaches such as Mask2Former are computationally expensive and reducing this computational burden while maintaining high accuracy remains challenging. Knowledge distillation has been regarded as a potential way to compress neural networks, but to date limited work has explored how to apply this to distill information from the output queries of a model such as *** this paper, we match the output queries of the student and teacher models to enable a query-based knowledge distillation scheme. We independently match the teacher and the student to the groundtruth and use this to define the teacher to student relationship for knowledge distillation. Using this approach we show that it is possible to perform knowledge distillation where the student models can have a lower number of queries and the backbone can be changed from a Transformer architecture to a convolutional neural network architecture. Experiments on two challenging agricultural datasets, sweet pepper (BUP20) and sugar beet (SB20), and Cityscapes demonstrate the efficacy of our approach. Across the three datasets the student models obtain an average absolute performance improvement in AP of 1.8 and 1.9 points for ResNet-50 and Swin-Tiny backbone respectively. To the best of our knowledge, this is the first work to propose knowledge distillation schemes for instance semantic segmentation with transformer-based models.

Keywords: Knowledge engineering; Computer vision; Semantic segmentation; Computational modeling; Impedance matching; Neural networks; Computer architecture
