检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

22,774 篇 会议
111 篇 期刊文献
23 册 图书

馆藏范围

22,907 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

13,400 篇 工学
- 10,880 篇 计算机科学与技术...
- 3,450 篇 软件工程
- 2,429 篇 机械工程
- 1,723 篇 光学工程
- 1,011 篇 控制科学与工程
- 998 篇 电气工程
- 761 篇 信息与通信工程
- 393 篇 仪器科学与技术
- 337 篇 生物工程
- 257 篇 生物医学工程（可授...
- 214 篇 电子科学与技术（可...
- 113 篇 化学工程与技术
- 112 篇 安全科学与工程
- 98 篇 测绘科学与技术
- 93 篇 交通运输工程
- 86 篇 建筑学
- 82 篇 土木工程
3,361 篇 医学
- 3,347 篇 临床医学
- 79 篇 基础医学(可授医学...
3,251 篇 理学
- 1,953 篇 物理学
- 1,665 篇 数学
- 567 篇 统计学（可授理学、...
- 484 篇 生物学
- 245 篇 系统科学
- 109 篇 化学
506 篇 管理学
- 299 篇 图书情报与档案管...
- 219 篇 管理科学与工程(可...
- 75 篇 工商管理
252 篇 艺术学
- 252 篇 设计学（可授艺术学...
62 篇 法学
- 59 篇 社会学
40 篇 农学
25 篇 教育学
19 篇 经济学
11 篇 军事学
3 篇 文学

主题

10,126 篇 computer vision
4,026 篇 pattern recognit...
2,900 篇 training
1,958 篇 computational mo...
1,792 篇 cameras
1,759 篇 visualization
1,484 篇 shape
1,466 篇 image segmentati...
1,445 篇 feature extracti...
1,412 篇 three-dimensiona...
1,288 篇 robustness
1,170 篇 computer archite...
1,146 篇 layout
1,142 篇 computer science
1,134 篇 semantics
1,071 篇 object detection
1,043 篇 conferences
1,009 篇 benchmark testin...
967 篇 codes
810 篇 face recognition

机构

135 篇 univ sci & techn...
118 篇 univ chinese aca...
118 篇 chinese univ hon...
110 篇 carnegie mellon ...
99 篇 tsinghua univers...
99 篇 microsoft resear...
94 篇 swiss fed inst t...
92 篇 zhejiang univ pe...
82 篇 university of sc...
81 篇 zhejiang univers...
77 篇 shanghai ai lab ...
77 篇 university of ch...
72 篇 shanghai jiao to...
68 篇 microsoft res as...
65 篇 national laborat...
65 篇 alibaba grp peop...
63 篇 adobe research
63 篇 tsinghua univ pe...
60 篇 peking univ peop...
59 篇 peng cheng labor...

作者

78 篇 van gool luc
72 篇 timofte radu
63 篇 zhang lei
45 篇 luc van gool
40 篇 yang yi
37 篇 loy chen change
33 篇 xiaoou tang
33 篇 li stan z.
33 篇 qi tian
32 篇 sun jian
31 篇 liu yang
31 篇 li fei-fei
30 篇 chen chen
30 篇 tian qi
30 篇 pascal fua
29 篇 darrell trevor
28 篇 ying shan
27 篇 li xin
27 篇 vasconcelos nuno
27 篇 hanqing lu

语言

22,719 篇 英文
162 篇 其他
20 篇 中文
5 篇 土耳其文
2 篇 日文

检索条件"任意字段=1994 IEEE Computer-Society Conference on Computer Vision and Pattern Recognition"

共 22908 条记录，以下是4651-4660 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Learning Tracking Representations from Single Point Annotations

Learning Tracking Representations from Single Point Annotati...

引用

ieee computer society conference on computer vision and pattern recognition Workshops (CVPRW)

作者： Qiangqiang Wu Antoni B. Chan Department of Computer Science City University of Hong Kong

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Existing deep trackers are typically trained with large-scale video frames with annotated bounding boxes. However, these bounding boxes are expensive and time-consuming to annotate, in particular for large scale datasets. In this paper, we propose to learn tracking representations from single point annotations (i.e., 4.5 × faster to annotate than the traditional bounding box) in a weakly supervised manner. Specifically, we propose a soft contrastive learning (SoCL) framework that incorporates target objectness prior into end-to-end contrastive learning. Our SoCL consists of adaptive positive and negative sample generation, which is memory-efficient and effective for learning tracking representations. We apply the learned representation of SoCL to visual tracking and show that our method can 1) achieve better performance than the fully supervised baseline trained with box annotations under the same annotation time cost; 2) achieve comparable performance of the fully supervised baseline by using the same number of training frames and meanwhile reducing annotation time cost by 78% and total fees by 85%; 3) be robust to annotation noise.

关键词： Training Visualization Target tracking Costs Correlation Annotations Noise

来源：评论

学校读者我要写书评

暂无评论

CLIP-Guided vision-Language Pre-training for Question Answering in 3D Scenes

CLIP-Guided Vision-Language Pre-training for Question Answer...

引用

ieee computer society conference on computer vision and pattern recognition Workshops (CVPRW)

作者： Maria Parelli Alexandros Delitzas Nikolas Hars Georgios Vlassis Sotirios Anagnostidis Gregor Bachmann Thomas Hofmann ETH Zurich Switzerland

Training models to apply linguistic knowledge and visual concepts from 2D images to 3D world understanding is a promising direction that researchers have only recently started to explore. In this work, we design a novel 3D pre-training vision-Language method that helps a model learn semantically meaningful and transferable 3D scene point cloud representations. We inject the representational power of the popular CLIP model into our 3D encoder by aligning the encoded 3D scene features with the corresponding 2D image and text embeddings produced by CLIP. To assess our model’s 3D world reasoning capability, we evaluate it on the downstream task of 3D Visual Question Answering. Experimental quantitative and qualitative results show that our pre-training method outperforms state-of-the-art works in this task and leads to an interpretable representation of 3D scene features.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Improving Object Detection to Fisheye Cameras with Open-Vocabulary Pseudo-Label Approach

Improving Object Detection to Fisheye Cameras with Open-Voca...

引用

ieee computer society conference on computer vision and pattern recognition Workshops (CVPRW)

作者： Long Hoang Pham Quoc Pham-Nam Ho Duong Nguyen-Ngoc Tran Tai Huu-Phuong Tran Huy-Hung Nguyen Duong Khac Vu Chi Dai Tran Ngoc Doan-Minh Huynh Hyung-Min Jeon Hyung-Joon Jeon Jae Wook Jeon Department of Electrical and Computer Engineering Sungkyunkwan University

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Fish-eye cameras have long been employed in traffic surveillance systems to allow for wider observation of the roads. Despite their widespread use, limited computer vision research is tailored explicitly to images captured by fish-eye cameras. The AI City Challenge 2024 - Track 4 introduces a novel fish-eye camera dataset for the 2D road object detection task. This paper proposes a framework designed to detect objects in fish-eye camera images. Our approach involves several key steps: first, we generate image data to bridge the representation gap between day and night images. Next, we leverage zero-shot open vocabulary detection to produce pseudo-labels, aiding in training supervised object detection models. Additionally, we optimize the model’s hyper-parameters and inference configuration for better performance. Finally, we apply various post-processing techniques to enhance detection performance. Our solution achieves a final F1 score of 0.6194 in the AI City Challenge 2024 - Track 4, ranking third among competing teams. The source code is available at GitHub Repo.

关键词： Training computer vision Adaptation models Vocabulary Roads Urban areas Object detection

来源：评论

学校读者我要写书评

暂无评论

Cross-Domain Gradient Discrepancy Minimization for Unsupervised Domain Adaptation

Cross-Domain Gradient Discrepancy Minimization for Unsupervi...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Du, Zhekai Li, Jingjing Su, Hongzu Zhu, Lei Lu, Ke Univ Elect Sci & Technol China Chengdu Sichuan Peoples R China Shandong Normal Univ Jinan Shandong Peoples R China

ISBN: (纸本)9781665445092

Unsupervised Domain Adaptation (UDA) aims to generalize the knowledge learned from a well-labeled source domain to an unlabled target domain. Recently, adversarial domain adaptation with two distinct classifiers (bi-classifier) has been introduced into UDA which is effective to align distributions between different domains. Previous bi-classifier adversarial learning methods only focus on the similarity between the outputs of two distinct classifiers. However, the similarity of the outputs cannot guarantee the accuracy of target samples, i.e., traget samples may match to wrong categories even if the discrepancy between two classifiers is small. To challenge this issue, in this paper, we propose a cross-domain gradient discrepancy minimization (CGDM) method which explicitly minimizes the discrepancy of gradients generated by source samples and target samples. Specifically, the gradient gives a cue for the semantic information of target samples so it can be used as a good supervision to improve the accuracy of target samples. In order to compute the gradient signal of target smaples, we further obtain target pseudo labels through a clustering-based self-supervised learning. Extensive experiments on three widely used UDA datasets show that our method surpasses many previous state-of-the-arts.

关键词： computer vision Semantics Minimization Adversarial machine learning pattern recognition Reliability

来源：评论

学校读者我要写书评

暂无评论

LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning

LLM-Seg: Bridging Image Segmentation and Large Language Mode...

引用

ieee computer society conference on computer vision and pattern recognition Workshops (CVPRW)

作者： Junchi Wang Lei Ke ETH Zurich

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Understanding human instructions to identify the target objects is vital for perception systems. In recent years, the advancements of Large Language Models (LLMs) have introduced new possibilities for image segmentation. In this work, we delve into reasoning segmentation, a novel task that enables segmentation system to reason and interpret implicit user intention via large language model reasoning and then segment the corresponding target. Our work on reasoning segmentation contributes on both the methodological design and dataset labeling. For the model, we propose a new framework named LLM-Seg. LLM-Seg effectively connects the current foundational Segmentation Anything Model and the LLM by mask proposals selection. For the dataset, we propose an automatic data generation pipeline and construct a new reasoning segmentation dataset named LLM-Seg40K. Experiments demonstrate that our LLM-Seg exhibits competitive performance compared with existing methods. Furthermore, our proposed pipeline can efficiently produce high-quality reasoning segmentation datasets. The LLM-Seg40K dataset, developed through this pipeline, serves as a new benchmark for training and evaluating various reasoning segmentation approaches. Our code, models and dataset are at https://***/wangjunchi/LLMSeg.

关键词： Training Image segmentation Large language models Design methodology Pipelines Cognition pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Delving Deep into Many-to-many Attention for Few-shot Video Object Segmentation

Delving Deep into Many-to-many Attention for Few-shot Video ...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Chen, Haoxin Wu, Hanjie Zhao, Nanxuan Ren, Sucheng He, Shengfeng South China Univ Technol Sch Comp Sci & Engn Guangzhou Peoples R China Chinese Univ Hong Kong Hong Kong Peoples R China

ISBN: (纸本)9781665445092

This paper tackles the task of Few-Shot Video Object Segmentation (FSVOS), i.e., segmenting objects in the query videos with certain class specified in a few labeled support images. The key is to model the relationship between the query videos and the support images for propagating the object information. This is a many-to-many problem and often relies on full-rank attention, which is computationally intensive. In this paper, we propose a novel Domain Agent Network (DAN), breaking down the full-rank attention into two smaller ones. We consider one single frame of the query video as the domain agent, bridging between the support images and the query video. Our DAN allows a linear space and time complexity as opposed to the original quadratic form with no loss of performance. In addition, we introduce a learning strategy by combining meta-learning with online learning to further improve the segmentation accuracy. We build a FSVOS benchmark on the Youtube-VIS dataset and conduct experiments to demonstrate that our method outperforms baselines on both computational cost and accuracy, achieving the state-of-the-art performance. Code is available at https://***/scutpaul/DANet.

关键词： Image segmentation computer vision Codes Computational modeling Object segmentation pattern recognition Computational efficiency

来源：评论

学校读者我要写书评

暂无评论

Learning-based Image Registration with Meta-Regularization

Learning-based Image Registration with Meta-Regularization

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Al Safadi, Ebrahim Song, Xubo Oregon Hlth & Sci Univ Portland OR 97201 USA Amazon Seattle WA 98121 USA

ISBN: (纸本)9781665445092

We introduce a meta-regularization framework for learning-based image registration. Current learning-based image registration methods use high-resolution architectures such as U-Nets to produce spatial transformations, and impose simple and explicit regularization on the output of the network to ensure that the estimated displacements are smooth. While this approach works well on small deformations, it has been known to struggle when the deformations are large. Our method uses a more advanced form of meta-regularization to increase the generalization ability of learned registration models. We motivate our approach based on Reproducing Kernel Hilbert Space (RKHS) theory, and approximate that framework via a meta-regularization convolutional layer with radially symmetric, positive semi-definite filters that inherent its regularization properties. We then provide a method to learn such regularization filters while also learning to register. Our experiments on synthetic and real datasets as well as ablation analysis show that our method can improve anatomical correspondence compared to competing methods, and reduce the percentage of folding and tear in the large deformation setting, reflecting better regularization and model generalization.

关键词： Optical filters Deformable models Training Image registration computer architecture Filtering theory Registers

来源：评论

学校读者我要写书评

暂无评论

Scaling Local Self-Attention for Parameter Efficient Visual Backbones

Scaling Local Self-Attention for Parameter Efficient Visual ...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Vaswani, Ashish Ramachandran, Prajit Srinivas, Aravind Parmar, Niki Hechtman, Blake Shlens, Jonathon Google Res Mountain View CA 94043 USA Univ Calif Berkeley Berkeley CA USA

ISBN: (纸本)9781665445092

Self-attention has the promise of improving computer vision systems due to parameter-independent scaling of receptive fields and content-dependent interactions, in contrast to parameter-dependent scaling and content-independent interactions of convolutions. Self-attention models have recently been shown to have encouraging improvements on accuracy-parameter trade-offs compared to baseline convolutional models such as ResNet-50. In this work, we develop self-attention models that can outperform not just the canonical baseline models, but even the high-performing convolutional models. We propose two extensions to self-attention that, in conjunction with a more efficient implementation of self-attention, improve the speed, memory usage, and accuracy of these models. We leverage these improvements to develop a new self-attention model family, HaloNets, which reach state-of-the-art accuracies on the parameter-limited setting of the ImageNet classification benchmark. In preliminary transfer learning experiments, we find that HaloNet models outperform much larger models and have better inference performance. On harder tasks such as object detection and instance segmentation, our simple local self-attention and convolutional hybrids show improvements over very strong baselines. These results mark another step in demonstrating the efficacy of self-attention models on settings traditionally dominated by convolutions.(1)

关键词： computer vision Visualization Image segmentation Computational modeling Transfer learning Memory management Object detection

来源：评论

学校读者我要写书评

暂无评论

SemiGPC: Distribution-Aware Label Refinement for Imbalanced Semi-Supervised Learning Using Gaussian Processes

SemiGPC: Distribution-Aware Label Refinement for Imbalanced ...

引用

ieee computer society conference on computer vision and pattern recognition Workshops (CVPRW)

作者： Abdelhak Lemkhenter Manchen Wang Luca Zancato Gurumurthy Swaminathan Paolo Favaro Davide Modolo AWS AI Labs

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

In this paper we introduce SemiGPC, a distribution-aware label refinement strategy based on Gaussian Processes where the predictions of the model are derived from the labels posterior distribution. Differently from other buffer-based semi-supervised methods such as Co-Match [17] and SimMatch [34], our SemiGPC includes a normalization term that addresses imbalances in the global data distribution while maintaining local sensitivity. This explicit control allows SemiGPC to be more robust to confirmation bias especially under class imbalance. We show that SemiGPC improves performance when paired with different Semi-Supervised methods such as FixMatch [23], ReMixMatch [4], SimMatch [34] and FreeMatch [32] and different pre-training strategies including MSN [2] and Dino [5]. We also show that SemiGPC achieves state of the art results under different degrees of class imbalance on standard CIFAR10-LT/CIFAR100-LT especially in the low data-regime. Using SemiGPC also results in about 2% avg. accuracy increase compared to a new competitive baseline on the more challenging benchmarks SemiAves, SemiCUB, SemiFungi [27] and Semi-iNat [26].

关键词： computer vision Sensitivity Accuracy conferences Gaussian processes Semisupervised learning Predictive models

来源：评论

学校读者我要写书评

暂无评论

Lite-HRNet: A Lightweight High-Resolution Network

Lite-HRNet: A Lightweight High-Resolution Network

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Yu, Changqian Xiao, Bin Gao, Changxin Yuan, Lu Zhang, Lei Sang, Nong Wang, Jingdong Huazhong Univ Sci & Technol Sch Artificial Intelligence & Automat Key Lab Image Proc & Intelligent Control Huazhong Peoples R China Microsoft Redmond WA 98052 USA

ISBN: (纸本)9781665445092

We present an efficient high-resolution network, Lite-HRNet, for human pose estimation. We start by simply applying the efficient shuffle block in ShuffleNet to HRNet (high-resolution network), yielding stronger performance over popular lightweight networks, such as MobileNet, ShuffleNet, and Small HRNet. We find that the heavily-used pointwise (1 x 1) convolutions in shuffle blocks become the computational bottleneck. We introduce a lightweight unit, conditional channel weighting, to replace costly pointwise (1 x 1) convolutions in shuffle blocks. The complexity of channel weighting is linear w.r.t the number of channels and lower than the quadratic time complexity for pointwise convolutions. Our solution learns the weights from all the channels and over multiple resolutions that are readily available in the parallel branches in HRNet. It uses the weights as the bridge to exchange information across channels and resolutions, compensating the role played by the pointwise (1 x 1) convolution. Lite-HRNet demonstrates superior results on human pose estimation over popular lightweight networks. Moreover, Lite-HRNet can be easily applied to semantic segmentation task in the same lightweight manner. The code and models have been publicly available at https://***/HRNet/Lite-HRNet.

关键词： Convolutional codes Bridges computer vision Computational modeling Pose estimation Semantics pattern recognition

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 462 463 464 465 466 467 468 469 470 471 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：