Remote sensing object detection is an important research area in computer vision, widely applied in both military and civilian domains. However, challenges in remote sensing image object detection such as large image ...
ISBN (Print): 9781665445092
Handwritten Text Recognition (HTR) remains a challenging problem to date, largely due to the varying writing styles that exist amongst us. Prior works, however, generally operate under the assumption that there is a limited number of styles, most of which have already been captured by existing datasets. In this paper, we take a completely different perspective: we work on the assumption that there is always a new style that is drastically different, and that we will only have very limited data during testing to perform adaptation. This creates a commercially viable solution: being exposed to the new style, the model has the best shot at adaptation, and the few-sample nature makes it practical to implement. We achieve this via a novel meta-learning framework which exploits additional new-writer data via a support set and outputs a writer-adapted model via a single gradient step update, all during inference (see Figure 1). We discover and leverage the important insight that there exist a few key characters per writer that exhibit relatively large style discrepancies. For that, we additionally propose to meta-learn instance-specific weights for a character-wise cross-entropy loss, which is specifically designed to work with the sequential nature of text data. Our writer-adaptive MetaHTR framework can be easily implemented on top of most state-of-the-art HTR models. Experiments show that an average performance gain of 5-7% can be obtained by observing very few new-style samples (<= 16).
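A minimal PyTorch sketch of the single-gradient-step, writer-adaptive idea follows. The tiny recognizer, the character-weight tensor, and all shapes are illustrative assumptions rather than the MetaHTR implementation; in the paper the character-wise weights are themselves meta-learned.

# Single-gradient-step writer adaptation (MAML-style inner update); requires torch >= 2.0.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyRecognizer(nn.Module):
    """Stand-in HTR model (assumption): maps (B, T, D) per-step features to character logits."""
    def __init__(self, feat_dim=64, vocab=80):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(feat_dim, 128), nn.ReLU(), nn.Linear(128, vocab))

    def forward(self, x):
        return self.net(x)                        # (B, T, vocab)

def char_ce(logits, targets, char_weights):
    """Character-wise cross-entropy; char_weights re-weights every time step individually."""
    ce = F.cross_entropy(logits.flatten(0, 1), targets.flatten(),
                         reduction="none").view_as(targets)
    return (char_weights * ce).mean()

def adapt_one_step(model, support_x, support_y, char_weights, inner_lr=1e-2):
    """Return writer-adapted parameters after a single gradient step on the support set."""
    params = dict(model.named_parameters())
    logits = torch.func.functional_call(model, params, (support_x,))
    loss = char_ce(logits, support_y, char_weights)
    # create_graph keeps the graph so an outer meta-update could backprop through this step.
    grads = torch.autograd.grad(loss, list(params.values()), create_graph=True)
    return {name: p - inner_lr * g for (name, p), g in zip(params.items(), grads)}

# Toy usage: <= 16 support samples from a new writer, then evaluate with adapted weights.
model = TinyRecognizer()
support_x, support_y = torch.randn(4, 20, 64), torch.randint(0, 80, (4, 20))
char_weights = torch.ones(4, 20)                  # meta-learned, instance-specific in the paper
adapted = adapt_one_step(model, support_x, support_y, char_weights)
query_logits = torch.func.functional_call(model, adapted, (torch.randn(2, 20, 64),))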
ISBN (Print): 9781665445092
State-of-the-art methods for semantic segmentation are based on deep neural networks that are known to be data-hungry. Region-based active learning has been shown to be a promising method for reducing data annotation costs. A key design choice for region-based AL is whether to use regularly shaped regions (e.g., rectangles) or irregularly shaped regions (e.g., superpixels). In this work, we address this question under a realistic, click-based measurement of annotation costs. In particular, we revisit the use of superpixels and demonstrate that an inappropriate choice of cost measure (e.g., the percentage of labeled pixels) may cause the effectiveness of the superpixel-based approach to be underestimated. We benchmark the superpixel-based approach against the traditional "rectangle+polygon"-based approach with annotation cost measured in clicks, and show that the former outperforms the latter on both Cityscapes and PASCAL VOC. We further propose a class-balanced acquisition function to boost the performance of the superpixel-based approach and demonstrate its effectiveness on the evaluation datasets. Our results strongly argue for the use of superpixel-based AL for semantic segmentation and highlight the importance of using realistic annotation costs in evaluating such methods.
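The class-balanced acquisition idea can be illustrated with a short, hedged sketch: superpixels are ranked by mean prediction entropy, boosted for classes that are still rare in the labeled pool, under a one-click-per-superpixel cost assumption. The scoring formula and names are assumptions, not the paper's exact acquisition function.

# Class-balanced acquisition over superpixels (illustrative scoring only).
import numpy as np

def acquire_superpixels(probs, superpixels, labeled_class_counts, click_budget):
    """
    probs: (H, W, C) softmax predictions; superpixels: (H, W) integer segment ids.
    labeled_class_counts: (C,) number of already-labeled pixels per class.
    Each selected superpixel is assumed to cost one click.
    """
    eps = 1e-12
    entropy = -(probs * np.log(probs + eps)).sum(-1)           # per-pixel uncertainty
    dominant = probs.argmax(-1)                                # per-pixel predicted class
    # Rarer classes (few labeled pixels so far) receive a larger bonus.
    class_bonus = 1.0 / np.sqrt(labeled_class_counts + 1.0)

    scores = {}
    for sp_id in np.unique(superpixels):
        mask = superpixels == sp_id
        sp_class = np.bincount(dominant[mask], minlength=probs.shape[-1]).argmax()
        scores[sp_id] = entropy[mask].mean() * class_bonus[sp_class]

    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:click_budget]                               # superpixels to annotate next

# Toy usage on random predictions over a 64x64 image with 50 superpixels and 5 classes.
rng = np.random.default_rng(0)
probs = rng.dirichlet(np.ones(5), size=(64, 64))
superpixels = rng.integers(0, 50, size=(64, 64))
picked = acquire_superpixels(probs, superpixels, labeled_class_counts=np.zeros(5),
                             click_budget=10)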
ISBN (Print): 9781665445092
Semantic segmentation of nighttime images plays an equally important role as that of daytime images in autonomous driving, but the former is much more challenging due to poor illumination and arduous human annotation. In this paper, we propose a novel domain adaptation network (DANNet) for nighttime semantic segmentation without using labeled nighttime image data. It employs adversarial training with a labeled daytime dataset and an unlabeled dataset that contains coarsely aligned day-night image pairs. Specifically, for the unlabeled day-night image pairs, we use the pixel-level predictions of static object categories on a daytime image as pseudo supervision to segment its counterpart nighttime image. We further design a re-weighting strategy to handle the inaccuracy caused by misalignment between day-night image pairs and by wrong predictions on daytime images, as well as to boost the prediction accuracy of small objects. The proposed DANNet is the first one-stage adaptation framework for nighttime semantic segmentation; it does not train additional day-night image transfer models as a separate pre-processing stage. Extensive experiments on the Dark Zurich and Nighttime Driving datasets show that our method achieves state-of-the-art performance for nighttime semantic segmentation.
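A minimal sketch of the pseudo-supervision step follows: the daytime prediction of the paired image supervises the nighttime prediction, restricted to static categories and re-weighted per class. The static-class ids, confidence threshold, and re-weighting rule are illustrative assumptions, not DANNet's exact scheme.

# Pseudo-supervising a nighttime prediction with the paired daytime prediction.
import torch
import torch.nn.functional as F

STATIC_CLASSES = [0, 1, 2, 3, 8, 10]   # assumed ids, e.g. road, sidewalk, building, wall, vegetation, sky

def night_pseudo_loss(night_logits, day_logits, class_weights, ignore_index=255):
    """
    night_logits, day_logits: (B, C, H, W) outputs of the shared segmentation network.
    class_weights: (C,) weights that, e.g., boost small-object classes.
    """
    with torch.no_grad():
        day_prob, pseudo = F.softmax(day_logits, dim=1).max(dim=1)   # (B, H, W)
        keep = torch.zeros_like(pseudo, dtype=torch.bool)
        for c in STATIC_CLASSES:                  # only static categories are trusted
            keep |= pseudo == c
        keep &= day_prob > 0.9                    # drop low-confidence daytime pixels
        pseudo = torch.where(keep, pseudo, torch.full_like(pseudo, ignore_index))
    return F.cross_entropy(night_logits, pseudo, weight=class_weights,
                           ignore_index=ignore_index)

# Toy usage with 19 Cityscapes-style classes.
night_logits = torch.randn(2, 19, 64, 128, requires_grad=True)
day_logits = torch.randn(2, 19, 64, 128)
loss = night_pseudo_loss(night_logits, day_logits, class_weights=torch.ones(19))
loss.backward()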
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Humans show an innate capability to identify tools to support specific actions. The association between object parts and the actions they facilitate is usually named affordance. Being able to segment object parts depending on the tasks they afford is crucial to enable intelligent robots to use objects of daily living. Traditional supervised learning methods for affordance segmentation require costly pixel-level annotations, while weakly supervised approaches, though less demanding, still rely on object-interaction examples and support only a closed set of actions. These limitations hinder scalability, may introduce biases, and usually restrict models to a limited set of predefined actions. This paper proposes AffordanceCLIP to overcome these limitations by leveraging the implicit affordance knowledge embedded within large pre-trained vision-language models like CLIP. We experimentally demonstrate that CLIP, although not explicitly trained for affordance detection, retains valuable information for the task. Our AffordanceCLIP achieves competitive zero-shot performance compared to methods with specialized training, while offering several advantages: i) it works with any action prompt, not just a predefined set; ii) it requires training only a small number of additional parameters compared to existing solutions; and iii) it eliminates the need for direct supervision on action-object pairs, opening new perspectives for functionality-based reasoning of models.
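A hedged sketch of the zero-shot scoring idea: the CLIP text embedding of a free-form action prompt is compared against dense visual features projected into the same space, yielding an affordance heat map. The small projection head stands in for the "small number of additional parameters"; the actual AffordanceCLIP architecture may differ.

# Zero-shot affordance scoring with CLIP-style embeddings (projection head is an assumption).
import torch
import torch.nn as nn
import torch.nn.functional as F

class AffordanceHead(nn.Module):
    def __init__(self, vis_dim=768, clip_dim=512):
        super().__init__()
        self.proj = nn.Conv2d(vis_dim, clip_dim, kernel_size=1)   # the only trained part

    def forward(self, dense_feats, text_emb):
        """
        dense_feats: (B, vis_dim, H, W) frozen features from CLIP's image encoder.
        text_emb:    (B, clip_dim) frozen CLIP embedding of e.g. "something to cut with".
        Returns a (B, H, W) affordance heat map.
        """
        v = F.normalize(self.proj(dense_feats), dim=1)
        t = F.normalize(text_emb, dim=1)
        return torch.einsum("bchw,bc->bhw", v, t)

# Toy usage with random tensors standing in for frozen CLIP features.
head = AffordanceHead()
dense_feats = torch.randn(1, 768, 14, 14)
text_emb = torch.randn(1, 512)
heatmap = head(dense_feats, text_emb)             # upsample and threshold to segment parts
mask = F.interpolate(heatmap.unsqueeze(1), size=(224, 224), mode="bilinear") > 0.1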
ISBN (Print): 9781665445092
Point cloud semantic segmentation often requires large-scale annotated training data, but clearly, point-wise labels are too tedious to prepare. While some recent methods propose to train a 3D network with small percentages of point labels, we take this approach to an extreme and propose "One Thing One Click," meaning that the annotator only needs to label one point per object. To leverage these extremely sparse labels in network training, we design a novel self-training approach in which we iteratively conduct training and label propagation, facilitated by a graph propagation module. We also adopt a relation network to generate per-category prototypes and explicitly model the similarity among graph nodes to generate pseudo labels that guide the iterative training. Experimental results on both ScanNet-v2 and S3DIS show that our self-training approach, with extremely sparse annotations, outperforms all existing weakly supervised methods for 3D semantic segmentation by a large margin, and our results are also comparable to those of the fully supervised counterparts.
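A minimal sketch of the propagate-then-retrain idea: labels clicked on one super-voxel per object are spread to unlabeled neighbours whose features are close to the corresponding category prototype. The similarity rule and threshold are assumptions, not the paper's relation network or graph propagation module.

# Spreading one-click labels over a super-voxel neighbourhood graph (illustrative rule only).
import torch
import torch.nn.functional as F

def propagate_labels(feats, labels, edges, num_classes, sim_thresh=0.9):
    """
    feats:  (N, D) per-super-voxel features from the current 3D network.
    labels: (N,) int labels, -1 where unlabeled ("one click" gives very few non-negatives).
    edges:  (E, 2) index pairs of neighbouring super-voxels.
    """
    feats = F.normalize(feats, dim=1)
    # Per-category prototypes from currently labeled nodes.
    protos = torch.stack([feats[labels == c].mean(0) if (labels == c).any()
                          else torch.zeros(feats.shape[1]) for c in range(num_classes)])
    protos = F.normalize(protos, dim=1)

    new_labels = labels.clone()
    for a, b in edges.tolist():
        for src, dst in ((a, b), (b, a)):
            if labels[src] >= 0 and labels[dst] < 0:
                sim = float(feats[dst] @ protos[labels[src]])
                if sim > sim_thresh:              # confident: adopt the neighbour's label
                    new_labels[dst] = labels[src]
    return new_labels                             # pseudo labels for the next training round

# Toy usage: 6 super-voxels in a chain, two clicks (labels 0 and 1).
feats = torch.randn(6, 16)
labels = torch.tensor([0, -1, -1, -1, -1, 1])
edges = torch.tensor([[i, i + 1] for i in range(5)])
pseudo = propagate_labels(feats, labels, edges, num_classes=2)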
With the emergence of COVID-19, virtual video conferencing platforms like Zoom and Google Meet have become one of the main alternative ways to conduct virtual meetings and presentations. While the virtual platforms...
ISBN (Print): 9781665445092
The perceptual loss has been widely used as an effective loss term in image synthesis tasks, including image super-resolution [16] and style transfer [14]. It was believed that its success lies in the high-level perceptual feature representations extracted from CNNs pretrained on a large set of images. Here we reveal that what matters is the network structure rather than the trained weights. Without any learning, the structure of a deep network is sufficient to capture the dependencies between multiple levels of variable statistics using multiple layers of CNNs. This insight removes the requirements of pre-training and of a particular network structure (commonly, VGG) that were previously assumed for the perceptual loss, thus enabling a significantly wider range of applications. To this end, we demonstrate that a randomly weighted deep CNN can be used to model the structured dependencies of outputs. On a few dense per-pixel prediction tasks such as semantic segmentation, depth estimation, and instance segmentation, we show improved results when using the extended randomized perceptual loss, compared to baselines using a pixel-wise loss alone. We hope that this simple, extended perceptual loss may serve as a generic structured-output loss that is applicable to most structured output learning tasks.
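A hedged sketch of a randomized perceptual loss: a frozen, randomly initialised CNN extracts features at several depths, and the L1 distance between the feature maps of prediction and target is summed. The layer widths and depth are arbitrary assumptions.

# Perceptual loss from a random-weight, frozen CNN (structure matters, weights are untrained).
import torch
import torch.nn as nn
import torch.nn.functional as F

class RandomPerceptualLoss(nn.Module):
    def __init__(self, in_ch=3, widths=(32, 64, 128)):
        super().__init__()
        layers, c = [], in_ch
        for w in widths:
            layers += [nn.Conv2d(c, w, 3, padding=1), nn.ReLU(), nn.AvgPool2d(2)]
            c = w
        self.stages = nn.ModuleList([nn.Sequential(*layers[i * 3:(i + 1) * 3])
                                     for i in range(len(widths))])
        for p in self.parameters():
            p.requires_grad_(False)               # random weights, never trained

    def forward(self, pred, target):
        loss, x, y = 0.0, pred, target
        for stage in self.stages:                 # compare features at several depths
            x, y = stage(x), stage(y)
            loss = loss + F.l1_loss(x, y)
        return loss

# Toy usage: structured-output loss for a dense prediction (here, a 3-channel map).
criterion = RandomPerceptualLoss()
pred = torch.rand(2, 3, 64, 64, requires_grad=True)
target = torch.rand(2, 3, 64, 64)
(criterion(pred, target) + F.l1_loss(pred, target)).backward()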
ISBN (Print): 9781665445092
Acquisition and rendering of photo-realistic human heads is a highly challenging research problem of particular importance for virtual telepresence. Currently, the highest quality is achieved by volumetric approaches trained in a person-specific manner on multi-view data. These models better represent fine structure, such as hair, compared to simpler mesh-based models. Volumetric models typically employ a global code to represent facial expressions, such that they can be driven by a small set of animation parameters. While such architectures achieve impressive rendering quality, they cannot easily be extended to the multi-identity setting. In this paper, we devise a novel approach for predicting volumetric avatars of the human head given just a small number of inputs. We enable generalization across identities by a novel parameterization that combines neural radiance fields with local, pixel-aligned features extracted directly from the inputs, thus side-stepping the need for very deep or complex networks. Our approach is trained in an end-to-end manner solely based on a photometric re-rendering loss without requiring explicit 3D supervision. We demonstrate that our approach outperforms the existing state of the art in terms of quality and is able to generate faithful facial expressions in a multi-identity setting.
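A minimal sketch of pixel-aligned conditioning for a radiance field: each 3D sample is projected into an input view, a feature is bilinearly sampled at that pixel, and an MLP maps position plus feature to colour and density. The pinhole camera model, encoder features, and MLP sizes are simplified assumptions, not the paper's architecture.

# Conditioning a tiny radiance field on pixel-aligned features sampled from an input view.
import torch
import torch.nn as nn
import torch.nn.functional as F

def pixel_aligned_features(feat_map, pts, K):
    """feat_map: (1, C, H, W); pts: (N, 3) camera-space points; K: (3, 3) intrinsics."""
    uvw = pts @ K.T                               # pinhole projection
    uv = uvw[:, :2] / uvw[:, 2:3]                 # pixel coordinates
    H, W = feat_map.shape[-2:]
    grid = torch.stack([uv[:, 0] / (W - 1), uv[:, 1] / (H - 1)], -1) * 2 - 1
    sampled = F.grid_sample(feat_map, grid.view(1, -1, 1, 2), align_corners=True)
    return sampled[0, :, :, 0].T                  # (N, C) bilinearly sampled features

class TinyConditionedNeRF(nn.Module):
    def __init__(self, feat_dim=32):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(3 + feat_dim, 128), nn.ReLU(),
                                 nn.Linear(128, 4))   # (rgb, sigma) per sample

    def forward(self, pts, feats):
        out = self.mlp(torch.cat([pts, feats], dim=-1))
        return torch.sigmoid(out[:, :3]), F.relu(out[:, 3])

# Toy usage: training would compare volume-rendered pixels to photos (photometric loss).
feat_map = torch.randn(1, 32, 64, 64)
pts = torch.rand(1024, 3) + torch.tensor([0.0, 0.0, 2.0])   # points in front of the camera
K = torch.tensor([[60.0, 0, 32], [0, 60.0, 32], [0, 0, 1]])
rgb, sigma = TinyConditionedNeRF()(pts, pixel_aligned_features(feat_map, pts, K))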
ISBN (Print): 9781665445092
3D object detection is an important module in autonomous driving and robotics. However, many existing methods focus on using single frames to perform 3D detection and do not fully utilize information from multiple frames. In this paper, we present 3D-MAN: a 3D multi-frame attention network that effectively aggregates features from multiple perspectives and achieves state-of-the-art performance on the Waymo Open Dataset. 3D-MAN first uses a novel fast single-frame detector to produce box proposals. The box proposals and their corresponding feature maps are then stored in a memory bank. We design a multi-view alignment and aggregation module, using attention networks, to extract and aggregate the temporal features stored in the memory bank. This effectively combines the features coming from different perspectives of the scene. We demonstrate the effectiveness of our approach on the large-scale, complex Waymo Open Dataset, achieving state-of-the-art results compared to published single-frame and multi-frame methods.
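A hedged sketch of attention-based temporal aggregation over a memory bank: the current frame's proposal features attend to features stored from earlier frames and are refined with a residual connection. The single MultiheadAttention layer, feature sizes, and bank handling are assumptions, not the 3D-MAN design.

# Cross-attention over a memory bank of per-frame proposal features.
import torch
import torch.nn as nn

class TemporalProposalAggregator(nn.Module):
    def __init__(self, dim=128, heads=4, bank_size=16):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.bank, self.bank_size = [], bank_size

    def forward(self, proposal_feats):
        """proposal_feats: (B, N, dim) features of the current frame's box proposals."""
        if self.bank:                                     # attend over stored frames
            memory = torch.cat(self.bank, dim=1)          # (B, N * num_stored, dim)
            fused, _ = self.attn(proposal_feats, memory, memory)
            out = proposal_feats + fused                  # residual fusion
        else:
            out = proposal_feats
        self.bank.append(proposal_feats.detach())         # store the current frame
        self.bank = self.bank[-self.bank_size:]
        return out                                        # refined features for the box heads

# Toy usage over a short sequence of frames with 100 proposals each.
agg = TemporalProposalAggregator()
for _ in range(4):
    refined = agg(torch.randn(2, 100, 128))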