ISBN (digital): 9798350365474
ISBN (print): 9798350365481
The advancement of autonomous drones, essential for sectors such as remote sensing and emergency services, is hindered by the absence of training datasets that fully capture the environmental challenges present in real-world scenarios, particularly operations in non-optimal weather conditions and the detection of thin structures like wires. We present the Drone Depth and Obstacle Segmentation (DDOS) dataset to fill this critical gap with a collection of synthetic aerial images, created to provide comprehensive training samples for semantic segmentation and depth estimation. Specifically designed to enhance the identification of thin structures, DDOS allows drones to navigate a wide range of weather conditions, significantly elevating drone training and operational safety. Additionally, this work introduces innovative drone-specific metrics aimed at refining the evaluation of algorithms in depth estimation, with a focus on thin structure detection. These contributions not only pave the way for substantial improvements in autonomous drone technology but also set a new benchmark for future research, opening avenues for further advancements in drone navigation and safety.
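The abstract does not spell out the drone-specific metrics; one plausible instance is a depth-error measure restricted to thin-structure pixels. The sketch below is our own illustration, not DDOS's published metric: it computes depth RMSE only over a wire mask.

```python
import numpy as np

def thin_structure_rmse(pred_depth, gt_depth, thin_mask):
    """Hypothetical drone-oriented metric: RMSE of predicted depth
    restricted to pixels labelled as thin structures (e.g. wires).

    pred_depth, gt_depth : (H, W) float arrays in metres
    thin_mask            : (H, W) bool array, True on thin-structure pixels
    """
    err = (pred_depth - gt_depth)[thin_mask]
    if err.size == 0:
        return 0.0  # no thin structures in this frame
    return float(np.sqrt(np.mean(err ** 2)))

# Toy example: a 4x4 depth map with a one-pixel-wide "wire" on row 1.
gt = np.full((4, 4), 10.0)
gt[1, :] = 2.0                       # wire 2 m away
pred = gt.copy()
pred[1, :] = 4.0                     # predictor misses the wire by 2 m
mask = np.zeros((4, 4), dtype=bool)
mask[1, :] = True

print(thin_structure_rmse(pred, gt, mask))  # 2.0
```

A conventional full-image RMSE would barely register this error, since the wire covers only 4 of 16 pixels; restricting the metric to the mask is what makes thin-structure failures visible.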
ISBN (print): 9781665448994
Recently, semi-supervised domain adaptation (SSDA) approaches have shown impressive performance on the domain adaptation task. They effectively utilize a few labeled target samples along with the unlabeled data to account for the distribution shift across the source and target domains. In this work, we make three-fold contributions, concentrating on the role of target samples and semantics for the SSDA task. First, we observe that choosing even a few labeled samples, balanced across the classes of the target domain, requires a significant amount of manual effort. To address this, we propose an active learning-based framework that models both sample diversity and classifier uncertainty. Using k-means-initialized cluster centers to pick a small pool of diverse unlabeled target samples, we compute a novel classifier adaptation uncertainty term to select the most effective samples from this pool, which are then queried to obtain their true labels from an oracle. Second, we propose to weigh the hard target samples more, without explicitly using their predicted, possibly incorrect, labels, which guides the adaptation process. Third, we note that the semantics of the classes remain unchanged irrespective of the domain shift, so they can be effectively utilized for this task. We show that initializing the class representations, or prototypes, with the class semantics significantly helps bridge the domain gap. These contributions, together with an adversarially learnt entropy objective, result in a novel framework, termed STar (Select TARgets), which sets a new state of the art for the SSDA task.
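The diversity-then-uncertainty selection step can be sketched as follows. This is a minimal stand-in, not the paper's code: k-means representatives supply the diverse pool, and plain predictive entropy substitutes for the paper's classifier adaptation uncertainty term; all names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def kmeans_diverse_pool(feats, k, iters=10):
    """Pick a diverse pool: run a small k-means and return the index of
    the sample nearest to each cluster centre (sketch, not the paper's code)."""
    centres = feats[rng.choice(len(feats), k, replace=False)]
    for _ in range(iters):
        d = np.linalg.norm(feats[:, None] - centres[None], axis=2)  # (n, k)
        assign = d.argmin(1)
        for c in range(k):
            pts = feats[assign == c]
            if len(pts):
                centres[c] = pts.mean(0)
    d = np.linalg.norm(feats[:, None] - centres[None], axis=2)
    return np.unique(d.argmin(0))        # one representative per centre

def entropy(probs):
    """Stand-in uncertainty: predictive entropy of the softmax outputs."""
    return -(probs * np.log(probs + 1e-12)).sum(1)

def select_for_labeling(feats, probs, pool_size, budget):
    pool = kmeans_diverse_pool(feats, pool_size)
    unc = entropy(probs[pool])
    return pool[np.argsort(-unc)[:budget]]   # most uncertain in the pool

feats = rng.normal(size=(40, 5))             # unlabeled target features
probs = rng.dirichlet(np.ones(3), size=40)   # classifier softmax outputs
picked = select_for_labeling(feats, probs, pool_size=8, budget=3)
print(picked)                                # indices sent to the oracle
```

The two-stage design matters: uncertainty alone tends to pick near-duplicate ambiguous samples, while restricting it to cluster representatives keeps the labeled set spread across the target distribution.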
ISBN (print): 9781665448994
Image quality assessment (IQA) aims to assess the perceptual quality of images, and the outputs of IQA algorithms are expected to be consistent with human subjective perception. In image restoration and enhancement tasks, images generated by generative adversarial networks (GANs) can achieve better visual quality than traditional CNN-generated images, although they exhibit spatial shift and texture noise. Unfortunately, existing IQA methods perform unsatisfactorily on GAN-based distortion, partly because of their low tolerance to spatial misalignment. To this end, we propose a reference-oriented deformable convolution, which improves the performance of an IQA network on GAN-based distortion by adaptively accounting for this misalignment. We further propose a patch-level attention module to enhance the interaction among different patch regions, which were processed independently in previous patch-based methods. We also modify the classic residual block to construct a patch-region-based baseline called WResNet. Equipping this baseline with the two proposed modules yields the Region-Adaptive Deformable Network (RADN). Experimental results on the NTIRE 2021 Perceptual Image Quality Assessment Challenge dataset show the superior performance of RADN, and our ensemble approach won fourth place in the final testing phase of the challenge.
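Of the two modules, patch-level attention is the easier to sketch: each patch feature is updated as an attention-weighted mix of all patch features, so patches that earlier patch-based IQA methods scored independently now interact. This is a minimal NumPy illustration, not RADN's actual module or weights.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def patch_attention(patch_feats, d_k=None):
    """Minimal patch-level self-attention over patch-region features.

    patch_feats : (P, D) array, one feature vector per patch region
    Returns (P, D) features where each row mixes information from all patches.
    """
    d_k = d_k or patch_feats.shape[1]
    scores = patch_feats @ patch_feats.T / np.sqrt(d_k)   # (P, P) similarities
    return softmax(scores, axis=1) @ patch_feats          # attention-weighted mix

rng = np.random.default_rng(1)
feats = rng.normal(size=(6, 8))        # 6 patches, 8-dim features
out = patch_attention(feats)
print(out.shape)                       # (6, 8)
```

In a full IQA network the mixed features would feed a per-patch quality head; the point of the module is that a patch's score can now depend on its neighbours' content.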
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
This report presents the ECO (Ensembled Clip score and cOnsensus score) pipeline from team DSBALAB, a new framework for evaluating and ranking captions for a given image. ECO selects the caption that most accurately describes the image by combining an Ensembled CLIP score, which measures the semantic alignment between the image and each caption, with a Consensus score, which accounts for how essential a caption is. Using this framework, we achieved notable success in the CVPR 2024 Workshop Challenge on Caption Re-ranking Evaluation at the New Frontiers for Zero-Shot Image Captioning Evaluation (NICE). Specifically, we secured third place on the CIDEr metric, second on both the SPICE and METEOR metrics, and first on ROUGE-L and all BLEU score metrics. The code and configuration for the ECO framework are available at https://***/DSBA-Lab/ECO.
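A minimal sketch of an ECO-style ranker, assuming precomputed embeddings: the ensembled CLIP score averages image-caption cosine similarity across CLIP variants, and the consensus score is taken as each caption's mean similarity to the other candidates. The consensus definition and the weight `lam` are our guesses, not the team's exact formulation.

```python
import numpy as np

def cosine(a, b):
    a = a / np.linalg.norm(a, axis=-1, keepdims=True)
    b = b / np.linalg.norm(b, axis=-1, keepdims=True)
    return a @ b.T

def eco_rank(image_embs, caption_embs, lam=0.5):
    """Rank captions best-first by ensembled CLIP score + lam * consensus.

    image_embs   : list of (D,) image embeddings, one per CLIP variant
    caption_embs : list of (N, D) caption embeddings, same variants
    """
    clip_scores = np.mean(
        [cosine(c, i[None]).ravel() for i, c in zip(image_embs, caption_embs)],
        axis=0)
    sims = cosine(caption_embs[0], caption_embs[0])
    np.fill_diagonal(sims, 0.0)
    consensus = sims.sum(1) / (len(sims) - 1)   # mean similarity to peers
    return np.argsort(-(clip_scores + lam * consensus))

img = [np.array([1.0, 0.0, 0.0])]               # one CLIP variant, toy embedding
caps = [np.array([[1.0, 0.0, 0.0],              # matches the image
                  [0.5, 0.5, 0.0],              # half-right
                  [0.0, 1.0, 0.0]])]            # unrelated
order = eco_rank(img, caps)
print(order)                                    # best caption first
```

The consensus term rewards captions that agree with the rest of the candidate pool, which suppresses outlier captions that happen to score well against the image alone.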
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Trajectory prediction, which aims to forecast future trajectories based on past ones, faces two pivotal issues: insufficient interactions and scene incompetence. The former signifies a lack of consideration for the interactions among agents' predicted future trajectories, potentially resulting in collisions, while the latter indicates an incapacity to learn complex social interactions from simple data. To establish an interaction-aware approach, we propose a diffusion-based model named TrajFine to extract social relationships among agents and refine predictions by considering past predictions and future interactive dynamics. Additionally, we introduce Scene Mixup, which augments training data by integrating agents from distinct scenes under a Curriculum Learning strategy that progressively increases task difficulty during training. Extensive experiments demonstrate the effectiveness of TrajFine for trajectory forecasting: it outperforms the current state of the art with significant improvements on the benchmarks.
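Scene Mixup with a curriculum can be sketched as below. The linear difficulty schedule and the function names are illustrative assumptions, not TrajFine's published code; the idea is just that a training scene borrows progressively more agents from another scene as epochs advance.

```python
import numpy as np

def scene_mixup(scene_a, scene_b, epoch, total_epochs, rng):
    """Augment scene_a with agents borrowed from scene_b, injecting more
    agents (i.e. denser social interaction) as training progresses.

    scene_a, scene_b : (num_agents, T, 2) past trajectories
    """
    frac = (epoch + 1) / total_epochs           # difficulty ramps up to 1
    n_extra = int(round(frac * len(scene_b)))
    if n_extra == 0:
        return scene_a
    idx = rng.choice(len(scene_b), n_extra, replace=False)
    return np.concatenate([scene_a, scene_b[idx]], axis=0)

rng = np.random.default_rng(0)
a = np.zeros((3, 8, 2))                          # 3 agents, 8 past steps
b = np.ones((4, 8, 2))                           # 4 agents from another scene
early = scene_mixup(a, b, epoch=0, total_epochs=10, rng=rng)
late = scene_mixup(a, b, epoch=9, total_epochs=10, rng=rng)
print(early.shape, late.shape)                   # few agents early, many late
```

Starting with sparse scenes and ending with crowded ones lets the model master simple motion before it must disentangle dense social interactions.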
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Video-based Question Answering (Video QA) is a challenging task that becomes even more intricate in Socially Intelligent Question Answering (SIQA). SIQA requires context understanding, temporal reasoning, and the integration of multimodal information, and in addition it requires processing nuanced human behavior. The complexities involved are further exacerbated by the dominance of the primary modality (text) over the others, so the task's secondary modalities need help to work in tandem with the primary one. In this work, we introduce a cross-modal alignment and subsequent representation fusion approach that achieves state-of-the-art results (82.06% accuracy) on the Social IQ 2.0 dataset for SIQA. Our approach better leverages the video modality by using the audio modality as a bridge to the language modality. This enhances performance by reducing the prevalent issues of language overfitting and the resulting bypassing of the video modality encountered by existing techniques. Our code and models are publicly available at [1].
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Image classifiers should be used with caution in the real world: performance evaluated on a validation set may not reflect performance in deployment. In particular, classifiers may perform well under conditions frequently encountered during training but poorly under other, infrequent conditions. In this study, we hypothesize that recent advances in text-to-image generative models make them valuable for benchmarking computer vision models such as image classifiers: they can generate images, conditioned on textual prompts, that cause classifier failures, allowing failure conditions to be described with textual attributes. However, their generation cost becomes an issue when a large number of synthetic images must be generated, which is the case when many different attribute combinations need to be tested. We propose an image classifier benchmarking method structured as an iterative process that alternates image generation, classifier evaluation, and attribute selection. This method efficiently explores the attribute space to find the conditions under which the classifier behaves poorly.
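The generate-evaluate-select loop can be sketched with stub models. Here `fake_generate` and `fake_classifier` stand in for the text-to-image model and the classifier under test, and the selection rule (keep the worst-scoring attribute combinations and extend them with one more attribute) is one plausible instantiation, not the paper's exact strategy.

```python
def benchmark_round(classifier, generate, combos, n_img=8, keep=2):
    """One generate -> evaluate -> select round: score each attribute
    combination by classifier accuracy on generated images, keep the worst."""
    acc = {c: sum(classifier(generate(c, i)) for i in range(n_img)) / n_img
           for c in combos}
    return sorted(acc, key=acc.get)[:keep]      # lowest accuracy survives

def benchmark(classifier, generate, attributes, rounds=2):
    """Iterate rounds, refining the worst combinations with extra attributes
    instead of exhaustively generating images for every combination."""
    combos = [(a,) for a in attributes]
    worst = combos
    for _ in range(rounds):
        worst = benchmark_round(classifier, generate, combos)
        combos = [c + (a,) for c in worst for a in attributes if a not in c]
        if not combos:
            break
    return worst

def fake_generate(attrs, seed):
    return attrs                                 # the "image" is its attributes

def fake_classifier(image):
    return 0 if "snow" in image else 1           # fails whenever it snows

failures = benchmark(fake_classifier, fake_generate, ["day", "snow", "night"])
print(failures)                                  # snow-based failure conditions
```

Only the surviving combinations trigger new generations in the next round, which is what keeps the synthetic-image budget manageable compared to testing every attribute combination.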
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
In this work, we explore the potential of self-supervised learning with Generative Adversarial Networks (GANs) for electron microscopy datasets. We show how self-supervised pretraining facilitates efficient fine-tuning for a spectrum of downstream tasks, including semantic segmentation, denoising, noise & background removal, and super-resolution. Experimentation with varying model complexities and receptive field sizes reveals the remarkable phenomenon that fine-tuned models of lower complexity consistently outperform more complex models with random weight initialization. We demonstrate the versatility of self-supervised pretraining across various downstream tasks in the context of electron microscopy, allowing faster convergence and better performance. We conclude that self-supervised pretraining serves as a powerful catalyst, being especially advantageous when limited annotated data are available and efficient scaling of computational cost is important.
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Recently, there has been a significant amount of research on Multi-Camera People Tracking (MCPT). MCPT presents more challenges than Multi-Object Single-Camera Tracking, leading many existing studies to address it with offline methods. However, offline methods can only analyze pre-recorded videos, making them less practical in real industrial settings than online methods. We therefore focus on resolving the major problems that arise with the online approach. Specifically, to address problems that can critically affect online MCPT performance, such as storing inaccurate or low-quality appearance features and assigning multiple IDs to the same person, we propose a Cluster Self-Refinement module. We achieved third place in the 2024 AI City Challenge Track 1 with a HOTA score of 60.9261%, and our code is available at https://***/nota-github/AIC2024_Track1_Nota.
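The abstract leaves the module's internals open; one plausible reading of Cluster Self-Refinement combines quality filtering of stored appearance features with similarity-based ID merging. The toy sketch below follows that reading, with illustrative thresholds and data structures.

```python
import numpy as np

def refine_clusters(banks, quality, q_thresh=0.5, merge_thresh=0.9):
    """Illustrative Cluster Self-Refinement step:
      1) drop stored appearance features whose quality score is low, then
      2) merge track IDs whose mean appearance features are near-duplicates,
         countering the one-person-many-IDs failure of online MCPT.

    banks   : dict id -> (n_i, D) appearance features
    quality : dict id -> (n_i,) per-feature quality scores in [0, 1]
    """
    # 1) quality filtering of the appearance banks
    banks = {k: v[quality[k] >= q_thresh] for k, v in banks.items()
             if (quality[k] >= q_thresh).any()}
    # 2) ID merging by cosine similarity of mean features
    means = {k: banks[k].mean(0) / np.linalg.norm(banks[k].mean(0))
             for k in banks}
    merged = {}
    for k in sorted(banks):
        for kept in merged:
            if float(means[k] @ means[kept]) >= merge_thresh:
                merged[kept] = np.concatenate([merged[kept], banks[k]])
                break
        else:
            merged[k] = banks[k]
    return merged

# IDs 1 and 2 are the same person; ID 3 is someone else.
banks = {1: np.array([[1.0, 0.0], [0.99, 0.10]]),
         2: np.array([[1.0, 0.05]]),
         3: np.array([[0.0, 1.0]])}
quality = {1: np.array([1.0, 0.2]),     # second feature is low quality
           2: np.array([1.0]),
           3: np.array([1.0])}
refined = refine_clusters(banks, quality)
print(sorted(refined))                  # IDs 1 and 2 merged
```

Running both steps periodically keeps the online appearance banks clean enough for reliable cross-camera re-identification without waiting for the full video, as offline methods do.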
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Watching a video sequence in which a foreground person appears is no longer what it used to be. Deepfakes have changed the way we view such content, and we now routinely wonder whether what we are seeing is real or a fabrication. In this context of widespread disinformation, there is a pressing need for reliable tools that help users, expert or not, assess this kind of video sequence. In this paper, a novel approach that leverages temporal surface-frame anomalies to reveal deepfake videos is introduced. The method searches the surfaces of the captured scene, and their evolution along the temporal axis, for discrepancies induced by deepfake manipulation. These features are fed to a pipeline based on deep neural networks that performs a binary assessment of the video itself. Experimental results show that this methodology achieves significant detection accuracy.
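As a rough illustration of the idea (not the paper's actual feature extractor), one can describe each frame's "surface" by its intensity-gradient field and flag transitions where that field changes abruptly; on pristine video the signal varies smoothly, while a manipulated frame tends to produce spikes.

```python
import numpy as np

def surface_anomaly_signal(frames):
    """Illustrative temporal surface-anomaly signal.

    frames : (T, H, W) float array of grayscale frames
    Returns (T-1,) scores, one per frame-to-frame transition.
    """
    gy, gx = np.gradient(frames, axis=(1, 2))        # per-frame surface slopes
    d = np.sqrt(np.diff(gx, axis=0) ** 2 + np.diff(gy, axis=0) ** 2)
    return d.mean(axis=(1, 2))

# A smooth 5-frame sequence, with frame 3 replaced by noise ("manipulated").
rng = np.random.default_rng(2)
frames = np.stack([np.full((8, 8), t, float) for t in range(5)])
frames[3] = rng.normal(size=(8, 8))
sig = surface_anomaly_signal(frames)
print(sig.round(3))   # transitions into/out of frame 3 spike
```

In the paper's pipeline such per-transition features would be the input to a deep network that outputs the real/fake decision; the toy signal only shows why temporal surface discrepancies are informative.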