ISBN: 9798350365474 (digital); 9798350365481 (print)
The capabilities of foundation models, most recently the Segment Anything Model, have garnered considerable attention for providing a versatile framework for tackling a wide array of image segmentation tasks. However, the interplay between human prompting strategies and the segmentation performance of these models remains understudied, as does the role played by the domain knowledge that humans (through previous exposure) and models (through pretraining) bring to the prompting process. To bridge this gap, we present the PointPrompt dataset, compiled across multiple image modalities with multiple prompting annotators per modality. We collected a total of 16 image datasets from the natural, underwater, medical, and seismic domains to create a comprehensive resource for studying prompting behavior and agreement across modalities. Overall, our prompting dataset contains 158,880 inclusion points and 52,594 exclusion points over a total of 6,000 images. Our analysis highlights (i) the viability of prompts across heterogeneous data, (ii) the value of point prompts for enhancing the robustness and generalizability of segmentation models across diverse domains, and (iii) how prompts facilitate an understanding of the dynamics between annotation strategies and neural network outcomes. Information on downloading the dataset, images, and prompting tool is provided on our project website: https://***/pointprompt/.
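The interaction this dataset captures, annotators placing inclusion and exclusion points that the Segment Anything Model consumes, can be reproduced with Meta's public `segment_anything` API. The sketch below is illustrative rather than the authors' prompting tool; the checkpoint path, image path, and point coordinates are placeholders:

```python
import numpy as np
import cv2
from segment_anything import sam_model_registry, SamPredictor

# Load a pretrained SAM backbone (checkpoint path is a placeholder).
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# Point prompts are (x, y) pixel coordinates; label 1 marks an inclusion
# point, label 0 an exclusion point -- the two prompt types in PointPrompt.
point_coords = np.array([[320, 240], [400, 260], [100, 80]])
point_labels = np.array([1, 1, 0])

masks, scores, _ = predictor.predict(
    point_coords=point_coords,
    point_labels=point_labels,
    multimask_output=True,  # SAM returns three candidate masks with scores
)
best_mask = masks[np.argmax(scores)]
```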
When stylizing 3D scenes, current methods must render full-resolution images from different views and optimize the stylized radiance fields with a style loss that was proposed for 2D style transfer and must be computed over the whole image. This is quite inefficient when stylizing a large-scale scene. This paper proposes a more efficient method, DeSRF, for stylizing radiance fields that also transfers style information to the geometry according to the input style. To achieve this, on the one hand, we introduce a deformable module that learns the geometric style contained in the input style image and transfers it to the radiance fields. On the other hand, although the style loss must be computed over the entire image, we do not actually need to process all the rays when updating the stylized radiance fields. Motivated by this observation, we propose a new training strategy called Dilated Sampling (DS) for efficient stylization propagation. Experimental results show that our method is more efficient and produces more visually plausible stylized 3D scenes with geometric style information than existing approaches.
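The abstract does not spell out how Dilated Sampling selects rays. One plausible reading, sketched below purely as an illustration, is that each update renders only a strided (dilated) subset of pixels whose offset shifts over time, so the style loss sees a low-resolution rendering that still covers the whole view. `render_rays` and `style_loss` are hypothetical stand-ins for a radiance-field renderer and a VGG/Gram style loss:

```python
import torch

def dilated_stylization_step(radiance_field, style_image,
                             render_rays, style_loss,
                             h, w, stride, step):
    """One hypothetical stylization update that renders only a dilated
    pixel grid instead of the full h*w image."""
    off = step % stride  # shift the grid each step to cover all pixels
    ys = torch.arange(off, h, stride)
    xs = torch.arange(off, w, stride)
    coords = torch.cartesian_prod(ys, xs)            # (N, 2) pixel coords
    rendered = render_rays(radiance_field, coords)   # (N, 3) colors
    rendered = rendered.reshape(len(ys), len(xs), 3)
    # The subsampled render spans the whole view, so a global style loss
    # remains meaningful while far fewer rays are processed per update.
    return style_loss(rendered, style_image)
```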
ISBN: 9798350365474 (digital); 9798350365481 (print)
Diffusion models (DMs) can generate realistic images with text guidance using large-scale datasets. However, they offer limited controllability over the generated images. We introduce iEdit, a novel method for text-guided image editing conditioned on a source image and a textual prompt. As no fully annotated dataset with target images exists, previous approaches perform subject-specific fine-tuning at test time or adopt contrastive learning without a target image, leading to issues in preserving source-image fidelity. We propose to automatically construct a dataset derived from LAION-5B containing pseudo-target images and descriptive edit prompts. This dataset allows us to incorporate a weakly supervised loss function that generates the pseudo-target image from the source image's latent noise conditioned on the edit prompt. To encourage localised editing, we propose a loss function that uses segmentation masks to guide the editing during training and, optionally, at inference. Trained with limited GPU resources on the constructed dataset, our model outperforms counterparts in image fidelity, CLIP alignment score, and qualitative evaluation on both generated and real images.
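As a hedged sketch of the localisation idea (not iEdit's actual objective; all names and weights are assumptions), one can weight the denoising loss by a segmentation mask so errors inside the edit region are penalised strongly and errors outside it only weakly:

```python
import torch
import torch.nn.functional as F

def masked_edit_loss(pred_noise, true_noise, mask, w_in=1.0, w_out=0.1):
    """Hypothetical mask-guided loss for localised editing.

    pred_noise, true_noise: (B, C, H, W) latent noise estimate and target.
    mask: (B, 1, H, W) segmentation mask in [0, 1] marking the edit region.
    """
    per_pixel = F.mse_loss(pred_noise, true_noise, reduction="none")
    inside = (per_pixel * mask).mean()           # weighted heavily
    outside = (per_pixel * (1.0 - mask)).mean()  # weighted lightly
    return w_in * inside + w_out * outside
```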
Recently, video frame interpolation using a combination of frame- and event-based cameras has surpassed traditional image-based methods both in terms of performance and memory efficiency. However, current methods stil...
ISBN: 9798350365474 (digital); 9798350365481 (print)
Exposure correction aims to recover the brightness and structural information of overexposed or underexposed images. Areas with different exposure levels differ in recovery difficulty: severely exposed areas are harder to recover than commonly exposed ones because they suffer greater loss of structural information. However, existing methods focus on simultaneously recovering global brightness and structure, ignoring that recovery difficulty varies between areas. To address this, we propose a novel exposure correction strategy named "Inpainting Assisted Exposure Correction" (IAEC), which first performs image structure repair on severely exposed areas to guide the subsequent exposure correction. The method is based on the observation that the contextual semantic information contained in image structure effectively aids overall image recovery, and that this information is badly lacking in severely exposed areas. Pre-repair by an inpainting model can compensate for the contextual semantic information lost to severe exposure. We therefore use an inpainting model to repair the structure of severely exposed areas and then align the structure-repaired image with the improperly exposed input at the feature level. Extensive experiments demonstrate that our method achieves superior results compared with state-of-the-art methods and has the potential to be applied to other tasks with similar context-loss problems.
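The pre-repair step described above can be sketched as follows: flag severely exposed pixels by luminance thresholds, then hand that mask to an off-the-shelf inpainting model. This is a schematic under assumed thresholds, not the paper's implementation; `inpaint_model` is a placeholder for any image-inpainting network:

```python
import numpy as np

def severe_exposure_mask(image, low=0.05, high=0.95):
    """Flag pixels whose luminance is near black or near saturation.
    `image` is float RGB in [0, 1]; thresholds are illustrative."""
    luma = image @ np.array([0.299, 0.587, 0.114])
    return (luma < low) | (luma > high)

def pre_repair(image, inpaint_model):
    mask = severe_exposure_mask(image)
    # Inpainting restores structure in the severely exposed areas,
    # supplying the contextual semantics that the exposure-correction
    # network later aligns with at the feature level.
    repaired = inpaint_model(image, mask)
    return repaired, mask
```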
ISBN: 9798350365474 (digital); 9798350365481 (print)
Data augmentations are widely used in training medical image deep learning models to increase the diversity and size of sparse datasets. However, commonly used augmentation techniques can result in the loss of clinically relevant information from medical images, leading to incorrect predictions at inference time. We propose the Interactive Medical Image Learning (IMIL) framework, a novel approach for improving the training of medical image analysis algorithms that enables clinician-guided intermediate training data augmentations on misprediction outliers, focusing the algorithm on relevant visual information. To prevent the model from using irrelevant features during training, IMIL "blacks out" clinician-designated irrelevant regions and replaces the original images with the augmented samples. This ensures that for originally mispredicted samples, the algorithm subsequently attends only to relevant regions and correctly correlates them with the respective diagnosis. We validate the efficacy of IMIL with radiology residents and compare its performance to state-of-the-art data augmentations. A 4.2% improvement in accuracy over ResNet-50 was observed when using IMIL on only 4% of the training set. Our study demonstrates the utility of clinician-guided interactive training for achieving meaningful data augmentations in medical image analysis algorithms.
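The "blackout" operation itself is straightforward; the region format and names below are assumptions for illustration, not IMIL's released code:

```python
import numpy as np

def blackout(image, irrelevant_regions):
    """Zero out clinician-designated irrelevant regions.

    image: (H, W, C) array; irrelevant_regions: list of (y0, y1, x0, x1)
    boxes the reviewing clinician marked as clinically irrelevant.
    """
    out = image.copy()
    for y0, y1, x0, x1 in irrelevant_regions:
        out[y0:y1, x0:x1, :] = 0
    return out

# The augmented sample replaces the original mispredicted image in the
# training set, so the model can no longer rely on the masked features.
```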
ISBN: 9798350365474 (digital); 9798350365481 (print)
In recent years, we have witnessed the collection of larger and larger multi-modal image-caption datasets, growing from hundreds of thousands of pairs to hundreds of millions. Such datasets allow researchers to build powerful deep learning models, at the cost of intensive computational resources. In this work, we ask: can we use such datasets efficiently without sacrificing performance? We tackle this problem by extracting a difficulty score from each image-caption sample and using these scores to make training more effective and efficient. We compare two ways of using difficulty scores to influence training: filtering a representative subset of each dataset, and ordering samples through curriculum learning. We analyze and compare difficulty scores extracted from a single modality, either captions (i.e., caption length and number of object mentions) or images (i.e., region proposals' size and number), or from the alignment of image-caption pairs (i.e., CLIP and concreteness). We focus on Weakly-Supervised Object Detection, where image-level labels are extracted from captions. We find that (1) combining filtering and curriculum learning can achieve large gains in performance, but not all methods are stable across experimental settings; (2) single-modality scores often outperform alignment-based ones; and (3) alignment scores show the largest gains when training time is limited.
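To make the two uses of difficulty scores concrete, here is a minimal sketch (function names are placeholders; caption length is just one of the single-modality scores the paper compares):

```python
def caption_length_score(sample):
    # Single-modality difficulty proxy: longer captions ~ harder samples.
    return len(sample["caption"].split())

def filter_subset(dataset, score_fn, keep_frac=0.5):
    """Filtering: train only on the easiest fraction of the data."""
    ranked = sorted(dataset, key=score_fn)
    return ranked[: int(len(ranked) * keep_frac)]

def curriculum_order(dataset, score_fn):
    """Curriculum learning: present samples in easy-to-hard order."""
    return sorted(dataset, key=score_fn)
```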
ISBN: 9798350365474 (digital); 9798350365481 (print)
Building and maintaining High-Definition (HD) maps represents a large barrier to autonomous vehicle deployment. This, along with advances in modern online map detection models, has sparked renewed interest in the online mapping problem. However, effectively predicting online maps at a quality high enough to enable safe, driverless deployment remains a significant challenge. Recent work proposes training robust online mapping systems on low-quality map priors with synthetic perturbations that attempt to simulate out-of-date HD map priors. In this paper, we investigate how models trained on these synthetically perturbed map priors generalize to deployment-scale, real-world map changes. We present a large-scale experimental study to determine which synthetic perturbations are most useful for generalizing to real-world HD map changes, evaluated on multiple years of real-world autonomous driving data. We show that there is still a substantial sim2real gap between synthetic prior perturbations and observed real-world changes, which limits the utility of current prior-informed HD map prediction models.
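As a hedged sketch of what a synthetic prior perturbation can look like (the drop/jitter scheme and magnitudes are illustrative assumptions, not the schemes evaluated in the paper), one can treat the map prior as a set of polylines and randomly delete or displace elements:

```python
import numpy as np

def perturb_map_prior(polylines, drop_prob=0.1, jitter_std=0.5, rng=None):
    """Simulate an out-of-date HD map prior by randomly deleting map
    elements and jittering the remaining vertices (units: metres)."""
    rng = rng if rng is not None else np.random.default_rng()
    perturbed = []
    for line in polylines:  # each line: (N, 2) array of xy vertices
        if rng.random() < drop_prob:
            continue        # element missing from the stale prior
        perturbed.append(line + rng.normal(0.0, jitter_std, size=line.shape))
    return perturbed
```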
ISBN: 9798350365474 (digital); 9798350365481 (print)
Open-world recognition has recently gained significant attention owing to its ability to bridge the gap between experimental scenarios and real-world applications. Because continual learning can learn from a sequence of dynamic data streams, it finds extensive application in open-world recognition. However, because data annotation is usually time-consuming and labor-intensive in real-world scenarios, it is necessary to develop unsupervised continual learning. Recent studies have begun to investigate unsupervised continual learning (UCL), but mainly focus on rehearsal and regularization strategies to enhance its anti-forgetting capability. In practice, rehearsal and regularization are information-dependent: they require information from previous data as supervised signals, e.g., replayed data or a previous model. In this paper, we propose an information-free method, Alternate Task Discrimination (ATD), a self-supervised pretext task for continual learning that improves anti-forgetting capability by encouraging the model to discriminate which data stream the current sample comes from. The whole process does not rely on any previous information. To perform ATD effectively in the UCL framework, we design an alternating optimization algorithm in which UCL and ATD are optimized in turn. We validate the effectiveness of the proposed method on multiple standard UCL benchmarks, where it obtains considerable improvements over baseline methods. In addition, our approach can be used as a plug-in unit that yields further gains when combined with existing popular UCL methods.
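A hedged sketch of the pretext idea as the abstract states it (a reading, not the authors' code): a lightweight head classifies which data stream a sample's representation comes from, trained with cross-entropy and alternated with the UCL objective:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class StreamDiscriminator(nn.Module):
    """Predicts the index of the data stream a feature came from."""
    def __init__(self, feat_dim, num_streams):
        super().__init__()
        self.head = nn.Linear(feat_dim, num_streams)

    def forward(self, features):
        return self.head(features)

def atd_step(encoder, discriminator, x, stream_id, optimizer):
    """One ATD update: discriminate the current stream without using
    any replayed data or previous model copies."""
    logits = discriminator(encoder(x))
    target = torch.full((x.size(0),), stream_id,
                        dtype=torch.long, device=logits.device)
    loss = F.cross_entropy(logits, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```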
ISBN: 9798350365474 (digital); 9798350365481 (print)
Person ReID systems for rapidly evolving urban surveillance applications are severely challenged by domain shifts, i.e., variations in data distribution that occur across different environments or times. In this paper, we provide the first empirical review of domain shift in person ReID, covering three settings: Unsupervised Domain Adaptation ReID, Domain Generalizable ReID, and Lifelong ReID. We observe that existing approaches only tackle domain shifts caused by the cross-dataset setting, while ignoring intra-dataset attribute domain shifts caused by changes in clothing, shape, or gait, which are very common in ReID. We therefore broaden the research direction in this field by redefining domain shift in ReID as the combination of attribute domain shift and cross-dataset domain shift. With a focus on Lifelong ReID methods, we conduct an extensive comparison on a fair experimental setup and provide an in-depth analysis of these methods under both non-cloth-changing and cloth-changing ReID scenarios. We study the strengths and limitations of these methods based on their performance. This paper outlines future research directions and paves the way for the development of more adaptive, resilient, and enduring cross-domain ReID systems. Code is available here.