检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

8,431 篇 会议
38 篇 期刊文献
7 册 图书

馆藏范围

8,475 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

4,194 篇 工学
- 3,828 篇 计算机科学与技术...
- 2,067 篇 软件工程
- 1,218 篇 光学工程
- 530 篇 控制科学与工程
- 400 篇 信息与通信工程
- 282 篇 仪器科学与技术
- 262 篇 机械工程
- 233 篇 电气工程
- 155 篇 生物工程
- 144 篇 生物医学工程（可授...
- 110 篇 电子科学与技术（可...
- 57 篇 安全科学与工程
- 52 篇 建筑学
- 51 篇 化学工程与技术
- 48 篇 土木工程
- 43 篇 交通运输工程
- 39 篇 力学（可授工学、理...
1,991 篇 理学
- 1,352 篇 物理学
- 1,167 篇 数学
- 415 篇 统计学（可授理学、...
- 212 篇 生物学
- 49 篇 化学
- 34 篇 系统科学
222 篇 管理学
- 160 篇 图书情报与档案管...
- 66 篇 管理科学与工程(可...
- 36 篇 工商管理
202 篇 艺术学
- 202 篇 设计学（可授艺术学...
189 篇 医学
- 188 篇 临床医学
- 38 篇 基础医学(可授医学...
39 篇 法学
- 37 篇 社会学
21 篇 农学
11 篇 教育学
7 篇 经济学
6 篇 军事学

主题

3,323 篇 computer vision
1,174 篇 pattern recognit...
901 篇 cameras
873 篇 conferences
761 篇 computer science
660 篇 image segmentati...
624 篇 layout
573 篇 training
540 篇 shape
507 篇 robustness
451 篇 humans
423 篇 feature extracti...
422 篇 face recognition
402 篇 computational mo...
389 篇 object detection
345 篇 visualization
340 篇 computer archite...
335 篇 application soft...
291 篇 lighting
266 篇 image reconstruc...

机构

41 篇 microsoft resear...
30 篇 department of co...
25 篇 department of co...
23 篇 institute for co...
22 篇 department of co...
22 篇 school of comput...
20 篇 university of sc...
20 篇 swiss fed inst t...
19 篇 institute of com...
18 篇 swiss fed inst t...
17 篇 tsinghua univers...
17 篇 the robotics ins...
17 篇 carnegie mellon ...
17 篇 department of co...
16 篇 computer vision ...
16 篇 school of comput...
15 篇 institute of inf...
15 篇 school of comput...
14 篇 department of in...
14 篇 department of co...

作者

57 篇 timofte radu
25 篇 s.k. nayar
24 篇 van gool luc
24 篇 huang thomas s.
22 篇 nayar shree k.
22 篇 t. kanade
21 篇 jain anil k.
20 篇 t.s. huang
19 篇 luc van gool
18 篇 xiaoou tang
18 篇 horst bischof
17 篇 a.k. jain
17 篇 t. darrell
16 篇 g. healey
16 篇 bowyer kevin w.
16 篇 bischof horst
15 篇 m.j. black
15 篇 m. shah
15 篇 a. zisserman
15 篇 heung-yeung shum

语言

8,459 篇 英文
9 篇 其他
7 篇 中文
1 篇 土耳其文

检索条件"任意字段=1966 IEEE Computer Society Conference on Computer Vision and Pattern Recognition"

共 8476 条记录，以下是111-120 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Scene Graph Driven Text-Prompt Generation for Image Inpainting

Scene Graph Driven Text-Prompt Generation for Image Inpainti...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Shukla, Tripti Maheshwari, Paridhi Singh, Rajhans Shukla, Ankita Kulkarni, Kuldeep Turaga, Pavan Adobe Res India San Jose CA 95110 USA Stanford Univ Stanford CA USA Arizona State Univ Tempe AZ USA

ISBN: (纸本)9798350302493

Scene editing methods are undergoing a revolution, driven by text-to-image synthesis methods. Applications in media content generation have benefited from a careful set of engineered text prompts, that have been arrived at by the artists by trial and error. There is a growing need to better model prompt generation, for it to be useful for a broad range of consumer-grade applications. We propose a novel method for text prompt generation for the explicit purpose of consumer-grade image inpainting, i.e. insertion of new objects into missing regions in an image. Our approach leverages existing inter-object relationships to generate plausible textual descriptions for the missing object, that can then be used with any text-to-image generator. Given an image and a location where a new object is to be inserted, our approach first converts the given image to an intermediate scene graph. Then, we use graph convolutional networks to 'expand' the scene graph by predicting the identity and relationships of the new object to be inserted, with respect to the existing objects in the scene. The output of the expanded scene graph is cast into a textual description, which is then processed by a text-to-image generator, conditioned on the given image, to produce the final inpainted image. We conduct extensive experiments on the Visual Genome dataset, and show through qualitative and quantitative metrics that our method is superior to other methods.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Training Strategies for vision Transformers for Object Detection

Training Strategies for Vision Transformers for Object Detec...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Singh, Apoorv Motional Boston MA 02210 USA

ISBN: (纸本)9798350302493

vision-based Transformer have shown huge application in the perception module of autonomous driving in terms of predicting accurate 3D bounding boxes, owing to their strong capability in modeling long-range dependencies between the visual features. However Transformers, initially designed for language models, have mostly focused on the performance accuracy, and not so much on the inference-time budget. For a safety critical system like autonomous driving, real-time inference at the on-board compute is an absolute necessity. This keeps our object detection algorithm under a very tight run-time budget. In this paper, we evaluated a variety of strategies to optimize on the inference-time of vision transformers based object detection methods keeping a close-watch on any performance variations. Our chosen metric for these strategies is accuracy-runtime joint optimization. Moreover, for actual inference-time analysis we profile our strategies with float32 and float16 precision with TensorRT module. This is the most common format used by the industry for deployment of their Machine Learning networks on the edge devices. We showed that our strategies are able to improve inference-time by 63% at the cost of performance drop of mere 3% for our problem-statement defined in Sec. 3. These strategies brings down vision Transformers detectors [3, 15, 18, 19, 36] inference-time even less than traditional single-image based CNN detectors like FCOS [17, 25, 33]. We recommend practitioners use these techniques to deploy Transformers based hefty multi-view networks on a budge-constrained robotic platform.

关键词： Autonomous vehicles

来源：评论

学校读者我要写书评

暂无评论

Spectral Transfer Guided Active Domain Adaptation For Thermal Imagery

Spectral Transfer Guided Active Domain Adaptation For Therma...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Ustun, Berkcan Kaya, Ahmet Kagan Ayerden, Ezgi Cakir Altinel, Fazil Aselsan Inc Res Ctr Yenimahalle Turkiye Middle East Tech Univ Dept Elect & Elect Engn Ankara Turkiye

ISBN: (纸本)9798350302493

The exploitation of visible spectrum datasets has led deep networks to show remarkable success. However, real-world tasks include low-lighting conditions which arise performance bottlenecks for models trained on large-scale RGB image datasets. Thermal IR cameras are more robust against such conditions. Therefore, the usage of thermal imagery in real-world applications can be useful. Unsupervised domain adaptation (UDA) allows transferring information from a source domain to a fully unlabeled target domain. Despite substantial improvements in UDA, the performance gap between UDA and its supervised learning counterpart remains significant. By picking a small number of target samples to annotate and using them in training, active domain adaptation tries to mitigate this gap with minimum annotation expense. We propose an active domain adaptation method in order to examine the efficiency of combining the visible spectrum and thermal imagery modalities. When the domain gap is considerably large as in the visible-to-thermal task, we may conclude that the methods without explicit domain alignment cannot achieve their full potential. To this end, we propose a spectral transfer guided active domain adaptation method to select the most informative unlabeled target samples while aligning source and target domains. We used the large-scale visible spectrum dataset MS-COCO as the source domain and the thermal dataset FLIR ADAS as the target domain to present the results of our method. Extensive experimental evaluation demonstrates that our proposed method outperforms the state-of-the-art active domain adaptation methods. The code and models are publicly available.(1)

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Proceedings - 2024 ieee/CVF conference on computer vision and pattern recognition, CVPR 2024

Proceedings - 2024 IEEE/CVF Conference on Computer Vision an...

引用

2024 ieee/CVF conference on computer vision and pattern recognition, CVPR 2024

ISBN: (纸本)9798350353006

The proceedings contain 2715 papers. The topics discussed include: revisiting adversarial training at scale;SPIDeRS: structured polarization for invisible depth and reflectance sensing;MA-LMM: memory-augmented large multimodal model for long-term video understanding;geometrically-driven aggregation for zero-shot 3D point cloud understanding;TextCraftor: your text encoder can be image quality controller;ViLa-MIL: dual-scale vision-language multiple instance learning for whole slide image classification;HumanNorm: learning normal diffusion model for high-quality and realistic 3D human generation;AnEmpirical study of scaling law for scene text recognition;improving image restoration through removing degradations in textual representations;and steganographic passport: an owner and user verifiable credential for deep model ip protection without retraining.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Deep Prototypical-Parts Ease Morphological Kidney Stone Identification and are Competitively Robust to Photometric Perturbations

Deep Prototypical-Parts Ease Morphological Kidney Stone Iden...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Flores-Araiza, Daniel Lopez-Tiro, Francisco El-Beze, Jonathan Hubert, Jacques Gonzalez, Miguel Ruiz, Gilberto Ochoa Daul, Christian Tecnol Monterrey Sch Engn Mexico City DF Mexico CHU Nancy Serv Urol Brabois Nancy France Univ Lorraine CRAN UMR 7039 Nancy France

ISBN: (纸本)9798350302493

Identifying the type of kidney stones can allow urologists to determine their cause of formation, improving the prescription of appropriate treatments to diminish future relapses. Currently, the associated ex-vivo diagnosis (known as Morpho-constitutional Analysis, MCA) is time-consuming, expensive and requires a great deal of experience, as it requires a visual analysis component that is highly operator dependant. Recently, machine learning methods have been developed for in-vivo endoscopic stone recognition. Deep Learning (DL) based methods outperform non-DL methods in terms of accuracy but lack explainability. Despite this trade-off, when it comes to making high-stakes decisions, its important to prioritize understandable computer-Aided Diagnosis (CADx) that suggests a course of action based on reasonable evidence, rather than a model prescribing a course of action. In this proposal, we learn Prototypical Parts (PPs) per kidney stone subtype, which are used by the DL model to generate an output classification. Using PPs in the classification task enables case-based reasoning explanations for such output, thus making the model interpretable. In addition, we modify global visual characteristics to describe their relevance to the PPs and the sensitivity of our models performance. With this, we provide explanations with additional information at the sample, class and model levels in contrast to previous works. Although our implementations average accuracy is lower than state-of-the-art (SOTA) non-interpretable DL models by 1.5%, our models perform 2.8% better on perturbed images with a lower standard deviation, without adversarial training. Thus, Learning PPs has the potential to create more robust DL models. Code at: https://***/DanielF29/Prototipical_Parts

关键词： computer aided diagnosis

来源：评论

学校读者我要写书评

暂无评论

Proceedings - 2024 ieee/CVF conference on computer vision and pattern recognition Workshops, CVPRW 2024

Proceedings - 2024 IEEE/CVF Conference on Computer Vision an...

引用

2024 ieee/CVF conference on computer vision and pattern recognition Workshops, CVPRW 2024

ISBN: (纸本)9798350365474

The proceedings contain 802 papers. The topics discussed include: X-VARS: introducing explainability in football refereeing with multi-modal large language models;a hybrid ANN-SNN architecture for low-power and low-latency visual perception;pseudo-label based unsupervised fine-tuning of a monocular 3D pose estimation model for sports motions;towards efficient audio-visual learners via empowering pre-trained vision transformers with cross-modal adaptation;a dual-mode approach for vision-based navigation in a lunar landing scenario;class similarity transition: decoupling class similarities and imbalance from generalized few-shot segmentation;ReweightOOD: loss reweighting for distance-based OOD detection;Hinge-Wasserstein: estimating multimodal aleatoric uncertainty in regression tasks;and ConPro: learning severity representation for medical images using contrastive learning and preference optimization.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Masked Jigsaw Puzzle: A Versatile Position Embedding for vision Transformers

Masked Jigsaw Puzzle: A Versatile Position Embedding for Vis...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Ren, Bin Liu, Yahui Song, Yue Bi, Wei Cucchiara, Rita Sebe, Nicu Wang, Wei Univ Pisa Pisa Italy Univ Trento Trento TN Italy Tencent AI Lab Shenzhen Peoples R China Beijing Jiaotong Univ Beijing Peoples R China Univ Modena & Reggio Emilia Modena Italy

ISBN: (纸本)9798350301298

Position Embeddings (PEs), an arguably indispensable component in vision Transformers (ViTs), have been shown to improve the performance of ViTs on many vision tasks. However, PEs have a potentially high risk of privacy leakage since the spatial information of the input patches is exposed. This caveat naturally raises a series of interesting questions about the impact of PEs on accuracy, privacy, prediction consistency, etc. To tackle these issues, we propose a Masked Jigsaw Puzzle (MJP) position embedding method. In particular, MJP first shuffles the selected patches via our block-wise random jigsaw puzzle shuffle algorithm, and their corresponding PEs are occluded. Meanwhile, for the non-occluded patches, the PEs remain the original ones but their spatial relation is strengthened via our dense absolute localization regressor. The experimental results reveal that 1) PEs explicitly encode the 2D spatial relationship and lead to severe privacy leakage problems under gradient inversion attack;2) Training ViTs with the naively shuffled patches can alleviate the problem, but it harms the accuracy;3) Under a certain shuffle ratio, the proposed MJP not only boosts the performance and robustness on large-scale datasets (i.e., ImageNet-1K and ImageNet-C, -A/O) but also improves the privacy preservation ability under typical gradient attacks by a large margin. The source code and trained models are available at https://***/yhlleo/ MJP.

关键词： Deep learning architectures and techniques

来源：评论

学校读者我要写书评

暂无评论

A case for using rotation invariant features in state of the art feature matchers

A case for using rotation invariant features in state of the...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Bokman, Georg Kahl, Fredrik Chalmers Univ Technol Gothenburg Sweden

ISBN: (纸本)9781665487399

The aim of this paper is to demonstrate that a state of the art feature matcher (LoFTR) can be made more robust to rotations by simply replacing the backbone CNN with a steerable CNN which is equivariant to translations and image rotations. It is experimentally shown that this boost is obtained without reducing performance on ordinary illumination and viewpoint matching sequences.

关键词： computer vision conferences Lighting pattern matching

来源：评论

学校读者我要写书评

暂无评论

Bridging the Gap Between Automated and Human Facial Emotion Perception

Bridging the Gap Between Automated and Human Facial Emotion ...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Stratton, Derek Hand, Emily Univ Nevada Reno Reno NV 89557 USA

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Understanding the complex relationship between emotions and facial expressions is important for both psychologists and computer scientists. A large body of research in psychology investigates facial expressions, emotions, and how emotions are perceived from facial expressions. As computer scientists look to incorporate this research into automatic emotion perception systems, it is important to understand the nature and limitations of human emotion perception. These principles of emotion science affect the way datasets are created, methods are implemented, and results are interpreted in automated emotion perception. This paper aims to distill and align prior work in automated and human facial emotion perception to facilitate future discussions and research at the intersection of the two disciplines.

关键词： Emotion recognition computer vision Uncertainty conferences Sociology Psychology Natural language processing

来源：评论

学校读者我要写书评

暂无评论

Importance is in your attention: agent importance prediction for autonomous driving

Importance is in your attention: agent importance prediction...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Hazard, Christopher Bhagat, Akshay Buddharaju, Balarama Raju Liu, Zhongtao Shao, Yunming Lu, Lu Omari, Sammy Cui, Henggang Motional Boston MA 02210 USA

ISBN: (数字)9781665487399

ISBN: (纸本)9781665487399

Trajectory prediction is an important task in autonomous driving. State-of-the-art trajectory prediction models often use attention mechanisms to model the interaction between agents. In this paper, we show that the attention information from such models can also be used to measure the importance of each agent with respect to the ego vehicle's future planned trajectory. Our experiment results on the nuPlans dataset show that our method can effectively find and rank surrounding agents by their impact on the ego's plan.

关键词： computer vision conferences Predictive models Trajectory pattern recognition Task analysis Autonomous vehicles

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 8 9 10 11 12 13 14 15 16 17 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：