检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

11,745 篇 会议
8 篇 期刊文献

馆藏范围

11,753 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

8,139 篇 工学
- 7,674 篇 计算机科学与技术...
- 804 篇 机械工程
- 580 篇 软件工程
- 376 篇 电气工程
- 252 篇 控制科学与工程
- 208 篇 光学工程
- 85 篇 生物工程
- 83 篇 信息与通信工程
- 29 篇 生物医学工程（可授...
- 23 篇 电子科学与技术（可...
- 21 篇 化学工程与技术
- 15 篇 交通运输工程
- 14 篇 安全科学与工程
- 10 篇 网络空间安全
- 8 篇 仪器科学与技术
- 6 篇 材料科学与工程（可...
- 6 篇 动力工程及工程热...
3,194 篇 医学
- 3,190 篇 临床医学
- 11 篇 基础医学(可授医学...
- 7 篇 公共卫生与预防医...
481 篇 理学
- 216 篇 物理学
- 203 篇 系统科学
- 88 篇 生物学
- 55 篇 数学
- 29 篇 统计学（可授理学、...
- 24 篇 化学
55 篇 管理学
- 29 篇 图书情报与档案管...
- 28 篇 管理科学与工程(可...
- 12 篇 工商管理
17 篇 法学
- 15 篇 社会学
6 篇 农学
4 篇 教育学
2 篇 经济学
1 篇 军事学
1 篇 艺术学

主题

5,434 篇 computer vision
2,516 篇 training
2,087 篇 pattern recognit...
1,621 篇 computational mo...
1,435 篇 visualization
1,306 篇 three-dimensiona...
1,060 篇 semantics
981 篇 codes
968 篇 benchmark testin...
898 篇 computer archite...
884 篇 deep learning
762 篇 task analysis
681 篇 feature extracti...
536 篇 face recognition
527 篇 conferences
515 篇 transformers
515 篇 neural networks
479 篇 object detection
466 篇 image segmentati...
454 篇 cameras

机构

168 篇 univ sci & techn...
144 篇 univ chinese aca...
144 篇 tsinghua univ pe...
143 篇 carnegie mellon ...
135 篇 chinese univ hon...
112 篇 peng cheng lab p...
108 篇 zhejiang univ pe...
97 篇 swiss fed inst t...
92 篇 tsinghua univers...
92 篇 sensetime res pe...
88 篇 shanghai ai lab ...
85 篇 zhejiang univers...
84 篇 shanghai jiao to...
78 篇 peng cheng labor...
77 篇 university of sc...
77 篇 alibaba grp peop...
76 篇 univ hong kong p...
76 篇 tech univ munich...
76 篇 stanford univ st...
73 篇 university of ch...

作者

76 篇 timofte radu
64 篇 van gool luc
50 篇 zhang lei
44 篇 yang yi
40 篇 loy chen change
34 篇 tao dacheng
32 篇 liu yang
32 篇 chen chen
30 篇 zhou jie
30 篇 tian qi
30 篇 sun jian
28 篇 zha zheng-jun
27 篇 qi tian
26 篇 li xin
26 篇 vasconcelos nuno
26 篇 ying shan
25 篇 liu xiaoming
25 篇 luc van gool
25 篇 boxin shi
24 篇 zheng wei-shi

语言

11,746 篇 英文
7 篇 其他

检索条件"任意字段=2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023"

共 11753 条记录，以下是131-140 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

Topology-Guided Multi-Class Cell Context Generation for Digital Pathology

Topology-Guided Multi-Class Cell Context Generation for Digi...

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Ahousamra, Shahira Gupta, Rajarsi Kurc, Tahsin Samaras, Dimitris Saltz, Joel Chen, Chao SUNY Stony Brook Dept Comp Sci Stony Brook NY 11790 USA SUNY Stony Brook Dept Biomed Informat Stony Brook NY USA

ISBN: (纸本)9798350301298

In digital pathology, the spatial context of cells is important for cell classification, cancer diagnosis and prognosis. To model such complex cell context, however, is challenging. Cells form different mixtures, lineages, clusters and holes. To model such structural patterns in a learnable fashion, we introduce several mathematical tools from spatial statistics and topological data analysis. We incorporate such structural descriptors into a deep generative model as both conditional inputs and a differentiable loss. This way, we are able to generate high quality multi-class cell layouts for the first time. We show that the topology-rich cell layouts can be used for data augmentation and improve the performance of downstream tasks such as cell classification.

关键词： cell microscopy Medical and biological vision

来源：评论

学校读者我要写书评

暂无评论

Data-driven Feature Tracking for Event Cameras

Data-driven Feature Tracking for Event Cameras

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Messikommer, Nico Fang, Carter Gehrig, Mathias Scaramuzza, Davide Univ Zurich Robot & Percept Grp Zurich Switzerland

ISBN: (纸本)9798350301298

Because of their high temporal resolution, increased resilience to motion blur, and very sparse output, event cameras have been shown to be ideal for low-latency and low-bandwidth feature tracking, even in challenging scenarios. Existing feature tracking methods for event cameras are either handcrafted or derived from first principles but require extensive parameter tuning, are sensitive to noise, and do not generalize to different scenarios due to unmodeled effects. To tackle these deficiencies, we introduce the first data-driven feature tracker for event cameras, which leverages low-latency events to track features detected in a grayscale frame. We achieve robust performance via a novel frame attention module, which shares information across feature tracks. By directly transferring zero-shot from synthetic to real data, our data-driven tracker outperforms existing approaches in relative feature age by up to 120% while also achieving the lowest latency. This performance gap is further increased to 130% by adapting our tracker to real data with a novel self-supervision strategy.

关键词： Low-level vision

来源：评论

学校读者我要写书评

暂无评论

Balanced Energy Regularization Loss for Out-of-distribution Detection

Balanced Energy Regularization Loss for Out-of-distribution ...

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Choi, Hyunjun Jeong, Hawook Choi, Jin Young Seoul Natl Univ ASRI ECE Seoul South Korea RideFlux Inc Jeju South Korea

ISBN: (纸本)9798350301298

In the field of out-of-distribution (OOD) detection, a previous method that use auxiliary data as OOD data has shown promising performance. However, the method provides an equal loss to all auxiliary data to differentiate them from inliers. However, based on our observation, in various tasks, there is a general imbalance in the distribution of the auxiliary OOD data across classes. We propose a balanced energy regularization loss that is simple but generally effective for a variety of tasks. Our balanced energy regularization loss utilizes class-wise different prior probabilities for auxiliary data to address the class imbalance in OOD data. The main concept is to regularize auxiliary samples from majority classes, more heavily than those from minority classes. Our approach performs better for OOD detection in semantic segmentation, long-tailed image classification, and image classification than the prior energy regularization loss. Furthermore, our approach achieves state-of-the-art performance in two tasks: OOD detection in semantic segmentation and long-tailed image classification.

关键词： vision applications and systems

来源：评论

学校读者我要写书评

暂无评论

GRES: Generalized Referring Expression Segmentation

GRES: Generalized Referring Expression Segmentation

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Liu, Chang Ding, Henghui Jiang, Xudong Nanyang Technol Univ Singapore Singapore

ISBN: (纸本)9798350301298

Referring Expression Segmentation (RES) aims to generate a segmentation mask for the object described by a given language expression. Existing classic RES datasets and methods commonly support single-target expressions only, i.e., one expression refers to one target object. Multi-target and no-target expressions are not considered. This limits the usage of RES in practice. In this paper, we introduce a new benchmark called Generalized Referring Expression Segmentation (GRES), which extends the classic RES to allow expressions to refer to an arbitrary number of target objects. Towards this, we construct the first largescale GRES dataset called gRefCOCO that contains multitarget, no-target, and single-target expressions. GRES and gRefCOCO are designed to be well-compatible with RES, facilitating extensive experiments to study the performance gap of the existing RES methods on the GRES task. In the experimental study, we find that one of the big challenges of GRES is complex relationship modeling. Based on this, we propose a region-based GRES baseline ReLA that adaptively divides the image into regions with subinstance clues, and explicitly models the region-region and region-language dependencies. The proposed approach ReLA achieves new state-of-the-art performance on the both newly proposed GRES and classic RES tasks. The proposed gRefCOCO dataset and method are available at https://***/GRES.

关键词： language reasoning vision

来源：评论

学校读者我要写书评

暂无评论

Two-way Multi-Label Loss

Two-way Multi-Label Loss

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Kobayashi, Takumi Natl Inst Adv Ind Sci & Technol Tokyo Japan Univ Tsukuba Tsukuba Japan

ISBN: (纸本)9798350301298

A natural image frequently contains multiple classification targets, accordingly providing multiple class labels rather than a single label per image. While the single-label classification is effectively addressed by applying a softmax cross-entropy loss, the multi-label task is tackled mainly in a binary cross-entropy (BCE) framework. In contrast to the softmax loss, the BCE loss involves issues regarding imbalance as multiple classes are decomposed into a bunch of binary classifications;recent works improve the BCE loss to cope with the issue by means of weighting. In this paper, we propose a multi-label loss by bridging a gap between the softmax loss and the multi-label scenario. The proposed loss function is formulated on the basis of relative comparison among classes which also enables us to further improve discriminative power of features by enhancing classification margin. The loss function is so flexible as to be applicable to a multi-label setting in two ways for discriminating classes as well as samples. In the experiments on multi-label classification, the proposed method exhibits competitive performance to the other multi-label losses, and it also provides transferrable features on single-label ImageNet training. Codes are available at https: //***/tk1980/TwowayMultiLabelLoss.

关键词： detection recognition: Categorization retrieval

来源：评论

学校读者我要写书评

暂无评论

Iterative vision-and-Language Navigation

Iterative Vision-and-Language Navigation

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Krantz, Jacob Banerjee, Shurjo Zhu, Wang Corso, Jason Anderson, Peter Lee, Stefan Thomason, Jesse Oregon State Univ Corvallis OR 97331 USA Univ Michigan Ann Arbor MI USA Univ Southern Calif Los Angeles CA USA Google Res Sunnyvale CA USA

ISBN: (纸本)9798350301298

We present Iterative vision-and-Language Navigation (IVLN), a paradigm for evaluating language-guided agents navigating in a persistent environment over time. Existing vision-and-Language Navigation (VLN) benchmarks erase the agent's memory at the beginning of every episode, testing the ability to perform cold-start navigation with no prior information. However, deployed robots occupy the same environment for long periods of time. The IVLN paradigm addresses this disparity by training and evaluating VLN agents that maintain memory across tours of scenes that consist of up to 100 ordered instruction-following Room-to-Room (R2R) episodes, each defined by an individual language instruction and a target path. We present discrete and continuous Iterative Room-to-Room (IR2R) benchmarks comprising about 400 tours each in 80 indoor scenes. We find that extending the implicit memory of high-performing transformer VLN agents is not sufficient for IVLN, but agents that build maps can benefit from environment persistence, motivating a renewed focus on map-building agents in VLN.

关键词： language reasoning vision

来源：评论

学校读者我要写书评

暂无评论

Disentangled Representation Learning for Unsupervised Neural Quantization

Disentangled Representation Learning for Unsupervised Neural...

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Noh, Haechan Hyun, Sangeek Jeong, Woojin Lim, Hanshin Heo, Jae-Pil Sungkyunkwan Univ Seoul South Korea Elect & Telecommun Res Inst Daejeon South Korea

ISBN: (纸本)9798350301298

The inverted index is a widely used data structure to avoid the infeasible exhaustive search. It accelerates retrieval significantly by splitting the database into multiple disjoint sets and restricts distance computation to a small fraction of the database. Moreover, it even improves search quality by allowing quantizers to exploit the compact distribution of residual vector space. However, we firstly point out a problem that an existing deep learning-based quantizer hardly benefits from the residual vector space, unlike conventional shallow quantizers. To cope with this problem, we introduce a novel disentangled representation learning for unsupervised neural quantization. Similar to the concept of residual vector space, the proposed method enables more compact latent space by disentangling information of the inverted index from the vectors. Experimental results on large-scale datasets confirm that our method outperforms the state-of-the-art retrieval systems by a large margin.

关键词： detection recognition: Categorization retrieval

来源：评论

学校读者我要写书评

暂无评论

Rethinking Feature-based Knowledge Distillation for Face recognition

Rethinking Feature-based Knowledge Distillation for Face Rec...

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Li, Jingzhi Guo, Zidong Li, Hui Han, Seungju Baek, Ji-won Yang, Min Yang, Ran Suh, Sungjoo Samsung R&D Inst China Xian SRCX Xian Peoples R China Samsung Adv Inst Technol SAIT Suwon South Korea

ISBN: (纸本)9798350301298

With the continual expansion of face datasets, feature-based distillation prevails for large-scale face recognition. In this work, we attempt to remove identity supervision in student training, to spare the GPU memory from saving massive class centers. However, this naive removal leads to inferior distillation result. We carefully inspect the performance degradation from the perspective of intrinsic dimension, and argue that the gap in intrinsic dimension, namely the intrinsic gap, is intimately connected to the infamous capacity gap problem. By constraining the teacher's search space with reverse distillation, we narrow the intrinsic gap and unleash the potential of feature-only distillation. Remarkably, the proposed reverse distillation creates universally student-friendly teacher that demonstrates outstanding student improvement. We further enhance its effectiveness by designing a student proxy to better bridge the intrinsic gap. As a result, the proposed method surpasses state-of-the-art distillation techniques with identity supervision on various face recognition benchmarks, and the improvements are consistent across different teacher-student pairs.

关键词： body gesture Humans: Face movement pose

来源：评论

学校读者我要写书评

暂无评论

1% VS 100%: Parameter-Efficient Low Rank Adapter for Dense Predictions

1% VS 100%: Parameter-Efficient Low Rank Adapter for Dense P...

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Yin, Dongshuo Yang, Yiran Wang, Zhechao Yu, Hongfeng Wei, Kaiwen Sun, Xian Chinese Acad Sci Aerosp Informat Res Inst Key Lab Network Informat Syst Technol Beijing Peoples R China Univ Chinese Acad Sci Sch Elect Elect & Commun Engn Beijing Peoples R China

ISBN: (纸本)9798350301298

Fine-tuning large-scale pre-trained vision models to downstream tasks is a standard technique for achieving state-of-the-art performance on computer vision benchmarks. However, fine-tuning the whole model with millions of parameters is inefficient as it requires storing a same-sized new model copy for each task. In this work, we propose LoRand, a method for fine-tuning large-scale vision models with a better trade-off between task performance and the number of trainable parameters. LoRand generates tiny adapter structures with low-rank synthesis while keeping the original backbone parameters fixed, resulting in high parameter sharing. To demonstrate LoRand's effectiveness, we implement extensive experiments on object detection, semantic segmentation, and instance segmentation tasks. By only training a small percentage (1% to 3%) of the pre-trained backbone parameters, LoRand achieves comparable performance to standard fine-tuning on COCO and ADE20K and outperforms fine-tuning in low-resource PASCAL VOC dataset.

关键词： Efficient and scalable vision

来源：评论

学校读者我要写书评

暂无评论

NVTC: Nonlinear Vector Transform Coding

NVTC: Nonlinear Vector Transform Coding

引用

ieee/cvf conference on computer vision and pattern recognition (cvpr)

作者： Feng, Runsen Guo, Zongyu Li, Weiping Chen, Zhibo Univ Sci & Technol China Hefei Peoples R China

ISBN: (纸本)9798350301298

In theory, vector quantization (VQ) is always better than scalar quantization (SQ) in terms of rate-distortion (RD) performance [33]. Recent state-of-the-art methods for neural image compression are mainly based on nonlinear transform coding (NTC) with uniform scalar quantization, overlooking the benefits of VQ due to its exponentially increased complexity. In this paper, we first investigate on some toy sources, demonstrating that even if modern neural networks considerably enhance the compression performance of SQ with nonlinear transform, there is still an insurmountable chasm between SQ and VQ. Therefore, revolving around VQ, we propose a novel framework for neural image compression named Nonlinear Vector Transform Coding (NVTC). NVTC solves the critical complexity issue of VQ through (1) a multi-stage quantization strategy and (2) nonlinear vector transforms. In addition, we apply entropy-constrained VQ in latent space to adaptively determine the quantization boundaries for joint rate-distortion optimization, which improves the performance both theoretically and experimentally. Compared to previous NTC approaches, NVTC demonstrates superior rate-distortion performance, faster decoding speed, and smaller model size. Our code is available at https://***/USTC-IMCL/NVTC.

关键词： Low-level vision

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 10 11 12 13 14 15 16 17 18 19 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：