ISBN (digital): 9798350365474
ISBN (print): 9798350365481
We ask whether 3D objects can be reconstructed from real-world data collected for some other purpose, such as autonomous driving or augmented reality, thus inferring objects only incidentally. 3D reconstruction from incidental data is a major challenge because, in addition to significant noise, only a few views of each object are observed, which are insufficient for reconstruction. We approach this problem as a co-reconstruction task, in which multiple objects are reconstructed together, learning shape and appearance priors for regularization. To do so, we introduce a neural radiance field that is conditioned, via an attention mechanism, on the identity of the individual objects. We further disentangle shape from appearance, and diffuse color from specular color, via an asymmetric two-stream network that factors shared information from instance-specific details. We demonstrate the ability of this method to reconstruct full 3D objects from partial, incidental observations in autonomous driving and other datasets.
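As a rough illustration of how such identity conditioning might be wired up, the sketch below conditions a radiance-field MLP on per-object latent codes through cross-attention and splits the output into a shared density/diffuse stream and an instance-specific specular stream. All names, layer sizes, and the 63-dimensional positional encoding are our own assumptions, not the paper's architecture.

```python
import torch
import torch.nn as nn

class IdentityConditionedField(nn.Module):
    """Sketch: radiance field conditioned on object identity codes
    via cross-attention (hypothetical sizes, not the paper's model)."""
    def __init__(self, num_objects, code_dim=64, hidden=128):
        super().__init__()
        self.codes = nn.Embedding(num_objects, code_dim)  # one code per instance
        self.attn = nn.MultiheadAttention(code_dim, num_heads=4, batch_first=True)
        self.pos_proj = nn.Linear(63, code_dim)  # positionally encoded 3D point
        # asymmetric two streams: shared shape/diffuse vs. instance specular
        self.shared = nn.Sequential(nn.Linear(code_dim, hidden), nn.ReLU(),
                                    nn.Linear(hidden, 1 + 3))  # density + diffuse RGB
        self.specular = nn.Sequential(nn.Linear(2 * code_dim, hidden), nn.ReLU(),
                                      nn.Linear(hidden, 3))    # specular RGB

    def forward(self, x_enc, obj_id):
        q = self.pos_proj(x_enc).unsqueeze(1)              # (B, 1, D) query per point
        kv = self.codes.weight.unsqueeze(0).expand(q.size(0), -1, -1)
        ctx, _ = self.attn(q, kv, kv)                      # attend over identity codes
        ctx = ctx.squeeze(1)
        sigma_diffuse = self.shared(ctx)                   # shared-stream output
        spec = self.specular(torch.cat([ctx, self.codes(obj_id)], dim=-1))
        return sigma_diffuse[:, :1], sigma_diffuse[:, 1:], spec
```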
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Diffusion models have found valuable applications in anomaly detection by capturing the nominal data distribution and identifying anomalies via reconstruction. Despite their merits, they struggle to localize anomalies of varying scales, especially larger anomalies such as entire missing components. Addressing this, we present a novel framework that enhances the capability of diffusion models by extending the previously introduced implicit conditioning approach [24] in three significant ways. First, we incorporate a dynamic step-size computation that allows for variable noising steps in the forward process, guided by an initial anomaly prediction. Second, we demonstrate that denoising an input that is only scaled, without any added noise, outperforms the conventional denoising process. Third, we project images into a latent space to abstract away from fine details that interfere with the reconstruction of large missing components. Additionally, we propose a fine-tuning mechanism that helps the model grasp the nuances of the target domain. Our method undergoes rigorous evaluation on the prominent anomaly detection datasets VisA, BTAD, and MVTec, yielding strong performance. Importantly, our framework effectively localizes anomalies regardless of their scale, marking a pivotal advancement in diffusion-based anomaly detection.
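The first two ideas lend themselves to a compact sketch: an initial anomaly estimate picks the noising depth, and the reverse chain starts from an input that is only scaled by the forward-process mean coefficient, with no Gaussian noise added. The step bounds, the `eps_model(x, t)` signature, and the plain-DDPM reverse update are our assumptions; the paper's exact schedule may differ.

```python
import torch

def dynamic_num_steps(anomaly_score, t_min=50, t_max=400):
    # Larger initial anomaly estimates get a deeper noising depth so the
    # model can "repaint" larger missing components (bounds are illustrative).
    s = float(torch.clamp(anomaly_score, 0.0, 1.0))
    return int(t_min + s * (t_max - t_min))

@torch.no_grad()
def denoise_scaled_input(eps_model, betas, x0, t_start):
    """Start the reverse chain from a scaled (not noised) input.

    `eps_model(x, t)` is an assumed noise-prediction interface; `betas`
    is the forward-process variance schedule as a 1-D tensor."""
    alphas = 1.0 - betas
    alpha_bar = torch.cumprod(alphas, dim=0)
    x = alpha_bar[t_start].sqrt() * x0              # scaling only, no added noise
    for t in range(t_start, 0, -1):
        eps = eps_model(x, torch.full((x.shape[0],), t))
        coef = betas[t] / (1.0 - alpha_bar[t]).sqrt()
        x = (x - coef * eps) / alphas[t].sqrt()     # standard DDPM posterior mean
        if t > 1:
            x = x + betas[t].sqrt() * torch.randn_like(x)
    return x
```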
Modern agricultural applications increasingly rely on deep learning solutions. However, training well-performing deep networks requires a large amount of annotated data that may not be available and, in the case of 3D annotation, may not even be feasible for human annotators. In this work, we develop a deep learning approach to segment mushrooms and estimate their pose from 3D data, in the form of point clouds acquired by depth sensors. To circumvent the annotation problem, we create a synthetic dataset of mushroom scenes for which full 3D information, such as the pose of each mushroom, is known. The proposed network has a fully convolutional backbone that parses sparse 3D data and predicts pose information that implicitly defines both the instance segmentation and pose estimation tasks. We validate the effectiveness of the proposed implicit approach on a synthetic test set and provide qualitative results on a small set of real point clouds acquired with depth sensors.
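One common way to make per-point pose predictions implicitly define instances is centre voting: each point predicts an offset to its mushroom's centre, and clustering the shifted points yields both instance labels and centre estimates. The sketch below illustrates that idea; the voting scheme and DBSCAN parameters are our assumptions, not necessarily the paper's prediction head.

```python
import numpy as np
from sklearn.cluster import DBSCAN

def instances_from_votes(points, pred_offsets, eps=0.01, min_samples=20):
    """Per-point votes for the mushroom centre implicitly define instances:
    shifted points from the same mushroom collapse together, so a
    density-based clustering recovers instance labels and centres."""
    votes = points + pred_offsets                 # each point votes for its centre
    labels = DBSCAN(eps=eps, min_samples=min_samples).fit_predict(votes)
    centres = [votes[labels == k].mean(axis=0) for k in range(labels.max() + 1)]
    return labels, np.array(centres)              # label -1 marks unassigned points
```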
ISBN (print): 9781665458245
The large-scale use of surveillance cameras in public spaces has raised severe concerns about individual privacy breaches. Introducing privacy and security into video surveillance systems, primarily into person re-identification (re-id), is quite challenging. Event cameras are novel sensors that respond only to brightness changes in the scene. This characteristic makes event-based vision sensors viable for privacy-preserving video surveillance. To integrate privacy into person re-id, this work investigates, for the first time, the possibility of performing person re-id with an event-camera network. We transform the asynchronous event stream generated by an event camera into synchronous image-like representations to leverage deep learning models, and then evaluate how complex the re-id problem is with this new sensor modality. Interestingly, such event-based representations contain meaningful spatial details that are very similar to standard edges and contours. We use two different representations: image-like frames and their transformation to polar coordinates (which carry more distinct edge patterns). Finally, we train a person re-id model on such images to demonstrate the feasibility of performing event-driven re-id. We evaluate the performance of our approach and produce baseline results on two synthetic datasets (generated from the publicly available datasets SAIVT and DukeMTMC-reid).
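A minimal version of the two representations described above, assuming the event stream has already been decoded into coordinate and polarity arrays (function names and normalization choices are ours, not the paper's exact pipeline):

```python
import numpy as np
import cv2

def events_to_frame(xs, ys, ps, h, w):
    """Accumulate signed event polarities into an image-like frame."""
    frame = np.zeros((h, w), np.float32)
    np.add.at(frame, (ys, xs), np.where(ps > 0, 1.0, -1.0))
    frame = cv2.normalize(frame, None, 0, 255, cv2.NORM_MINMAX)
    return frame.astype(np.uint8)

def to_polar(frame):
    """Remap the frame to polar coordinates to emphasise edge patterns."""
    h, w = frame.shape
    centre = (w / 2.0, h / 2.0)
    max_radius = np.hypot(w / 2.0, h / 2.0)
    return cv2.linearPolar(frame, centre, max_radius, cv2.WARP_FILL_OUTLIERS)
```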
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Multi-modality information fusion can compensate for the deficiencies of a single modality and provide rich scene information for 2D semantic segmentation. However, inconsistency in the feature space between different modalities may lead to poor representation of objects, which affects subsequent segmentation effectiveness. Modality transition can reduce modal differences and avoid biased processing during fusion, but it is hard to fully retain the content of the source images. To address these challenges, a fusion method based on dual-cycle cross-awareness of the structure tensor is proposed. First, we propose a dual-cycle modality transition network based on cross-awareness consistency to learn the differences in feature space between modalities. Second, a set of global structure-tensor-preserving modules is designed to enhance the network's ability to capture complementary features and perceive global modal consistency. Under the joint constraint of the global structure-tensor awareness loss and the cross-awareness loss, our network achieves a robust mapping of the feature space from visible to pseudo-infrared images without relying on ground truth. Finally, the pseudo-infrared images, which inherit the superior qualities of both modalities, are fused directly with the original infrared images, effectively reducing the complexity of fusion. Extensive comparative experiments show that our method outperforms state-of-the-art methods in qualitative and quantitative evaluation.
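The structure tensor at the heart of the proposed modules is a classical quantity; below is a small sketch of how it could be computed and compared between a source image and its pseudo-infrared counterpart. The L1 comparison is a hypothetical stand-in for the paper's global structure-tensor awareness loss, not its actual formulation.

```python
import numpy as np
from scipy.ndimage import gaussian_filter, sobel

def structure_tensor(img, sigma=1.5):
    """Classic 2x2 structure tensor, smoothed component-wise."""
    ix = sobel(img.astype(np.float64), axis=1)   # horizontal gradient
    iy = sobel(img.astype(np.float64), axis=0)   # vertical gradient
    return (gaussian_filter(ix * ix, sigma),
            gaussian_filter(ix * iy, sigma),
            gaussian_filter(iy * iy, sigma))

def structure_tensor_distance(img_a, img_b):
    # Mean absolute difference between the two tensors; a hypothetical
    # stand-in for the paper's global structure-tensor awareness loss.
    return sum(np.abs(a - b).mean()
               for a, b in zip(structure_tensor(img_a), structure_tensor(img_b)))
```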
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Multi-Target Multi-Camera tracking (MTMC) surpasses conventional single-camera tracking by enabling seamless object tracking across multiple camera views. This capability is critical for security systems and for improving situational awareness in various environments. This paper proposes a novel MTMC framework designed for online operation. The framework employs a three-stage pipeline: Multi-Object Tracking (MOT), Multi-Target Multi-Camera tracking (MTMC), and Cross Interval Synchronization (CIS). In the MOT stage, ReID features are extracted and localized tracklets are created. MTMC links these tracklets across cameras using spatial-temporal constraints and constrained hierarchical clustering with anchor features for improved inter-camera association. Finally, CIS ensures the temporal coherence of tracklets across time intervals. The proposed framework achieves robust tracking performance, validated on the challenging 2024 AI City Challenge with a HOTA score of 51.0556%, ranking sixth. The code is available at: https://***/ARV-MLCORE/AIC2024Track1ARV
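To make the inter-camera association step concrete, the sketch below clusters tracklet ReID features hierarchically while forbidding merges between tracklets that overlap in time on the same camera. The distance construction and threshold are illustrative assumptions, not the challenge entry's exact implementation (see the linked repository for that).

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster
from scipy.spatial.distance import squareform

def cluster_tracklets(feats, cam_ids, t_ranges, thresh=0.4):
    """Group tracklets into global identities by agglomerative clustering.

    Tracklets from the same camera whose time ranges overlap cannot belong
    to the same person, so their pairwise distance is made prohibitive."""
    n = len(feats)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            a, b = feats[i], feats[j]
            cos_d = 1.0 - a @ b / (np.linalg.norm(a) * np.linalg.norm(b))
            overlap = (cam_ids[i] == cam_ids[j]
                       and t_ranges[i][0] < t_ranges[j][1]
                       and t_ranges[j][0] < t_ranges[i][1])
            dist[i, j] = dist[j, i] = 1e6 if overlap else cos_d
    z = linkage(squareform(dist), method='average')
    return fcluster(z, t=thresh, criterion='distance')  # global ID per tracklet
```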
In this paper, we tackle the problem of few-shot class incremental learning (FSCIL). FSCIL aims to incrementally learn new classes with only a few samples in each class. Most existing methods only consider the increme...
Contrastive learning methods have been applied to a range of domains and modalities by training models to identify similar "views" of data points. However, specialized scientific modalities pose a challenge for this paradigm, as identifying good views for each scientific instrument is complex and time-intensive. In this paper, we focus on applying contrastive learning approaches to a variety of remote sensing datasets. We show that Viewmaker networks, a recently proposed method for generating views without extensive domain knowledge, can produce useful views in this setting. We also present a Viewmaker variant called Divmaker, which achieves similar performance and does not require adversarial optimization. Applying both methods to four multispectral imaging problems, each with a different format, we find that Viewmaker and Divmaker can outperform cropping- and reflection-based methods for contrastive learning in every case when evaluated on downstream classification tasks. This provides additional evidence that domain-agnostic methods can empower contrastive learning to scale to real-world scientific domains. Open source code can be found at https://***/jbayrooti/divmaker.
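The core Viewmaker idea of learning budgeted perturbations as views can be sketched compactly: a small network maps an input plus a random noise channel to an additive perturbation, which is projected onto an L1 ball so the view stays close to the original. Layer sizes and the budget value here are illustrative; see the linked repository for the actual Divmaker code.

```python
import torch
import torch.nn as nn

class ViewGenerator(nn.Module):
    """Tiny stand-in for a Viewmaker-style network: an extra random noise
    channel makes views stochastic, and the perturbation is projected onto
    an L1 ball so views stay close to the input."""
    def __init__(self, channels=3, budget=0.05):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(channels + 1, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, channels, 3, padding=1))
        self.budget = budget

    def forward(self, x):
        noise = torch.rand_like(x[:, :1])             # stochastic view channel
        delta = self.net(torch.cat([x, noise], dim=1))
        # scale each perturbation so its L1 norm fits the per-pixel budget
        l1 = delta.flatten(1).abs().sum(dim=1) + 1e-8
        scale = (self.budget * x[0].numel() / l1).clamp(max=1.0)
        delta = delta * scale.view(-1, 1, 1, 1)
        return (x + delta).clamp(0.0, 1.0)
```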
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Representation learning from Gigapixel Whole Slide Images (WSI) poses a significant challenge in computational pathology due to the complicated nature of tissue structures and the scarcity of labeled data. Multi-instance learning (MIL) methods have addressed this challenge by leveraging image patches to classify slides, using feature encoders pretrained with Self-Supervised Learning (SSL) approaches. The performance of both SSL and MIL methods relies on the architecture of the feature encoder. This paper proposes leveraging the vision Mamba (Vim) architecture, inspired by state space models, within the DINO framework for representation learning in computational pathology. We evaluate the performance of Vim against vision Transformers (ViT) on the Camelyon16 dataset for both patch-level and slide-level classification. Our findings highlight Vim's enhanced performance compared to ViT, particularly at smaller scales, where Vim achieves an 8.21-point increase in ROC AUC for models of similar size. An explainability analysis further reveals that Vim, unlike ViT, emulates the pathologist's workflow. This alignment with human expert analysis highlights Vim's potential in practical diagnostic settings and contributes significantly to developing effective representation-learning algorithms in computational pathology. We release the code and pretrained weights at https://***/AtlasAnalyticsLab/Vim4Path.
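For the slide-level stage, a generic gated-attention MIL head over frozen patch embeddings (from Vim or ViT) looks roughly like the following. This is the standard attention-pooling formulation, not necessarily the exact head used in the paper; the embedding dimension is an assumption.

```python
import torch
import torch.nn as nn

class GatedAttentionMIL(nn.Module):
    """Gated-attention MIL pooling over patch embeddings from a frozen
    encoder; the standard formulation, not the paper's specific head."""
    def __init__(self, dim=384, hidden=128, n_classes=2):
        super().__init__()
        self.attn_v = nn.Sequential(nn.Linear(dim, hidden), nn.Tanh())
        self.attn_u = nn.Sequential(nn.Linear(dim, hidden), nn.Sigmoid())
        self.attn_w = nn.Linear(hidden, 1)
        self.cls = nn.Linear(dim, n_classes)

    def forward(self, patches):                    # (n_patches, dim) per slide
        a = self.attn_w(self.attn_v(patches) * self.attn_u(patches))
        a = torch.softmax(a, dim=0)                # attention over instances
        slide_feat = (a * patches).sum(dim=0)      # weighted bag embedding
        return self.cls(slide_feat), a             # slide logits + patch weights
```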
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Deep neural networks have proven susceptible to adversarial-example attacks in recent years. Black-box attacks in particular pose a serious threat to practical applications. However, while most existing black-box attacks achieve a high success rate in deceiving models, they have not focused on the stealthiness of adversarial examples, which often exhibit suspicious visual appearance. To address this issue, this paper proposes the Mask Momentum Iterative Attack (MMIA), which introduces a masking mechanism and adopts an optimal perturbation strategy to identify the regions of an image most vulnerable to attack. This approach effectively ensures both the transferability and the stealthiness of adversarial examples. Simultaneously, by integrating image enhancement techniques and temporal and spatial momentum terms into the iterative process of the attack, we prevent the attack from getting stuck in local optima, further improving the transferability of adversarial examples. To enhance the success rate of black-box attacks, we apply MMIA to a model ensemble using a joint optimization strategy. We demonstrate that adversarially trained models with strong defensive ability are also susceptible to our black-box attacks. We conduct extensive experiments on classification tasks using common vision models, and our results demonstrate the superiority of our method over state-of-the-art approaches when considering both transferability and stealthiness.
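A simplified sketch of the masked momentum iteration, restricted to a precomputed vulnerability mask: the mask-selection, image-enhancement, and ensemble components are omitted, and all hyperparameters are illustrative rather than the paper's settings.

```python
import torch
import torch.nn.functional as F

def masked_momentum_attack(model, x, y, mask, eps=8/255, alpha=2/255,
                           steps=10, mu=1.0):
    """Momentum iterative attack restricted to a precomputed mask of
    vulnerable regions (a simplified sketch, not the full MMIA)."""
    x_adv, g = x.clone(), torch.zeros_like(x)
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad, = torch.autograd.grad(loss, x_adv)
        # momentum accumulation with L1-normalized gradients (as in MI-FGSM)
        g = mu * g + grad / grad.abs().mean(dim=(1, 2, 3), keepdim=True)
        with torch.no_grad():
            x_adv = x_adv + alpha * g.sign() * mask   # perturb masked region only
            x_adv = x + (x_adv - x).clamp(-eps, eps)  # L-inf projection
            x_adv = x_adv.clamp(0.0, 1.0)
    return x_adv.detach()
```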