检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

8,905 篇 会议
43 篇 期刊文献
18 册 图书

馆藏范围

8,965 篇 电子文献
1 种 纸本馆藏

日期分布

学科分类号

4,564 篇 工学
- 4,024 篇 计算机科学与技术...
- 2,182 篇 软件工程
- 1,241 篇 光学工程
- 558 篇 控制科学与工程
- 433 篇 信息与通信工程
- 430 篇 机械工程
- 294 篇 电气工程
- 288 篇 仪器科学与技术
- 179 篇 生物工程
- 159 篇 生物医学工程（可授...
- 119 篇 电子科学与技术（可...
- 64 篇 安全科学与工程
- 58 篇 建筑学
- 58 篇 化学工程与技术
- 52 篇 土木工程
- 52 篇 交通运输工程
- 40 篇 力学（可授工学、理...
2,066 篇 理学
- 1,382 篇 物理学
- 1,198 篇 数学
- 420 篇 统计学（可授理学、...
- 238 篇 生物学
- 55 篇 化学
- 36 篇 系统科学
266 篇 管理学
- 182 篇 图书情报与档案管...
- 92 篇 管理科学与工程(可...
- 47 篇 工商管理
223 篇 医学
- 222 篇 临床医学
- 39 篇 基础医学(可授医学...
205 篇 艺术学
- 205 篇 设计学（可授艺术学...
45 篇 法学
- 43 篇 社会学
21 篇 农学
14 篇 教育学
9 篇 经济学
6 篇 军事学

主题

3,414 篇 computer vision
1,216 篇 pattern recognit...
946 篇 cameras
908 篇 conferences
765 篇 computer science
674 篇 image segmentati...
618 篇 layout
598 篇 training
548 篇 shape
518 篇 robustness
451 篇 feature extracti...
448 篇 humans
445 篇 face recognition
405 篇 computational mo...
402 篇 object detection
365 篇 visualization
356 篇 computer archite...
336 篇 application soft...
304 篇 lighting
257 篇 image reconstruc...

机构

41 篇 microsoft resear...
30 篇 department of co...
25 篇 department of co...
23 篇 institute for co...
22 篇 department of co...
22 篇 school of comput...
20 篇 university of sc...
20 篇 swiss fed inst t...
19 篇 tsinghua univers...
19 篇 institute of com...
18 篇 swiss fed inst t...
17 篇 the robotics ins...
17 篇 carnegie mellon ...
17 篇 computer vision ...
17 篇 department of co...
16 篇 institute of inf...
16 篇 school of comput...
15 篇 school of comput...
15 篇 carnegie mellon ...
14 篇 national laborat...

作者

57 篇 timofte radu
25 篇 huang thomas s.
24 篇 van gool luc
23 篇 s.k. nayar
22 篇 nayar shree k.
22 篇 t. kanade
21 篇 jain anil k.
20 篇 luc van gool
19 篇 t.s. huang
18 篇 xiaoou tang
18 篇 murino vittorio
18 篇 horst bischof
17 篇 a.k. jain
17 篇 t. darrell
16 篇 g. healey
16 篇 bowyer kevin w.
16 篇 bischof horst
15 篇 m.j. black
15 篇 li stan z.
15 篇 m. shah

语言

8,904 篇 英文
53 篇 其他
8 篇 中文
1 篇 土耳其文

检索条件"任意字段=IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops"

共 8966 条记录，以下是971-980 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

NTIRE 2024 Challenge on Stereo Image Super-Resolution: Methods and Results

NTIRE 2024 Challenge on Stereo Image Super-Resolution: Metho...

引用

ieee computer society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Longguang Wang Yulan Guo Juncheng Li Hongda Liu Yang Zhao Yingqian Wang Zhi Jin Shuhang Gu Radu Timofte

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

This paper summarizes the 3rd NTIRE challenge on stereo image super-resolution (SR) with a focus on new solutions and results. The task of this challenge is to super-resolve a low-resolution stereo image pair to a high-resolution one with a magnification factor of × 4 under a limited computational budget. Compared with single image SR, the major challenge of this challenge lies in how to exploit additional information in another viewpoint and how to maintain stereo consistency in the results. This challenge has 2 tracks, including one track on bicubic degradation and one track on real degradations. In total, 108 and 70 participants were successfully registered for each track, respectively. In the test phase, 14 and 13 teams successfully submitted valid results with PSNR (RGB) scores better than the baseline. This challenge establishes a new benchmark for stereo image SR.

关键词： Degradation computer vision conferences Superresolution Benchmark testing pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Spike timing-based unsupervised learning of orientation, disparity, and motion representations in a spiking neural network

Spike timing-based unsupervised learning of orientation, dis...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Barbier, Thomas Teuliere, Celine Triesch, Jochen Univ Clermont Auvergne Inst Pascal SIGMA Clermont CNRS F-63000 Clermont Ferrand France Frankfurt Inst Adv Studies Frankfurt Germany

ISBN: (纸本)9781665448994

Neuromorphic vision sensors present unique advantages over their frame based counterparts. However, unsupervised learning of efficient visual representations from their asynchronous output is still a challenge, requiring a rethinking of traditional image and video processing methods. Here we present a network of leaky integrate and fire neurons that learns representations similar to those of simple and complex cells in the primary visual cortex of mammals from the input of two event-based vision sensors. Through the combination of spike timing-dependent plasticity and homeostatic mechanisms, the network learns visual feature detectors for orientation, disparity, and motion in a fully unsupervised fashion. We validate our approach on a mobile robotic platform.

关键词： Visualization Neuromorphics conferences Neurons Detectors vision sensors Robot sensing systems

来源：评论

学校读者我要写书评

暂无评论

Towards Quantitative Evaluation Metrics for Image Editing Approaches

Towards Quantitative Evaluation Metrics for Image Editing Ap...

引用

ieee computer society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Dana Cohen Hochberg Oron Anschel Alon Shoshan Igor Kviatkovsky Manoj Aggarwal Gérard Medioni Amazon

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

In the rapidly evolving field of Generative AI, this work takes initial steps towards establishing a systematic approach for comparing image editing methods. Currently, there is a lack of quantitative metrics for evaluating image editing tasks, with new methods being evaluated mostly qualitatively. Our methodology involves three key components: 1) The creation of a large synthetic dataset using GAN-Control, which enables the generation of ground-truth images for consistent edits across different facial identities; 2) A matching procedure that pairs the edited images with their corresponding ground-truth; and 3) Application of the Perceptual Distance metric to matched pairs. We assessed the effectiveness of our proposed framework through a user study and a set of simulation experiments. Our results indicate that our approach can rank image-editing methods in a way that aligns with human judgment. This research seeks to lay the foundation for a comprehensive evaluation framework for image editing techniques in subsequent studies, initiating a dialogue on this topic.

关键词： Measurement computer vision Systematics Generative AI conferences pattern recognition Synthetic data

来源：评论

学校读者我要写书评

暂无评论

Context-aware Video Anomaly Detection in Long-Term Datasets

Context-aware Video Anomaly Detection in Long-Term Datasets

引用

ieee computer society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Zhengye Yang Richard J. Radke Department of ECSE Rensselaer Polytechnic Institute Troy NY USA

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Video anomaly detection research is generally evaluated on short, isolated benchmark videos only a few minutes long. However, in real-world environments, security cameras observe the same scene for months or years at a time, and the notion of anomalous behavior critically depends on context, such as the time of day, day of week, or schedule of events. Here, we propose a context-aware video anomaly detection algorithm, Trinity, specifically targeted to these scenarios. Trinity is especially well-suited to crowded scenes in which individuals cannot be easily tracked, and anomalies are due to speed, direction, or absence of group motion. Trinity is a contrastive learning framework that aims to learn alignments between context, appearance, and motion, and uses alignment quality to classify videos as normal or anomalous. We evaluate our algorithm on both conventional benchmarks and a public webcam-based dataset we collected that spans more than three months of activity.

关键词： Schedules computer vision Target tracking conferences Contrastive learning Benchmark testing pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Multi-scale Attention-Based Inclination Angles Estimation for Panoramic Camera

Multi-scale Attention-Based Inclination Angles Estimation fo...

引用

ieee computer society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Yuhao Shan Heyu Chen Jiaying Zhang Shigang Li Jianfeng Li Southwest University Chongqing China Hiroshima City University Hiroshima Japan

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Images taken by panoramic cameras in the upright posture can give viewers a better sense and make the downstream panoramic image-based computer vision tasks easier. To estimate the inclination angles of panoramic camera, we proposed a simple but elegant panoramic image-based network, which combines the advantages of geometry-based and deep-learning-based methods. First, a backbone network with five down-sampling layers is designed to focus on the local distortion features. Then, since non-upright panoramic images have highly uniform geometric distortion for the same camera inclination angles, a multi-scale attention module is proposed for the first time, which can weigh each pixel on the feature maps of the backbone network and allows the network to focus on the global and shallow geometric features. Moreover, apart from angle loss, pixel-level image loss is introduced in our network for the inclination angles estimation task to allow the network to compensate for pixel deviations during training. The experiments show that our method overcomes other leading state-of-the-art methods in this field.

关键词： Training computer vision conferences Estimation Cameras Distortion pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Unveiling the Anomalies in an Ever-Changing World: A Benchmark for Pixel-Level Anomaly Detection in Continual Learning

Unveiling the Anomalies in an Ever-Changing World: A Benchma...

引用

ieee computer society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Nikola Bugarin Jovana Bugaric Manuel Barusco Davide Dalle Pezze Gian Antonio Susto University of Padova

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Anomaly Detection is a relevant problem in numerous real-world applications, especially when dealing with images. However, little attention has been paid to the issue of changes over time in the input data distribution, which may cause a significant decrease in performance. In this study, we investigate the problem of Pixel-Level Anomaly Detection in the Continual Learning setting, where new data arrives over time and the goal is to perform well on new and old data. We implement several state-of-the-art techniques to solve the Anomaly Detection problem in the classic setting and adapt them to work in the Continual Learning setting. To validate the approaches, we use a real-world dataset of images with pixel-based anomalies to provide a reliable benchmark and serve as a foundation for further advancements in the field. We provide a comprehensive analysis, discussing which Anomaly Detection methods and which families of approaches seem more suitable for the Continual Learning setting.

关键词： Continuing education computer vision conferences Benchmark testing pattern recognition Reliability Anomaly detection

来源：评论

学校读者我要写书评

暂无评论

SPECTRE: Visual Speech-Informed Perceptual 3D Facial Expression Reconstruction from Videos

SPECTRE: Visual Speech-Informed Perceptual 3D Facial Express...

引用

2023 ieee/CVF conference on computer vision and pattern recognition workshops, CVPRW 2023

作者： Filntisis, Panagiotis P. Retsinas, George Paraperas-Papantoniou, Foivos Katsamanis, Athanasios Roussos, Anastasios Maragos, Petros Athena Research Center Institute of Robotics Maroussi15125 Greece National Technical University of Athens School of Electrical & Computer Engineering Greece Greece University of Exeter College of Engineering Mathematics and Physical Sciences United Kingdom Imperial College London United Kingdom Institute for Language and Speech Processing Athena R.C. Greece

ISBN: (纸本)9798350302493

The recent state of the art on monocular 3D face reconstruction from image data has made some impressive advancements, thanks to the advent of Deep Learning. However, it has mostly focused on input coming from a single RGB image, overlooking the following important factors: a) Nowadays, the vast majority of facial image data of interest do not originate from single images but rather from videos, which contain rich dynamic information. b) Furthermore, these videos typically capture individuals in some form of verbal communication (public talks, teleconferences, audiovisual human-computer interactions, interviews, monologues/dialogues in movies, etc). When existing 3D face reconstruction methods are applied in such videos, the artifacts in the reconstruction of the shape and motion of the mouth area are often severe, since they do not match well with the speech *** overcome the aforementioned limitations, we present the first method for visual speech-informed perceptual reconstruction of 3D mouth expressions. We do this by proposing a "lipreading"loss, which guides the fitting process so that the elicited perception from the 3D reconstructed talking head resembles that of the original video footage. We demonstrate that, interestingly, the lipreading loss is better suited for 3D reconstruction of mouth movements compared to traditional landmark losses, and even direct 3D supervision. Furthermore, the devised method does not rely on any text transcriptions or corresponding audio, rendering it ideal for training in unlabeled datasets. We verify the efficiency of our method through objective evaluations on three large-scale datasets, as well as subjective evaluation with two web-based user studies. Project webpage: https://***/spectre/ © 2023 ieee.

关键词： Human computer interaction

来源：评论

学校读者我要写书评

暂无评论

Benchmarking Zero-Shot recognition with vision-Language Models: Challenges on Granularity and Specificity

Benchmarking Zero-Shot Recognition with Vision-Language Mode...

引用

ieee computer society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Zhenlin Xu Yi Zhu Siqi Deng Abhay Mittal Yanbei Chen Manchen Wang Paolo Favaro Joseph Tighe Davide Modolo AWS AI Labs Boson AI Meta

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

This paper presents novel benchmarks for evaluating vision-language models (VLMs) in zero-shot recognition, focusing on granularity and specificity. Although VLMs excel in tasks like image captioning, they face challenges in open-world settings. Our benchmarks test VLMs’ consistency in understanding concepts across semantic granularity levels and their response to varying text specificity. Findings show that VLMs favor moderately fine-grained concepts and struggle with specificity, often misjudging texts that differ from their training data. Extensive evaluations reveal limitations in current VLMs, particularly in distinguishing between correct and subtly incorrect descriptions. While fine-tuning offers some improvements, it doesn’t fully address these issues, highlighting the need for VLMs with enhanced generalization capabilities for real-world applications. This study provides insights into VLM limitations and suggests directions for developing more robust models.

关键词： computer vision Computational modeling Face recognition conferences Semantics Training data Focusing

来源：评论

学校读者我要写书评

暂无评论

PLM: Partial Label Masking for Imbalanced Multi-label Classification

PLM: Partial Label Masking for Imbalanced Multi-label Classi...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Duarte, Kevin Rawat, Yogesh Shah, Mubarak Univ Cent Florida Ctr Res Comp Vis Orlando FL 32816 USA

ISBN: (纸本)9781665448994

Neural networks trained on real-world datasets with long-tailed label distributions are biased towards frequent classes and perform poorly on infrequent classes. The imbalance in the ratio of positive and negative samples for each class skews network output probabilities further from ground-truth distributions. We propose a method, Partial Label Masking (PLM), which utilizes this ratio during training. By stochastically masking labels during loss computation, the method balances this ratio for each class, leading to improved recall on minority classes and improved precision on frequent classes. The ratio is estimated adaptively based on the network's performance by minimizing the KL divergence between predicted and ground-truth distributions. Whereas most existing approaches addressing data imbalance are mainly focused on single-label classification and do not generalize well to the multi-label case, this work proposes a general approach to solve the long-tail data imbalance issue for multi-label classification. PLM is versatile: it can be applied to most objective functions and it can be used alongside other strategies for class imbalance. Our method achieves strong performance when compared to existing methods on both multi-label (MultiMNIST and MSCOCO) and single-label (imbalanced CIFAR-10 and CIFAR-100) image classification datasets.

关键词： Training computer vision conferences Neural networks Linear programming pattern recognition Classification algorithms

来源：评论

学校读者我要写书评

暂无评论

Self-Supervised Multi-Task Pretraining Improves Image Aesthetic Assessment

Self-Supervised Multi-Task Pretraining Improves Image Aesthe...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Pfister, Jan Kobs, Konstantin Hotho, Andreas Univ Wurzburg Wurzburg Germany

ISBN: (纸本)9781665448994

Neural networks for Image Aesthetic Assessment are usually initialized with weights of pretrained ImageNet models and then trained using a labeled image aesthetics dataset. We argue that the ImageNet classification task is not well-suited for pretraining, since content based classification is designed to make the model invariant to features that strongly influence the image's aesthetics, e.g. style-based features such as brightness or contrast. We propose to use self-supervised aesthetic-aware pretext tasks that let the network learn aesthetically relevant features, based on the observation that distorting aesthetic images with image filters usually reduces their appeal. To ensure that images are not accidentally improved when filters are applied, we introduce a large dataset comprised of highly aesthetic images as the starting point for the distortions. The network is then trained to rank less distorted images higher than their more distorted counterparts. To exploit effects of multiple different objectives, we also embed this task into a multi-task setting by adding either a self-supervised classification or regression task. In our experiments, we show that our pretraining improves performance over the ImageNet initialization and reduces the number of epochs until convergence by up to 47 %. Additionally, we can match the performance of an ImageNet-initialized model while reducing the labeled training data by 20 %. We make our code, data, and pretrained models available.

关键词： computer vision conferences Computational modeling Neural networks Training data Distortion Data models

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 94 95 96 97 98 99 100 101 102 103 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：