ISBN (digital): 9798350365474
ISBN (print): 9798350365481
This paper introduces a framework for Audio Provenance Analysis, addressing the complex challenge of analyzing heterogeneous sets of audio items without requiring any prior knowledge of their content. Our framework applies a novel approach that combines partial audio matching and phylogeny techniques. It constructs directed acyclic graphs to capture the origins and evolution of content within near-duplicate audio clusters, identifying the least altered versions and tracing the reuse of content within these clusters. The approach is evaluated for two selected application scenarios, demonstrating that it can accurately determine the direction of content reuse and identify parent-child relationships, while also offering a dedicated dataset for benchmarking future research in this area.
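As a rough illustration of the kind of graph construction described above (this is a generic sketch, not the paper's algorithm): given a hypothetical asymmetric dissimilarity matrix d[i][j] for one near-duplicate cluster, where a low d[i][j] suggests item j was derived from item i, each item can be attached to its cheapest plausible parent while keeping the result acyclic.

```python
# Minimal, hypothetical sketch: derive a provenance forest for one near-duplicate
# audio cluster from an assumed asymmetric dissimilarity matrix d[i][j].
from typing import List

def build_provenance_graph(d: List[List[float]], threshold: float) -> List[int]:
    """Return parent[j] for every item j (-1 marks a root / least-altered candidate)."""
    n = len(d)
    parent = [-1] * n

    def would_create_cycle(child: int, cand: int) -> bool:
        # Each node has at most one parent, so walking up the chain is enough.
        node = cand
        while node != -1:
            if node == child:
                return True
            node = parent[node]
        return False

    for j in range(n):
        # Try candidate parents from cheapest to most expensive derivation cost.
        for cost, i in sorted((d[i][j], i) for i in range(n) if i != j):
            if cost >= threshold:
                break  # no plausible parent: j stays a root
            if not would_create_cycle(j, i):
                parent[j] = i
                break
    return parent
```

The threshold plays the role of the decision boundary between "derived from another item in the cluster" and "least altered version"; the actual framework combines this with partial audio matching scores.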
ISBN (print): 9781665448994
Motivated by applications from computer vision to bioinformatics, the field of shape analysis deals with problems where one wants to analyze geometric objects, such as curves, while ignoring actions that preserve their shape, such as translations, rotations, scalings, or reparametrizations. Mathematical tools have been developed to define notions of distances, averages, and optimal deformations for geometric objects. One such framework, which has proven to be successful in many applications, is based on the square root velocity (SRV) transform, which allows one to define a computable distance between spatial curves regardless of how they are parametrized. This paper introduces a supervised deep learning framework for the direct computation of SRV distances between curves, which usually requires an optimization over the group of reparametrizations that act on the curves. The benefits of our approach in terms of computational speed and accuracy are illustrated via several numerical experiments on both synthetic and real data.
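For context, a minimal numerical sketch of the SRV transform q(t) = c'(t)/sqrt(|c'(t)|) and the unaligned L2 distance between two sampled curves is shown below; the optimization over reparametrizations, which the paper's network learns to bypass, is deliberately omitted, and the sampling details are assumptions.

```python
# Sketch: discrete SRV transform and the unaligned L2 distance between two curves.
import numpy as np

def srv_transform(curve: np.ndarray) -> np.ndarray:
    """curve: (T, d) samples on a uniform grid -> SRV q(t) = c'(t) / sqrt(|c'(t)|)."""
    dt = 1.0 / (len(curve) - 1)
    velocity = np.gradient(curve, dt, axis=0)                 # finite-difference derivative
    speed = np.linalg.norm(velocity, axis=1, keepdims=True)
    return velocity / np.sqrt(np.maximum(speed, 1e-8))

def srv_l2_distance(c1: np.ndarray, c2: np.ndarray) -> float:
    """L2 distance between SRV representations, without reparametrization alignment."""
    q1, q2 = srv_transform(c1), srv_transform(c2)
    dt = 1.0 / (len(c1) - 1)
    return float(np.sqrt(np.sum((q1 - q2) ** 2) * dt))

# Example: a quarter circle vs. the same shape traced with a different speed profile.
t = np.linspace(0.0, 1.0, 200)
circle = np.stack([np.cos(0.5 * np.pi * t), np.sin(0.5 * np.pi * t)], axis=1)
warped = np.stack([np.cos(0.5 * np.pi * t**2), np.sin(0.5 * np.pi * t**2)], axis=1)
print(srv_l2_distance(circle, warped))  # nonzero; minimizing over reparametrizations would shrink it
```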
ISBN (print): 9781665448994
Traditional computer vision models are trained to predict a fixed set of predefined categories. Recently, natural language has been shown to be a broader and richer source of supervision that provides finer descriptions of visual concepts than supervised "gold" labels. Previous works, such as CLIP, use a simple pretraining task of predicting the pairings between images and text captions. CLIP, however, is data hungry and requires more than 400M image-text pairs for training. We propose a data-efficient contrastive distillation method that uses soft labels to learn from noisy image-text pairs. Our model transfers knowledge from pre-trained image and sentence encoders and achieves strong performance with only 3M image-text pairs, 133x smaller than CLIP. Our method exceeds the previous SoTA for general zero-shot learning on ImageNet 21k+1k by a relative 73% with a ResNet50 image encoder and DeCLUTR text encoder. We also outperform CLIP by a relative 10.5% on zero-shot evaluation on Google Open Images (19,958 classes).
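A hedged sketch of the soft-label contrastive distillation idea follows (names, shapes, and temperatures are assumptions, not the authors' code): the student's image-text similarity distribution is pushed toward soft targets produced by the frozen pre-trained encoders, rather than toward a one-hot diagonal.

```python
import torch
import torch.nn.functional as F

def soft_contrastive_distillation_loss(student_img, student_txt,
                                        teacher_img, teacher_txt,
                                        tau_s: float = 0.07, tau_t: float = 0.1):
    """All inputs are (B, D) embeddings; rows with the same index form a (possibly noisy) pair."""
    s_logits = F.normalize(student_img, dim=-1) @ F.normalize(student_txt, dim=-1).T / tau_s
    with torch.no_grad():
        t_logits = F.normalize(teacher_img, dim=-1) @ F.normalize(teacher_txt, dim=-1).T / tau_t
        soft_targets = t_logits.softmax(dim=-1)   # soft labels instead of a one-hot diagonal
    # Cross-entropy of student rows against the teacher distribution (image -> text);
    # a symmetric text -> image term would typically be added as well.
    return -(soft_targets * s_logits.log_softmax(dim=-1)).sum(dim=-1).mean()

# usage sketch with random embeddings
B, D = 8, 256
loss = soft_contrastive_distillation_loss(torch.randn(B, D), torch.randn(B, D),
                                           torch.randn(B, D), torch.randn(B, D))
```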
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
We aim to provide a comprehensive view of the inference efficiency of DETR-style detection models. We explore the effect of basic efficiency techniques and identify the factors that are easy to implement yet effectively improve the efficiency-accuracy trade-off. Specifically, we investigate the effect of input resolution, multi-scale feature enhancement, and backbone pre-training. Our experiments show that 1) adjusting the input resolution is a simple yet effective way to achieve a better efficiency-accuracy trade-off, 2) multi-scale feature enhancement can be lightened with only a marginal decrease in accuracy, and 3) improved backbone pre-training can further improve the trade-off.
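To make the input-resolution knob concrete, here is a small, generic measurement sketch (model and image names are placeholders, not the paper's code): sweep the shorter-side resolution, record per-image latency, and pair each latency with the corresponding accuracy to chart the trade-off.

```python
import time
import torch
import torch.nn.functional as F

@torch.no_grad()
def latency_at_resolution(model, images, short_side: int, device: str = "cpu"):
    """images: list of (3, H, W) tensors; returns mean per-image latency in milliseconds."""
    model.eval().to(device)
    total = 0.0
    for img in images:
        scale = short_side / min(img.shape[-2:])
        resized = F.interpolate(img[None], scale_factor=scale,
                                mode="bilinear", align_corners=False).to(device)
        start = time.perf_counter()
        model(resized)                       # DETR-style forward pass
        if device == "cuda":
            torch.cuda.synchronize()
        total += time.perf_counter() - start
    return 1000.0 * total / len(images)

# e.g. compare 480 / 640 / 800 shorter-side inputs on the same validation images,
# then pair each latency with the corresponding AP to plot the trade-off curve.
```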
ISBN (digital): 9798350365474
ISBN (print): 9798350365481
Irrigation systems can vary widely in scale, from small-scale subsistence farming to large commercial agriculture (see Fig. 1). The heterogeneity in irrigation practices and systems across different regions adds to the complexity of mapping. Distinguishing between irrigated and non-irrigated areas is challenging because the spectral characteristics of irrigation systems and practices vary across regions, further complicating the task of mapping different types of irrigation. For example, rainfed agriculture is prevalent in the Midwest, Southeast, and parts of the Northeast U.S., while irrigation is common in arid Western and Southwestern states. Rainfed farming can result in highly variable patterns of cultivation. Farmers may practice rainfed agriculture in some fields while irrigating others, leading to a complex mosaic of irrigated and non-irrigated areas within the same region.
ISBN (print): 9781665448994
3D scene flow estimation is a vital tool for perceiving our environment with depth or range sensors. Unlike optical flow, the data is usually sparse and in most cases partially occluded between two temporal samplings. Here we propose a new scene flow architecture called OGSF-Net, which tightly couples the learning of both flow and occlusions between frames. Their coupled symbiosis results in a more accurate prediction of flow in space. Unlike a traditional multi-action network, our unified approach is fused throughout the network, boosting performance for both occlusion detection and flow estimation. Our architecture is the first to gauge occlusion in 3D scene flow estimation on point clouds. On key datasets such as FlyingThings3D and KITTI, we achieve state-of-the-art results.
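A simplified sketch of one way flow and occlusion can be coupled in training (this is an illustration under assumed shapes, not the OGSF-Net architecture): a per-point occlusion probability gates the data term of the flow loss, so points with no correspondence in the second frame do not dominate training.

```python
import torch

def occlusion_gated_flow_loss(pred_flow, gt_flow, occ_logits, gt_occ, alpha: float = 0.5):
    """pred_flow, gt_flow: (B, N, 3); occ_logits, gt_occ: (B, N) floats, 1.0 = occluded."""
    occ_prob = torch.sigmoid(occ_logits)
    visible = 1.0 - gt_occ                                     # supervise flow only where visible
    flow_err = (pred_flow - gt_flow).norm(dim=-1)              # per-point end-point error
    flow_loss = (visible * flow_err).sum() / visible.sum().clamp(min=1.0)
    occ_loss = torch.nn.functional.binary_cross_entropy(occ_prob, gt_occ)
    return flow_loss + alpha * occ_loss
```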
ISBN (print): 9781665448994
This paper proposes an attention-based multi-level model with a multi-scale backbone for thermal image super-resolution. The thermal image dataset is provided by PBVS 2020 in their thermal image super-resolution challenge. This dataset contains images at three different resolution scales (low, medium, high) [1]; however, only the medium- and high-resolution images are used to train the proposed architecture to generate super-resolution images at x2 and x4 scales. The proposed architecture uses Res2Net blocks as the backbone of the network. Along with this, a coordinate convolution layer and dual attention are also used in the architecture. Further, multi-level supervision is implemented so that each block's output is supervised for similarity with the real image during training. To test the robustness of the proposed model, we evaluated it on the Thermal-6 dataset [20]. The results show that our model achieves state-of-the-art results on the PBVS dataset, and the results on the Thermal-6 dataset show that the model has decent generalization capacity.
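For readers unfamiliar with the coordinate convolution layer mentioned above, a minimal sketch of the general idea follows (channel sizes here are illustrative, not the paper's configuration): two normalized coordinate channels are concatenated to the input before an ordinary convolution, giving the filters explicit positional information.

```python
import torch
import torch.nn as nn

class CoordConv2d(nn.Module):
    def __init__(self, in_channels: int, out_channels: int, kernel_size: int = 3, padding: int = 1):
        super().__init__()
        self.conv = nn.Conv2d(in_channels + 2, out_channels, kernel_size, padding=padding)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, _, h, w = x.shape
        ys = torch.linspace(-1.0, 1.0, h, device=x.device).view(1, 1, h, 1).expand(b, 1, h, w)
        xs = torch.linspace(-1.0, 1.0, w, device=x.device).view(1, 1, 1, w).expand(b, 1, h, w)
        return self.conv(torch.cat([x, ys, xs], dim=1))

# usage sketch: a CoordConv block on a single-channel thermal input
layer = CoordConv2d(1, 64)
out = layer(torch.randn(2, 1, 64, 64))   # -> (2, 64, 64, 64)
```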
ISBN (digital): 9781665469463
ISBN (print): 9781665469463
The objective of this paper is few-shot object detection (FSOD) - the task of expanding an object detector to a new category given only a few instances for training. We introduce a simple pseudo-labelling method to source high-quality pseudo-annotations from the training set for each new category, vastly increasing the number of training instances and reducing class imbalance; our method finds previously unlabelled instances. Naively training with model predictions yields suboptimal performance; we present two novel methods to improve the precision of the pseudo-labelling process: first, we introduce a verification technique to remove candidate detections with incorrect class labels; second, we train a specialised model to correct poor-quality bounding boxes. After these two novel steps, we obtain a large set of high-quality pseudo-annotations that allow our final detector to be trained end-to-end. Additionally, we demonstrate that our method maintains base-class performance, and the utility of simple augmentations in FSOD. When benchmarked on PASCAL VOC and MS-COCO, our method achieves state-of-the-art or second-best performance compared to existing approaches across all numbers of shots.
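A hedged sketch of the pseudo-labelling loop described above is given below; the function names and thresholds are placeholders, not the authors' code, but the two precision-improving steps (label verification, then box correction) follow the order stated in the abstract.

```python
def mine_pseudo_annotations(detector, verifier, box_refiner, images,
                            novel_classes, score_thresh=0.8, verify_thresh=0.5):
    """Collect high-quality pseudo-annotations for novel classes from unlabelled training images."""
    pseudo_annotations = []
    for image in images:
        for box, label, score in detector(image):            # initial candidate detections
            if label not in novel_classes or score < score_thresh:
                continue
            # Step 1: drop candidates whose class label the verification model does not confirm.
            if verifier(image, box, label) < verify_thresh:
                continue
            # Step 2: correct poor-quality boxes with the specialised box-regression model.
            refined_box = box_refiner(image, box)
            pseudo_annotations.append((image, refined_box, label))
    return pseudo_annotations
```

The resulting pseudo-annotations are then merged with the few ground-truth instances so the final detector can be trained end-to-end.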
ISBN (print): 9781665448994
In this paper, we propose an image quality transformer (IQT) that successfully applies a transformer architecture to a perceptual full-reference image quality assessment (IQA) task. Perceptual representations have become increasingly important in image quality assessment. In this context, we extract perceptual feature representations from each input image using a convolutional neural network (CNN) backbone. The extracted feature maps are fed into the transformer encoder and decoder in order to compare the reference and distorted images. Following the approach of transformer-based vision models [18, 55], we use an extra learnable quality embedding and position embedding. The output of the transformer is passed to a prediction head in order to predict a final quality score. The experimental results show that our proposed model achieves outstanding performance on the standard IQA datasets. On a large-scale IQA dataset containing output images of generative models, our model also shows promising results. The proposed IQT was ranked first among 13 participants in the NTIRE 2021 perceptual image quality assessment challenge [23]. Our work will be an opportunity to further expand the approach for the perceptual IQA task.
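To illustrate the overall pipeline shape (CNN features from both images, a learnable quality token, a transformer, and a scalar head), here is a heavily simplified, encoder-only sketch; the backbone, dimensions, and the feature-difference comparison are assumptions and differ from the actual IQT encoder-decoder design.

```python
import torch
import torch.nn as nn
import torchvision

class TinyIQT(nn.Module):
    def __init__(self, dim: int = 256, heads: int = 4, layers: int = 2):
        super().__init__()
        backbone = torchvision.models.resnet18(weights=None)
        self.cnn = nn.Sequential(*list(backbone.children())[:-2])   # (B, 512, h, w) feature maps
        self.proj = nn.Conv2d(512, dim, kernel_size=1)
        self.quality_token = nn.Parameter(torch.zeros(1, 1, dim))   # extra learnable quality embedding
        self.encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model=dim, nhead=heads, batch_first=True), layers)
        self.head = nn.Linear(dim, 1)                                # prediction head -> quality score

    def tokens(self, img):
        feat = self.proj(self.cnn(img))                              # (B, dim, h, w)
        return feat.flatten(2).transpose(1, 2)                       # (B, h*w, dim)

    def forward(self, reference, distorted):
        # Compare the two images through a difference of their token sequences.
        tok = self.tokens(reference) - self.tokens(distorted)
        tok = torch.cat([self.quality_token.expand(tok.size(0), -1, -1), tok], dim=1)
        return self.head(self.encoder(tok)[:, 0])                    # scalar score per image pair

score = TinyIQT()(torch.randn(1, 3, 224, 224), torch.randn(1, 3, 224, 224))
```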
ISBN (print): 9781665448994
Humans are arguably among the most important subjects in video streams, and many real-world applications such as video summarization or video editing workflows often require the automatic search and retrieval of a person of interest. Despite tremendous efforts in the person re-identification and retrieval domains, few works have developed audiovisual search strategies. In this paper, we present the Audiovisual Person Search dataset (APES), a new dataset composed of untrimmed videos whose audio (voices) and visual (faces) streams are densely annotated. APES contains over 1.9K identities labeled across 36 hours of video, making it the largest dataset available for untrimmed audiovisual person search. A key property of APES is that it includes dense temporal annotations that link faces to speech segments of the same identity. To showcase the potential of our new dataset, we propose an audiovisual baseline and benchmark for person retrieval. Our study shows that modeling audiovisual cues benefits the recognition of people's identities.
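A hypothetical sketch of an audiovisual retrieval baseline in the spirit described above (the encoders and fusion weighting are assumptions, not the APES baseline itself): face and voice embeddings for a segment are fused into a single identity embedding, and gallery items are ranked by cosine similarity to the query.

```python
import numpy as np
from typing import List

def fuse(face_emb: np.ndarray, voice_emb: np.ndarray, w_face: float = 0.6) -> np.ndarray:
    """Combine L2-normalized face and voice embeddings into one identity embedding."""
    face = face_emb / (np.linalg.norm(face_emb) + 1e-8)
    voice = voice_emb / (np.linalg.norm(voice_emb) + 1e-8)
    fused = w_face * face + (1.0 - w_face) * voice
    return fused / (np.linalg.norm(fused) + 1e-8)

def rank_gallery(query: np.ndarray, gallery: List[np.ndarray]) -> List[int]:
    """Return gallery indices sorted by cosine similarity to the fused query embedding."""
    sims = [float(query @ g) for g in gallery]
    return sorted(range(len(gallery)), key=lambda i: sims[i], reverse=True)
```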