ISBN (Print): 9781665448994
Head pose estimation is an important task in many real-world applications. Since facial landmarks usually serve as a common input shared by multiple downstream tasks, utilizing landmarks to acquire high-precision head pose estimation is of practical value for many real-world applications. However, existing landmark-based methods have a major drawback in expressive power, making it hard for them to achieve performance comparable to landmark-free methods. In this paper, we propose a strong baseline method which views head pose estimation as a graph regression problem. We construct a landmark-connection graph, and propose to leverage Graph Convolutional Networks (GCN) to model the complex nonlinear mappings between the graph topologies and the head pose angles. Specifically, we design a novel GCN architecture which utilizes a joint Edge-Vertex Attention (EVA) mechanism to cope with unstable landmark detection. Moreover, we introduce Adaptive Channel Attention (ACA) and a Densely-Connected Architecture (DCA) to further boost performance. We evaluate the proposed method on three challenging benchmark datasets. Experimental results demonstrate that our method achieves better performance in comparison with state-of-the-art landmark-based and landmark-free methods.
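A minimal sketch of the graph-regression idea described above, assuming a PyTorch setting (this is not the authors' EVA/ACA/DCA architecture; the attention readout merely stands in for the role of down-weighting unreliable landmarks):

import torch
import torch.nn as nn

class GCNLayer(nn.Module):
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.lin = nn.Linear(in_dim, out_dim)

    def forward(self, x, adj):
        # x: (B, N, in_dim) landmark features; adj: (N, N) normalized adjacency
        return torch.relu(self.lin(adj @ x))

class LandmarkPoseGCN(nn.Module):
    def __init__(self, num_landmarks=68, hidden=64):
        super().__init__()
        self.gcn1 = GCNLayer(2, hidden)          # input: (x, y) per landmark
        self.gcn2 = GCNLayer(hidden, hidden)
        self.vertex_attn = nn.Linear(hidden, 1)  # learns to down-weight noisy landmarks
        self.head = nn.Linear(hidden, 3)         # yaw, pitch, roll

    def forward(self, landmarks, adj):
        h = self.gcn2(self.gcn1(landmarks, adj), adj)
        w = torch.softmax(self.vertex_attn(h), dim=1)  # per-vertex attention weights
        pooled = (w * h).sum(dim=1)                    # attention-weighted graph readout
        return self.head(pooled)

Here adj would be the row-normalized adjacency of the landmark-connection graph and landmarks a (batch, 68, 2) tensor of detected points; both are assumptions made only for illustration.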
ISBN (Print): 9781665448994
Image compression is a method to remove spatial redundancy between adjacent pixels and reconstruct a high-quality image. In the past few years, deep learning has gained huge attention from the research community and produced promising image reconstruction results. Consequently, recent methods have focused on developing deeper and more complex networks, which significantly increases network complexity. In this paper, two effective novel blocks are developed: an analysis block and a synthesis block that employ convolution layers and Generalized Divisive Normalization (GDN) on the variable-rate encoder and decoder sides. Our network utilizes a pixel RNN approach for quantization. Furthermore, to improve the whole network, we encode a residual image using LSTM cells to reduce unnecessary information. Experimental results demonstrate that the proposed variable-rate framework with the novel blocks outperforms existing methods and standard image codecs, such as George's [11] and JPEG, in terms of image similarity. The project page along with code and models is available at https://***/khawar512/cvpr image compress
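As a rough illustration of the analysis-block idea (an assumed PyTorch sketch, not the paper's code), a strided convolution can be paired with Generalized Divisive Normalization, y_i = x_i / sqrt(beta_i + sum_j gamma_ij * x_j^2):

import torch
import torch.nn as nn

class GDN(nn.Module):
    def __init__(self, channels, eps=1e-6):
        super().__init__()
        self.beta = nn.Parameter(torch.ones(channels))
        self.gamma = nn.Parameter(0.1 * torch.eye(channels))
        self.eps = eps

    def forward(self, x):                        # x: (B, C, H, W)
        norm = torch.einsum('ij,bjhw->bihw', self.gamma, x * x)
        return x / torch.sqrt(self.beta.view(1, -1, 1, 1) + norm + self.eps)

class AnalysisBlock(nn.Module):
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size=5, stride=2, padding=2)
        self.gdn = GDN(out_ch)

    def forward(self, x):
        return self.gdn(self.conv(x))            # downsample, then normalize

A synthesis block on the decoder side would mirror this with a transposed convolution and inverse GDN; the pixel-RNN quantizer and LSTM residual coder mentioned above are not shown.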
ISBN (Print): 9781665448994
A major limitation of most state-of-the-art visual localization methods is their inability to make use of the ubiquitous signs and directions that are typically intuitive to humans. Localization methods can greatly benefit from a system capable of reasoning about a variety of cues beyond low-level features, such as street signs, store names, building directories, room numbers, etc. In this work, we tackle the problem of text detection in the wild, an essential step towards achieving text-based localization and mapping. While current state-of-the-art text detection methods employ ad-hoc solutions with complex multi-stage components to solve the problem, we propose a Transformer-based architecture inherently capable of dealing with multi-oriented text in images. A central contribution of our work is the introduction of a loss function tailored to the rotated text detection problem that leverages a rotated version of the generalized intersection over union score to properly capture rotated text regions. We evaluate our proposed model qualitatively and quantitatively on several challenging datasets, namely ICDAR15, ICDAR17, and MSRA-TD500, and show that it outperforms current state-of-the-art methods for text detection in the wild.
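A crude stand-in for the rotated-box objective, sketched in PyTorch (the paper uses a proper rotated generalized IoU, whereas this simplification applies standard axis-aligned GIoU to the box extents and handles the angle with a separate regression term):

import torch
import torch.nn.functional as F

def giou(box1, box2):
    # boxes as (x1, y1, x2, y2), shape (N, 4)
    x1 = torch.max(box1[:, 0], box2[:, 0]); y1 = torch.max(box1[:, 1], box2[:, 1])
    x2 = torch.min(box1[:, 2], box2[:, 2]); y2 = torch.min(box1[:, 3], box2[:, 3])
    inter = (x2 - x1).clamp(min=0) * (y2 - y1).clamp(min=0)
    area1 = (box1[:, 2] - box1[:, 0]) * (box1[:, 3] - box1[:, 1])
    area2 = (box2[:, 2] - box2[:, 0]) * (box2[:, 3] - box2[:, 1])
    union = area1 + area2 - inter
    # smallest enclosing axis-aligned box
    ex1 = torch.min(box1[:, 0], box2[:, 0]); ey1 = torch.min(box1[:, 1], box2[:, 1])
    ex2 = torch.max(box1[:, 2], box2[:, 2]); ey2 = torch.max(box1[:, 3], box2[:, 3])
    enclose = (ex2 - ex1) * (ey2 - ey1)
    iou = inter / union.clamp(min=1e-6)
    return iou - (enclose - union) / enclose.clamp(min=1e-6)

def rotated_text_loss(pred_box, pred_angle, gt_box, gt_angle, angle_weight=1.0):
    giou_term = (1.0 - giou(pred_box, gt_box)).mean()
    angle_term = F.smooth_l1_loss(pred_angle, gt_angle)
    return giou_term + angle_weight * angle_term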
ISBN (Print): 9781665448994
Humans can naturally understand scenes in depth with the help of accumulated knowledge and a comprehensive organization of visual concepts, including category labels and attributes at different levels. This inspires us to progressively unify professional knowledge at different levels with deep neural network architectures for scene understanding. Different from general embedding approaches, we construct different knowledge graphs for different levels of vision tasks by organizing the rich visual concepts accordingly. We employ a gated graph neural network and relational graph convolutional networks to propagate node messages for the different levels of tasks and progressively generate different levels of knowledge representation through the graphs. Compared with existing methods, our framework has an appealing property: it is a novel progressive knowledge-embedded representation learning framework that incorporates knowledge graphs of different levels into the learning of the network at the corresponding level. Extensive experiments on the widely used Broden+ dataset demonstrate the superiority of the proposed framework over existing state-of-the-art methods.
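To make the propagation step concrete, here is a minimal relational graph convolution (R-GCN) layer in PyTorch; the node features, relation types and graph structure are assumptions for illustration, not the paper's exact architecture:

import torch
import torch.nn as nn

class RGCNLayer(nn.Module):
    def __init__(self, in_dim, out_dim, num_relations):
        super().__init__()
        self.rel_weights = nn.ModuleList(
            [nn.Linear(in_dim, out_dim, bias=False) for _ in range(num_relations)])
        self.self_loop = nn.Linear(in_dim, out_dim)

    def forward(self, h, adjs):
        # h: (N, in_dim) concept-node features; adjs: one (N, N) row-normalized
        # adjacency matrix per relation type in the knowledge graph
        out = self.self_loop(h)
        for adj, lin in zip(adjs, self.rel_weights):
            out = out + adj @ lin(h)   # aggregate messages relation by relation
        return torch.relu(out)

Stacking such layers (or a gated variant) propagates messages over the knowledge graph at each level of the task hierarchy.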
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
A deep-rooted strategy for building convolutional neural networks in computer vision is to increase the number of filters every time the feature map resolution is decreased. The notion ruling this pyramidal design is that the expressivity of the network increases with a higher number of filters to compensate for losses caused by lower resolutions. This paper challenges the practice by testing a set of varied filter distributions, named filter templates, on popular CNN architectures (VGG, ResNet, MobileNet and MnasNet). The experimental results show that the superiority of the pyramidal design holds on the ImageNet dataset but fails for other datasets such as MNIST, CIFAR and TinyImageNet, and for other tasks such as audio classification. CNN models with different filter distributions deliver higher accuracy with reduced resource consumption, suggesting that the pyramidal design has been optimised for ImageNet and that each model-dataset pair benefits from tuning the number and distribution of filters. To further illustrate the benefits of exploring other distributions, this paper shows that the best performing model from the NASBench101 dataset can increase its accuracy over the original pyramidal design with reductions of parameters of up to 68 per cent by using templates. Overall, our experiments point to new opportunities for model designers to find more efficient models.
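The notion of a filter template can be illustrated with a small, assumed PyTorch helper (not the paper's code): the same VGG-style stack is built from an explicit list of per-stage filter counts instead of the usual doubling pyramid:

import torch.nn as nn

def build_cnn(filter_template, in_channels=3, num_classes=10):
    layers, c_in = [], in_channels
    for c_out in filter_template:
        layers += [nn.Conv2d(c_in, c_out, 3, padding=1), nn.ReLU(),
                   nn.MaxPool2d(2)]            # halve the resolution each stage
        c_in = c_out
    return nn.Sequential(*layers, nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                         nn.Linear(c_in, num_classes))

pyramidal = build_cnn([64, 128, 256, 512])     # filters double as resolution drops
uniform   = build_cnn([256, 256, 256, 256])    # flat template, same depth
reversed_ = build_cnn([512, 256, 128, 64])     # an alternative template to test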
ISBN (Print): 9781665448994
Most current action recognition methods heavily rely on appearance information by taking an RGB sequence of entire image regions as input. While being effective in exploiting contextual information around humans, e.g., human appearance and scene category, they are easily fooled by out-of-context action videos where the contexts do not match the target actions. In contrast, pose-based methods, which take only a sequence of human skeletons as input, suffer from inaccurate pose estimation or the inherent ambiguity of human pose. Integrating these two approaches has turned out to be non-trivial; training a model with both appearance and pose ends up with a strong bias towards appearance and does not generalize well to unseen videos. To address this problem, we propose to learn pose-driven feature integration that dynamically combines appearance and pose streams by observing pose features on the fly. The main idea is to let the pose stream decide how much and which appearance information is used in the integration, based on whether the given pose information is reliable or not. We show that the proposed IntegralAction achieves highly robust performance across in-context and out-of-context action video datasets. The code is available here.
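The pose-driven integration can be pictured with a short, assumed PyTorch sketch (feature dimensions and the class count are placeholders, not IntegralAction's actual design): the gate is computed from the pose stream alone and decides how much appearance information enters the fused feature:

import torch
import torch.nn as nn

class PoseDrivenFusion(nn.Module):
    def __init__(self, dim, num_classes=60):      # class count is a placeholder
        super().__init__()
        self.gate = nn.Sequential(nn.Linear(dim, dim), nn.Sigmoid())
        self.classifier = nn.Linear(dim, num_classes)

    def forward(self, appearance_feat, pose_feat):
        g = self.gate(pose_feat)                  # gate in [0, 1] from pose only
        fused = pose_feat + g * appearance_feat   # appearance admitted on demand
        return self.classifier(fused)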
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Pooling layers (e.g., max and average) may overlook important information encoded in the spatial arrangement of pixel intensity and/or feature values. We propose a novel lacunarity pooling layer that aims to capture the spatial heterogeneity of the feature maps by evaluating the variability within local windows. The layer operates at multiple scales, allowing the network to adaptively learn hierarchical features. The lacunarity pooling layer can be seamlessly integrated into any artificial neural network architecture. Experimental results demonstrate the layer’s effectiveness in capturing intricate spatial patterns, leading to improved feature extraction capabilities. The proposed approach holds promise in various domains, especially in agricultural image analysis tasks. This work contributes to the evolving landscape of artificial neural network architectures by introducing a novel pooling layer that enriches the representation of spatial features. Our code is publicly available.
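One plausible reading of such a layer, sketched in PyTorch (a hedged illustration, not the authors' exact formulation), pools the standard lacunarity statistic E[x^2] / E[x]^2 over each local window, a quantity that grows with spatial heterogeneity:

import torch
import torch.nn as nn
import torch.nn.functional as F

class LacunarityPool2d(nn.Module):
    def __init__(self, kernel_size=2, stride=2, eps=1e-6):
        super().__init__()
        self.kernel_size, self.stride, self.eps = kernel_size, stride, eps

    def forward(self, x):                          # x: (B, C, H, W), non-negative
        mean = F.avg_pool2d(x, self.kernel_size, self.stride)
        mean_sq = F.avg_pool2d(x * x, self.kernel_size, self.stride)
        return mean_sq / (mean * mean + self.eps)  # lacunarity of each window

Running the same module with several kernel sizes and concatenating the outputs would give a multi-scale variant in the spirit of the description above.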
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
In this paper, we propose a new dataset distillation method that balances global structure and local details when distilling the information from a large dataset into a generative model. Dataset distillation has been proposed to reduce the size of the dataset required for training models. Conventional dataset distillation methods face the problems of long redeployment time and poor cross-architecture performance. Moreover, previous methods focus too much on the high-level semantic attributes shared between the synthetic dataset and the original dataset while ignoring local features such as texture and shape. Based on the above understanding, we propose a new method for distilling the original image dataset into a generative model. Our method uses a conditional generative adversarial network to generate the distilled dataset. During distillation, we balance global structure and local details, continuously optimizing the generator to produce a more information-dense dataset.
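A small, assumed sketch of the kind of class-conditional generator such a distillation setup trains (shapes and sizes are placeholders, not the paper's model):

import torch
import torch.nn as nn

class ConditionalGenerator(nn.Module):
    def __init__(self, num_classes=10, z_dim=100, img_channels=3, img_size=32):
        super().__init__()
        self.embed = nn.Embedding(num_classes, num_classes)
        self.net = nn.Sequential(
            nn.Linear(z_dim + num_classes, 256), nn.ReLU(),
            nn.Linear(256, img_channels * img_size * img_size), nn.Tanh())
        self.shape = (img_channels, img_size, img_size)

    def forward(self, z, labels):
        h = torch.cat([z, self.embed(labels)], dim=1)   # condition on the class
        return self.net(h).view(-1, *self.shape)

# e.g. gen = ConditionalGenerator(); imgs = gen(torch.randn(8, 100), torch.randint(0, 10, (8,)))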
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
Food recognition plays a crucial role in several healthcare applications. Nevertheless, it presents significant computer vision challenges, such as long-tailed and fine-grained distributions, that hinder its progress. In this work, we propose LOFI, a Long-tailed Fine-grained Network aimed specifically at tackling these food recognition challenges by improving the feature learning capabilities of food recognition models. Specifically, we improve the vanilla R-CNN architecture by tailoring it for food recognition. First, we design an efficient multi-task framework for fine-grained food recognition, which exploits the lexical similarity of dishes during training to improve the discriminative ability of the network. Second, we include a Graph Confidence Propagation module based on graph neural networks to aggregate the information of overlapping detections and refine the final prediction of the network. Extensive analysis and ablations of the different components of LOFI highlight that it successfully addresses the targeted problems and leads to noticeable gains in performance. Remarkably, the proposed method achieves competitive results and outperforms the current state-of-the-art methods on three public food benchmarks: UECFood-256, AiCrowd Food Challenge 2022, and UECFood-100 segmented.
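One way to picture the lexical-similarity idea is a soft-target classification loss (a hedged PyTorch sketch, not the LOFI implementation; the similarity matrix over dish names is an assumed input):

import torch
import torch.nn.functional as F

def lexical_soft_targets(labels, name_similarity, smoothing=0.1):
    # labels: (B,) class ids; name_similarity: (C, C) row-normalized matrix of
    # lexical similarity between dish names
    hard = F.one_hot(labels, name_similarity.size(0)).float()
    return (1 - smoothing) * hard + smoothing * name_similarity[labels]

def lexical_similarity_loss(logits, labels, name_similarity):
    targets = lexical_soft_targets(labels, name_similarity)
    return -(targets * F.log_softmax(logits, dim=1)).sum(dim=1).mean()

Lexically related dishes thus share a little probability mass during training, which pushes the network to learn features fine-grained enough to separate them.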
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
In this paper, we introduce an approach for recognizing and classifying gestures that accompany mathematical terms, collected in a new dataset we name "GAMT". Our method uses language as a means of providing context to classify gestures. Specifically, we use a CLIP-style framework to construct a shared embedding space for gestures and language, experimenting with various methods for encoding gestures within this space. We evaluate our method on our new dataset, which contains a wide array of gestures associated with mathematical terms. The shared embedding space leads to a substantial improvement in gesture classification. Furthermore, we identify an efficient model that excels at classifying gestures from our dataset, contributing to the further development of gesture recognition in diverse interaction scenarios.
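The CLIP-style objective can be sketched as a symmetric contrastive loss over matched gesture-text pairs in a batch (an assumed PyTorch illustration; the gesture and text encoders producing the embeddings are not shown):

import torch
import torch.nn.functional as F

def clip_style_loss(gesture_emb, text_emb, temperature=0.07):
    g = F.normalize(gesture_emb, dim=1)          # (B, D) gesture embeddings
    t = F.normalize(text_emb, dim=1)             # (B, D) math-term embeddings
    logits = g @ t.t() / temperature             # similarity of every pair
    targets = torch.arange(g.size(0), device=g.device)
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))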