检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

50,636 篇 会议
1,423 册 图书
1,044 篇 期刊文献
1 篇 学位论文

馆藏范围

53,101 篇 电子文献
3 种 纸本馆藏

日期分布

学科分类号

31,927 篇 工学
- 24,897 篇 计算机科学与技术...
- 12,629 篇 软件工程
- 5,176 篇 光学工程
- 4,760 篇 电气工程
- 4,463 篇 信息与通信工程
- 4,261 篇 机械工程
- 3,980 篇 控制科学与工程
- 2,477 篇 生物工程
- 1,736 篇 生物医学工程（可授...
- 1,583 篇 仪器科学与技术
- 1,314 篇 电子科学与技术（可...
- 795 篇 化学工程与技术
- 715 篇 安全科学与工程
- 560 篇 交通运输工程
- 383 篇 建筑学
- 335 篇 土木工程
11,899 篇 理学
- 6,481 篇 物理学
- 5,426 篇 数学
- 2,765 篇 生物学
- 1,915 篇 统计学（可授理学、...
- 804 篇 化学
- 669 篇 系统科学
5,313 篇 医学
- 5,103 篇 临床医学
- 731 篇 基础医学(可授医学...
- 459 篇 药学(可授医学、理...
3,369 篇 管理学
- 1,964 篇 图书情报与档案管...
- 1,554 篇 管理科学与工程(可...
- 485 篇 工商管理
720 篇 艺术学
- 718 篇 设计学（可授艺术学...
434 篇 法学
- 406 篇 社会学
302 篇 农学
198 篇 教育学
166 篇 经济学
63 篇 文学
48 篇 军事学

主题

17,404 篇 computer vision
9,026 篇 pattern recognit...
4,196 篇 training
3,830 篇 feature extracti...
3,134 篇 cameras
2,876 篇 computational mo...
2,794 篇 image segmentati...
2,622 篇 visualization
2,574 篇 shape
2,535 篇 face recognition
2,176 篇 robustness
2,124 篇 computer science
1,975 篇 object detection
1,960 篇 computer archite...
1,882 篇 layout
1,853 篇 object recogniti...
1,801 篇 three-dimensiona...
1,725 篇 neural networks
1,705 篇 humans
1,697 篇 image recognitio...

机构

165 篇 univ chinese aca...
144 篇 tsinghua univers...
135 篇 national laborat...
106 篇 univ sci & techn...
104 篇 zhejiang univers...
101 篇 shanghai jiao to...
95 篇 university of sc...
95 篇 microsoft resear...
85 篇 zhejiang univ pe...
84 篇 shanghai ai lab ...
74 篇 school of comput...
69 篇 computer vision ...
68 篇 peking univ peop...
68 篇 chinese acad sci...
66 篇 chinese univ hon...
63 篇 institute of inf...
62 篇 google res mount...
61 篇 univ oxford oxfo...
59 篇 univ toronto on
57 篇 swiss fed inst t...

作者

92 篇 van gool luc
87 篇 umapada pal
78 篇 zhang lei
64 篇 lee seong-whan
50 篇 vittorio murino
42 篇 yang yi
34 篇 nassir navab
34 篇 ling haibin
33 篇 li xin
33 篇 jie yang
32 篇 liu yang
31 篇 loy chen change
30 篇 escalera sergio
30 篇 h. bischof
29 篇 zhou jie
29 篇 vasconcelos nuno
29 篇 jan-michael frah...
28 篇 blumenstein mich...
27 篇 jia yunde
27 篇 luo ping

语言

50,122 篇 英文
2,746 篇 其他
252 篇 中文
22 篇 土耳其文
4 篇 西班牙文
2 篇 日文
2 篇 葡萄牙文
2 篇 俄文

检索条件"任意字段=IEEE Conference on Computer Vision and Pattern Recognition"

共 53104 条记录，以下是4791-4800 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

相关度排序

相关度排序
时效性降序
时效性升序

DeepObjStyle: Deep Object-based Photo Style Transfer

DeepObjStyle: Deep Object-based Photo Style Transfer

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Mastan, Indra Deep Raman, Shanmuganathan Indian Inst Technol Gandhinagar Gandhinagar Gujarat India

ISBN: (纸本)9781665448994

One of the major challenges of style transfer is the appropriate image features supervision between the output image and the input images (style and content). An efficient strategy would be to define an object map between the objects of the style and the content images. However, such a mapping is not well established when there are semantic objects of different types and numbers in the style and the content images. It also leads to content mismatch in the style transfer output, which could reduce the visual quality of the results. We propose an object-based style transfer approach, called DeepObjStyle, for the style supervision in the training data-independent framework. DeepObjStyle preserves the semantics of the objects and achieves better style transfer in the challenging scenario when the style and the content images have a mismatch of image features. We also perform style transfer of images containing a word cloud to demonstrate that DeepObjStyle enables an appropriate image features supervision. We validate the results using quantitative comparisons and user studies.

关键词： Training Visualization computer vision conferences Semantics Tag clouds Quality assessment

来源：评论

学校读者我要写书评

暂无评论

ViP-DeepLab: Learning Visual Perception with Depth-aware Video Panoptic Segmentation

ViP-DeepLab: Learning Visual Perception with Depth-aware Vid...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Qiao, Siyuan Zhu, Yukun Adam, Hartwig Yuille, Alan Chen, Liang-Chieh Johns Hopkins Univ Baltimore MD 21218 USA Google Res Mountain View CA USA

ISBN: (纸本)9781665445092

In this paper, we present ViP-DeepLab, a unified model attempting to tackle the long-standing and challenging inverse projection problem in vision, which we model as restoring the point clouds from perspective image sequences while providing each point with instance-level semantic interpretations. Solving this problem requires the vision models to predict the spatial location, semantic class, and temporally consistent instance label for each 3D point. ViP-DeepLab approaches it by jointly performing monocular depth estimation and video panoptic segmentation. We name this joint task as Depth-aware Video Panoptic Segmentation, and propose a new evaluation metric along with two derived datasets for it, which will be made available to the public. On the individual sub-tasks, ViP-DeepLab also achieves state-of-the-art results, outperforming previous methods by 5.1% VPQ on Cityscapes-VPS, ranking 1st on the KITTI monocular depth estimation benchmark, and 1st on KITTI MOTS pedestrian. The datasets and the evaluation codes are made publicly available(1).

关键词： Measurement Solid modeling Three-dimensional displays Semantics Estimation Predictive models pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Deep Learning-Based Women Hairstyle Classification Through Inception Resnet-101

Deep Learning-Based Women Hairstyle Classification Through I...

引用

2023 International conference on Energy, Materials and Communication Engineering, ICEMCE 2023

作者： Farook, Faazelah Mohamed Mohamed Mansoor Roomi, S. Sasithra Devi, A. Jayanthi Rajee, R.B. Thiagarajar College of Engineering Department of Electronics and Communication Engineering Tamil-Nadu Madurai India

ISBN: (纸本)9798350393378

Artificial intelligence and computer science's computer vision field is revolutionizing a number of industries, including healthcare, automotive, agriculture, security, and entertainment, by enabling robots to assess visual input. For tasks like object detection, classification, and 3D reconstruction, it makes use of methods from the fields of image processing, machine learning, deep learning, and pattern recognition. One promising application of this technology is the classification of hairstyles using pre-trained networks, which can be used for the detection of criminals who may be using certain hairstyles as a form of disguise or it can be used to identify a person through CCTV surveillance for further investigations. In this paper, a study using the performance of Inception Resnet 101, a popular learning architecture has been used for the classification of several common hairstyles. These hairstyles include 'Bald', 'Braids', 'Free Hair', 'Pony Tail' and 'Short Hair'. Validation accuracy of about 89.7% and testing accuracy of 95.6% has been achieved. This approach demonstrates that the model can attain high accuracy even in the face of difficult real-world circumstances, such as changes in illumination, angles, and distances. A few of the industries that need to take this into account are marketing, trend analysis, and customized haircuts. Approaches utilizing computer vision and artificial intelligence facilitate innovation, enhance decision-making, and enhance the user experience. © 2023 ieee.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Unsupervised Learning for Robust Fitting: A Reinforcement Learning Approach

Unsupervised Learning for Robust Fitting: A Reinforcement Le...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Truong, Giang Le, Huu Suter, David Zhang, Erchuan Gilani, Syed Zulqarnain Edith Cowan Univ Sch Sci Churchlands WA Australia Chalmers Univ Technol Dept Elect Engn Gothenburg Sweden

ISBN: (纸本)9781665445092

Robust model fitting is a core algorithm in a large number of computer vision applications. Solving this problem efficiently for datasets highly contaminated with outliers is, however, still challenging due to the underlying computational complexity. Recent literature has focused on learning-based algorithms. However, most approaches are supervised (which require a large amount of labelled training data). In this paper, we introduce a novel unsupervised learning framework that learns to directly solve robust model fitting. Unlike other methods, our work is agnostic to the underlying input features, and can be easily generalized to a wide variety of LP-type problems with quasi-convex residuals. We empirically show that our method outperforms existing unsupervised learning approaches, and achieves competitive results compared to traditional methods on several important computer vision problems(1).

关键词： computer vision Structure from motion Computational modeling Fitting Estimation Training data Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

LoL-V2T: Large-Scale Esports Video Description Dataset

LoL-V2T: Large-Scale Esports Video Description Dataset

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Tanaka, Tsunehiko Simo-Serra, Edgar Waseda Univ Tokyo Japan

ISBN: (纸本)9781665448994

Esports is a fastest-growing new field with a largely online-presence, and is creating a demand for automatic domain-specific captioning tools. However, at the current time, there are few approaches that tackle the esports video description problem. In this work, we propose a large-scale dataset for esports video description, focusing on the popular game "League of Legends". The dataset, which we call LoL-V2T, is the largest video description dataset in the video game domain, and includes 9,723 clips with 62,677 captions. This new dataset presents multiple new video captioning challenges such as large amounts of domain-specific vocabulary, subtle motions with large importance, and a temporal gap between most captions and the events that occurred. In order to tackle the issue of vocabulary, we propose a masking the domain-specific words and provide additional annotations for this. In our results, we show that the dataset poses a challenge to existing video captioning approaches, and the masking can significantly improve performance. Our dataset and code is publicly available(1).

关键词： Training Vocabulary computer vision Video description conferences Focusing Games

来源：评论

学校读者我要写书评

暂无评论

Long-tailed Oracle Character recognition Based on Convolutional Neural Networks and vision Transformers

Long-tailed Oracle Character Recognition Based on Convolutio...

引用

2025 ieee International conference on Acoustics, Speech, and Signal Processing, ICASSP 2025

作者： Yang, Zhongyuan Han, Zhiwang Aysa, Alimjan Ibrahim, Ghalip Ubul, Kurban School of Computer Science and Technology Xinjiang University Xinjiang China Xinjiang Key Laboratory of Multilingual Information Technology Xinjiang University Xinjiang China

ISBN: (纸本)9798350368741

Oracle bone inscriptions, which are among the oldest known hieroglyphics in China, encompass rich historical and cultural information. However, the automatic recognition of oracle characters faces substantial challenges due to issues with data quality and long-tail distribution. This study introduces a hybrid model based on a convolutional neural network (CNN) and a visual transformer (ViT), augmented with a multi-expert learning strategy, to enhance the recognition performance of long-tail oracle bone characters. Initially, the CNN extracts local features, while the ViT captures global dependencies, thereby improving the model's feature representation capability. Subsequently, through the multi-expert learning mechanism, the prediction results of multiple models are aggregated, effectively mitigating the effects of data imbalance and alleviating the challenges associated with the long-tail distribution of the dataset. Experimental results demonstrate that the proposed model achieves new state-of-the-art performance on two large-scale Oracle datasets (Oracle-AYNU and OBC306) in terms of Top-1 accuracy, Top-5 accuracy, F1-Score, and average accuracy, with respective values of 87.97% (93.61%), 96.58% (98.84%), 86.83% (93.49%), and 82.81% (85.23%). © 2025 ieee.

关键词： convolutional neural network long-tail distribution Oracle recognition visual transformer

来源：评论

学校读者我要写书评

暂无评论

Hyperdimensional computing as a framework for systematic aggregation of image descriptors

Hyperdimensional computing as a framework for systematic agg...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Neubert, Peer Schubert, Stefan Tech Univ Chemnitz Chemnitz Germany

ISBN: (纸本)9781665445092

Image and video descriptors are an omnipresent tool in computer vision and its application fields like mobile robotics. Many hand-crafted and in particular learned image descriptors are numerical vectors with a potentially (very) large number of dimensions. Practical considerations like memory consumption or time for comparisons call for the creation of compact representations. In this paper, we use hyperdimensional computing (HDC) as an approach to systematically combine information from a set of vectors in a single vector of the same dimensionality. HDC is a known technique to perform symbolic processing with distributed representations in numerical vectors with thousands of dimensions. We present a HDC implementation that is suitable for processing the output of existing and future (deep learning based) image descriptors. We discuss how this can be used as a framework to process descriptors together with additional knowledge by simple and fast vector operations. A concrete outcome is a novel HDC-based approach to aggregate a set of local image descriptors together with their image positions in a single holistic descriptor. The comparison to available holistic descriptors and aggregation methods on a series of standard mobile robotics place recognition experiments shows a 20% improvement in average performance and > 2x better worst-case performance compared to runner-up.

关键词： computer vision Systematics Aggregates Semantics Memory management Layout Tools

来源：评论

学校读者我要写书评

暂无评论

Contrastive Learning for Sports Video: Unsupervised Player Classification

Contrastive Learning for Sports Video: Unsupervised Player C...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Koshkina, Maria Pidaparthy, Hemanth Elder, James H. York Univ Toronto ON Canada

ISBN: (纸本)9781665448994

We address the problem of unsupervised classification of players in a team sport according to their team affiliation, when jersey colours and design are not known a priori. We adopt a contrastive learning approach in which an embedding network learns to maximize the distance between representations of players on different teams relative to players on the same team, in a purely unsupervised fashion, without any labelled data. We evaluate the approach using a new hockey dataset and find that it outperforms prior unsupervised approaches by a substantial margin, particularly for real-time application when only a small number of frames are available for unsupervised learning before team assignments must be made. Remarkably, we show that our contrastive method achieves 94% accuracy after unsupervised training on only a single frame, with accuracy rising to 97% within 500 frames (17 seconds of game time). We further demonstrate how accurate team classification allows accurate team-conditional heat maps of player positioning to be computed.

关键词： Training Heating systems computer vision Image color analysis conferences Games Real-time systems

来源：评论

学校读者我要写书评

暂无评论

Architectural Adversarial Robustness: The Case for Deep Pursuit

Architectural Adversarial Robustness: The Case for Deep Purs...

引用

ieee/CVF conference on computer vision and pattern recognition (CVPR)

作者： Cazenavette, George Murdock, Calvin Lucey, Simon Carnegie Mellon Univ Pittsburgh PA 15213 USA Univ Adelaide Adelaide SA Australia

ISBN: (纸本)9781665445092

Despite their unmatched performance, deep neural networks remain susceptible to targeted attacks by nearly imperceptible levels of adversarial noise. While the underlying cause of this sensitivity is not well understood, theoretical analyses can be simplified by refraining each layer of a feed forward network as an approximate solution to a sparse coding problem. Iterative solutions using basis pursuit are theoretically more stable and have improved adversarial robustness. However, cascading layer-wise pursuit implementations suffer from error accumulation in deeper networks. In contrast, our new method of deep pursuit approximates the activations of all layers as a single global optimization problem, allowing us to consider deepen real-world architectures with skip connections such as residual networks. Experimentally, our approach demonstrates improved robustness to adversarial noise.

关键词： Deep learning Resistance computer vision Sensitivity computer architecture Sparse representation Robustness

来源：评论

学校读者我要写书评

暂无评论

Yoga Pose Detection with Deep Learning and computer vision 15

Yoga Pose Detection with Deep Learning and Computer Vision

引用

15th International conference on Computing Communication and Networking Technologies, ICCCNT 2024

作者： Srivastava, Amisha Gayathri, K. Thangavel, Senthil Kumar Meghana, Pullela Vishvajit, S. Senthil Kumar, B. Somasundaram, K. Amrita Vishwa Vidyapeetham Amrita School of Computing Department of Computer Science and Engineering Tamil Nadu Coimbatore India Amrita Vishwa Vidyapeetham Amrita Darshanam International Centre for Spiritual Studies Coimbatore India Amrita Vishwa Vidyapeetham Amrita School of Physical Sciences Department of Mathematics Coimbatore India

ISBN: (纸本)9798350370249

An approach to do real-time monitoring of Yoga Asanas using Deep Learning and computer vision approaches. Convolutional neural networks (CNN) and long short-term memory (LSTM) are combined to create a hybrid deep learning model. Human pose recognition can be used to create a selfinstruction exercise system that enables people to learn and perform exercises appropriately on their own as these resources are not always readily available. This project discusses several machine learning and deep learning algorithms to precisely identify yoga positions on pre-recorded films as well as in realtime, laying the groundwork for developing such a system. These kinds of applications are useful during times of lockdown, such as the lockdown we experienced in 2020 due to the coronavirus epidemic, when people's freedom of movement is severely constrained and they may use such programmers' quite easily from home. © 2024 ieee.

关键词： Activity recognition CNN Deep Learning Open Pose Posture Analysis Yoga Asanas

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 476 477 478 479 480 481 482 483 484 485 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：