ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Deep learning on microcontroller (MCU)-based IoT devices is extremely challenging due to memory constraints. Prior approaches use either internal or external memory exclusively, which limits accuracy or latency, respectively. We find that a hybrid method using both internal and external MCU memory outperforms either approach in accuracy and latency. We develop TinyOps, an inference engine that accelerates inference for models in slow external memory using a partitioning and overlaying scheme driven by the available Direct Memory Access (DMA) peripheral, combining the advantages of external memory (size) and internal memory (speed). Experimental results show that architectures deployed with TinyOps significantly outperform models designed for internal memory, with up to 6% higher accuracy and, importantly, 1.3-2.2x faster inference, setting the state of the art in TinyML ImageNet classification. Our work shows that the TinyOps design space is more efficient than the internal- or external-memory design spaces and should be explored further for TinyML applications.
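To make the overlay idea concrete, here is a minimal Python sketch of how a partition-and-overlay schedule can hide external-memory fetch time behind compute, in the spirit of TinyOps. The buffer sizes, transfer rate, and helper names are illustrative assumptions, not the authors' implementation (which runs on an MCU with a real DMA peripheral):

```python
# Hypothetical sketch of TinyOps-style partitioning and overlaying.
# Layer weights live in large, slow "external" memory; compute reads from a
# small, fast "internal" scratch buffer. While partition i is being computed,
# partition i+1 is prefetched (the role the DMA peripheral plays on an MCU).

EXTERNAL_KB_PER_MS = 8    # assumed external-memory transfer rate
INTERNAL_BUDGET_KB = 64   # assumed internal scratch size (two 32 KB halves)

def partition(weights_kb: list[int], half_kb: int) -> list[list[int]]:
    """Greedily group layer weights into partitions that fit one buffer half."""
    parts, cur, used = [], [], 0
    for w in weights_kb:
        if used + w > half_kb and cur:
            parts.append(cur)
            cur, used = [], 0
        cur.append(w)
        used += w
    if cur:
        parts.append(cur)
    return parts

def overlay_latency_ms(parts, compute_ms_per_kb=0.05):
    """Latency with double buffering: fetching partition i+1 overlaps compute of i."""
    fetch = [sum(p) / EXTERNAL_KB_PER_MS for p in parts]
    compute = [sum(p) * compute_ms_per_kb for p in parts]
    total = fetch[0]  # the first partition must arrive before any compute
    for i in range(len(parts)):
        nxt = fetch[i + 1] if i + 1 < len(parts) else 0.0
        total += max(compute[i], nxt)  # DMA and CPU run concurrently
    return total

layers_kb = [12, 8, 20, 10, 24, 6, 16]  # toy per-layer weight sizes in KB
parts = partition(layers_kb, INTERNAL_BUDGET_KB // 2)
print(f"{len(parts)} partitions, overlaid latency ~{overlay_latency_ms(parts):.2f} ms")
```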
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
To understand the genuine emotions expressed by humans during social interactions, it is necessary to recognize the subtle facial changes (micro-expressions) demonstrated by an individual. Facial micro-expressions are brief, rapid, spontaneous gestures and involuntary facial muscle movements beneath the skin; classifying them is therefore challenging. This paper presents a novel end-to-end three-stream graph attention network model that captures the subtle changes on the face and recognizes micro-expressions (MEs) by exploiting the relationships among optical flow magnitude, optical flow direction, and node location features. A facial graph representation extracts spatial and temporal information from the three frames. Optical flow features computed over dynamically varying patch sizes capture local texture information around each landmark point. The network uses only the landmark location features and the optical flow information around these points, and yields strong results for ME classification. A comprehensive evaluation on the SAMM and CASME II datasets demonstrates the high efficacy, efficiency, and generalizability of the proposed approach, which achieves better results than state-of-the-art methods.
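For illustration, a minimal sketch of how the three per-node feature streams (flow magnitude, flow direction, landmark location) might be assembled for a facial landmark graph. The patch size, input shapes, and function names are assumptions for this example, not the paper's code:

```python
# Build three-stream node features from an optical flow field and landmarks.
import numpy as np

def node_features(flow: np.ndarray, landmarks: np.ndarray, patch: int = 5):
    """flow: (H, W, 2) optical flow; landmarks: (N, 2) pixel coords (x, y).
    Returns per-node (magnitude, direction, location) feature streams."""
    h, w, _ = flow.shape
    mags, dirs = [], []
    r = patch // 2
    for x, y in landmarks.astype(int):
        # Average the flow over a small patch around the landmark.
        patch_flow = flow[max(0, y - r):y + r + 1, max(0, x - r):x + r + 1]
        dx, dy = patch_flow[..., 0].mean(), patch_flow[..., 1].mean()
        mags.append(np.hypot(dx, dy))    # optical-flow magnitude stream
        dirs.append(np.arctan2(dy, dx))  # optical-flow direction stream
    locs = landmarks / np.array([w, h])   # normalized node-location stream
    return np.array(mags), np.array(dirs), locs

flow = np.random.randn(128, 128, 2).astype(np.float32)  # toy flow field
landmarks = np.random.randint(8, 120, size=(68, 2))     # 68 facial landmarks
m, d, l = node_features(flow, landmarks)
print(m.shape, d.shape, l.shape)  # (68,) (68,) (68, 2)
```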
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
3D human motion prediction requires making sense of the complex spatio-temporal dynamics that underpin human motion in order to make highly accurate predictions. Part of this complexity stems from the trade-off between long-term (>400 ms) and short-term (<400 ms) predictions, which require different levels of granularity to observe patterns. Several works have explored ways to improve long-term prediction by utilizing longer motion histories, but this typically comes at the cost of very short-term (<200 ms) performance. Inspired by high-resolution network architectures, we propose a novel high-resolution spatio-temporal attention network (HR-STAN) that leverages parallel feature branches and dilated convolutions to observe human motion at different scales. Furthermore, we augment this architecture with split spatial and temporal attention mechanisms to efficiently capture spatio-temporal dependencies within a given motion. We evaluate the ability of HR-STAN to incorporate long-term motion histories while producing short-term predictions and show that it improves over several state-of-the-art methods on both the AMASS and Human3.6M benchmarks.
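A minimal sketch of "split" spatial and temporal attention over a pose-sequence tensor, in the spirit of the mechanism described above. The shapes and module layout are assumptions for illustration, not the HR-STAN implementation:

```python
# Spatial attention mixes joints within a frame; temporal attention mixes
# frames per joint. Splitting the two is cheaper than full joint-time attention.
import torch
import torch.nn as nn

class SplitSpatioTemporalAttention(nn.Module):
    def __init__(self, dim: int, heads: int = 4):
        super().__init__()
        self.spatial = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.temporal = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, frames T, joints J, dim) pose features
        b, t, j, d = x.shape
        # Spatial attention: joints attend to each other within a frame.
        s = x.reshape(b * t, j, d)
        s, _ = self.spatial(s, s, s)
        x = x + s.reshape(b, t, j, d)
        # Temporal attention: each joint attends across frames.
        tt = x.permute(0, 2, 1, 3).reshape(b * j, t, d)
        tt, _ = self.temporal(tt, tt, tt)
        x = x + tt.reshape(b, j, t, d).permute(0, 2, 1, 3)
        return x

x = torch.randn(2, 10, 22, 64)  # 2 sequences, 10 frames, 22 joints
print(SplitSpatioTemporalAttention(64)(x).shape)  # torch.Size([2, 10, 22, 64])
```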
ISBN (print): 9798350365474
This work reviews the results of the NTIRE 2024 Challenge on Shadow Removal. Building on last year's edition, the current challenge was organized in two tracks: one focused on increased-fidelity reconstruction and a separate ranking for high-performing perceptual-quality solutions. Track 1 (fidelity) had 214 registered participants, with 17 teams submitting in the final phase, while Track 2 (perceptual) registered 185 participants, resulting in 18 final-phase submissions. Both tracks were based on data from the WSRD dataset, simulating interactions between self-shadows and cast shadows with a large variety of represented objects, textures, and materials. Improved image alignment enabled increased-fidelity reconstruction, with restored frames from top-performing solutions mostly indistinguishable from the reference images.
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Deep convolutional neural networks now perform increasingly well in various fields, while their parameter counts grow massive as advanced networks become deeper. Among model compression methods, quantization is one of the most potent: it compresses neural networks by compacting model weights and activations to lower bit-widths. Data-free quantization has also been proposed; it is specialized for privacy- and security-sensitive scenarios and enables quantization without access to real data. In this work, we find that the tuning robustness of existing data-free quantization is flawed. We conduct an empirical study and identify hyperparameter settings under which models converge stably during the data-free quantization process. Our study evaluates the overall tuning robustness of current data-free quantization systems and shows that existing methods are significantly affected by hyperparameter fluctuations during tuning. We hope that data-free quantization methods with tuning robustness will appear in the future.
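For context, a sketch of the basic operation that data-free quantization builds on: uniform affine quantization of a weight tensor to a lower bit-width. This is illustrative only; the paper studies the tuning robustness of full data-free pipelines, not this single step:

```python
# Quantize a float weight tensor to b bits and map it back, so the rounding
# error introduced by the lower bit-width can be inspected directly.
import numpy as np

def quantize_dequantize(w: np.ndarray, bits: int = 4) -> np.ndarray:
    """Uniform affine quantization of `w` to `bits`, then dequantization."""
    qmin, qmax = 0, 2**bits - 1
    scale = (w.max() - w.min()) / (qmax - qmin)
    zero_point = round(-w.min() / scale)
    q = np.clip(np.round(w / scale) + zero_point, qmin, qmax)
    return (q - zero_point) * scale  # dequantized approximation of w

w = np.random.randn(256, 256).astype(np.float32)
w_hat = quantize_dequantize(w, bits=4)
print("mean abs quantization error:", np.abs(w - w_hat).mean())
```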
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
With the demand for analyzing and predicting traffic flow in smart-city applications, Multi-Target Multi-Camera vehicle Tracking (MTMCT) at the city scale has become a fundamental problem. MTMCT is challenging due to view variations, frequent occlusions, and visually similar vehicle models within the same camera. This work proposes an MTMCT framework based on occlusion-aware and inter-vehicle information that can effectively match vehicle tracklets. The occlusion-aware module segments the tracklets of occluded and occluding vehicle pairs and recalculates the similarity of the completed tracklets, which handles occlusions and suppresses false detections. An inter-vehicle information module improves matching accuracy by enhancing the ability to distinguish similar vehicles under the same camera at different times. The full framework consists of four modules: (1) vehicle detection and feature extraction with re-identification models, (2) single-camera tracking (SCT) with the occlusion-aware module to produce initial tracklets, (3) tracklet similarity computation via inter-vehicle association, and (4) clustering across adjacent cameras for multi-camera tracklet matching. The proposed method obtains an IDF1 score of 0.8285 on the Track 1 multi-camera vehicle tracking task of the 2022 AI City Challenge.
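An illustrative sketch of the baseline step such frameworks build on: matching tracklets across cameras by re-identification appearance similarity. The greedy matching, threshold, and names here are hypothetical; the paper's framework adds the occlusion-aware and inter-vehicle modules on top of features like these:

```python
# Greedy cross-camera tracklet matching on cosine similarity of averaged
# per-frame re-ID features.
import numpy as np

def tracklet_embedding(frame_feats: np.ndarray) -> np.ndarray:
    """Average per-frame re-ID features into one L2-normalized vector."""
    v = frame_feats.mean(axis=0)
    return v / np.linalg.norm(v)

def match_tracklets(cam_a: list, cam_b: list, thr: float = 0.7):
    """Greedily pair tracklets whose cosine similarity exceeds `thr`."""
    ea = np.stack([tracklet_embedding(t) for t in cam_a])
    eb = np.stack([tracklet_embedding(t) for t in cam_b])
    sim = ea @ eb.T
    pairs = []
    while sim.max() > thr:
        i, j = np.unravel_index(sim.argmax(), sim.shape)
        pairs.append((int(i), int(j)))
        sim[i, :], sim[:, j] = -1, -1  # each tracklet matched at most once
    return pairs

cam_a = [np.random.randn(20, 128) for _ in range(5)]  # toy re-ID features
cam_b = [np.random.randn(15, 128) for _ in range(4)]
print(match_tracklets(cam_a, cam_b))
```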
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Class-imbalanced datasets can severely degrade the performance of semi-supervised learning (SSL). This is due to confirmation bias, especially when the pseudo labels are highly biased towards the majority classes. Traditional resampling or reweighting techniques may not be directly applicable when the unlabeled data distribution is unknown. Inspired by the threshold-moving method, which performs well in supervised binary classification, we provide a simple yet effective scheme to address the multiclass imbalance issue in SSL. The scheme, named SaR, is a Self-adaptive Refinement of soft labels applied before pseudo labels are generated. Pseudo labels generated after SaR are less biased, yielding higher-quality data for training the classifier. We show that SaR consistently improves recent consistency-based SSL algorithms on various image classification problems across different imbalance ratios. We also show that SaR is robust to situations where the unlabeled data are distributed differently from the labeled data; hence, SaR does not rely on the assumption that unlabeled data share the same distribution as the labeled data.
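A hypothetical sketch of the threshold-moving idea applied to soft labels before pseudo-labeling. SaR's actual refinement rule is defined in the paper; the inverse-frequency scaling below is only an illustration of the general mechanism:

```python
# Refine soft labels by down-weighting majority classes, then apply the
# usual confidence-thresholded pseudo-labeling step from consistency SSL.
import numpy as np

def refine_soft_labels(probs: np.ndarray, class_freq: np.ndarray, tau=1.0):
    """probs: (N, C) model soft labels; class_freq: (C,) estimated class
    frequencies. Rescale against frequency and renormalize per sample."""
    scaled = probs / (class_freq ** tau)
    return scaled / scaled.sum(axis=1, keepdims=True)

def pseudo_labels(probs: np.ndarray, conf_thr: float = 0.95):
    """Keep only predictions whose top probability clears the threshold."""
    conf, labels = probs.max(axis=1), probs.argmax(axis=1)
    mask = conf >= conf_thr
    return labels[mask], mask

probs = np.random.dirichlet(np.ones(10) * 0.3, size=1000)  # toy soft labels
freq = np.linspace(0.5, 0.05, 10)                          # imbalanced estimate
freq /= freq.sum()
labels, mask = pseudo_labels(refine_soft_labels(probs, freq))
print(f"kept {mask.sum()} / {len(probs)} pseudo labels")
```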
ISBN (print): 9798350301298
Interpreting and explaining the behavior of deep neural networks is critical for many tasks. Explainable AI provides a way to address this challenge, mostly by attributing per-pixel relevance to the decision. Yet interpreting such explanations may require expert knowledge. Some recent attempts toward interpretability adopt a concept-based framework, giving a higher-level relationship between concepts and model decisions. This paper proposes the Bottleneck Concept Learner (BotCL), which represents an image solely by the presence or absence of concepts learned through training on the target task, without explicit supervision over the concepts. It uses self-supervision and tailored regularizers so that the learned concepts are human-understandable. Using several image classification tasks as our testbed, we demonstrate BotCL's potential to rebuild neural networks for better interpretability.
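A minimal concept-bottleneck sketch in the spirit of BotCL, where the classifier sees only concept presence scores. The prototype-similarity formulation, sizes, and names are assumptions for illustration, not the paper's implementation:

```python
# An image is represented only by the presence/absence of learned concepts;
# classification is a linear read-out of those concept activations, so the
# activations themselves double as the explanation.
import torch
import torch.nn as nn

class ConceptBottleneck(nn.Module):
    def __init__(self, feat_dim=512, n_concepts=20, n_classes=10):
        super().__init__()
        self.concepts = nn.Parameter(torch.randn(n_concepts, feat_dim))
        self.classifier = nn.Linear(n_concepts, n_classes, bias=False)

    def forward(self, feats: torch.Tensor):
        # Similarity between image features and each learned concept
        # prototype, squashed to a presence score in (0, 1).
        presence = torch.sigmoid(feats @ self.concepts.t())
        return self.classifier(presence), presence  # logits + explanation

feats = torch.randn(8, 512)            # toy backbone features for 8 images
logits, presence = ConceptBottleneck()(feats)
print(logits.shape, presence.shape)    # torch.Size([8, 10]) torch.Size([8, 20])
```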
ISBN (digital): 9781665487399
ISBN (print): 9781665487399
Vision transformers (ViTs) have recently attracted considerable attention, but their huge computational cost remains an issue for practical deployment. Previous ViT pruning methods tend to prune the model along a single dimension only, which may cause excessive reduction and lead to sub-optimal model quality. In contrast, we advocate a multi-dimensional ViT compression paradigm and propose to harness redundancy reduction across the attention-head, neuron, and sequence dimensions jointly. First, we propose a statistical-dependence-based pruning criterion that generalizes to different dimensions for identifying deleterious components. Moreover, we cast multi-dimensional ViT compression as an optimization problem whose objective is to learn an optimal pruning policy across the three dimensions that maximizes the compressed model's accuracy under a computational budget. The problem is solved with an adapted Gaussian process search using expected improvement. Experimental results show that our method effectively reduces the computational cost of various ViT models. For example, it reduces FLOPs by 40% without top-1 accuracy loss for DeiT and T2T-ViT models on the ImageNet dataset, outperforming previous state-of-the-art ViT pruning methods.
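To sketch the search component, here is a toy Gaussian-process loop with expected improvement over a three-dimensional pruning policy (head, neuron, sequence ratios). The accuracy proxy, budget constraint, and ranges are fabricated stand-ins; the paper's search space and evaluation are more elaborate:

```python
# Bayesian-optimization-style search for a pruning policy: fit a GP to
# observed (policy, accuracy) pairs, then pick the candidate maximizing EI.
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor

def accuracy_proxy(policy):  # placeholder for "prune, fine-tune, evaluate"
    head, neuron, seq = policy
    return 1.0 - 0.6 * head**2 - 0.3 * neuron**2 - 0.4 * seq**2

def expected_improvement(gp, X, best):
    mu, sigma = gp.predict(X, return_std=True)
    z = (mu - best) / np.maximum(sigma, 1e-9)
    return (mu - best) * norm.cdf(z) + sigma * norm.pdf(z)

rng = np.random.default_rng(0)
X = rng.uniform(0, 0.6, size=(5, 3))            # initial pruning ratios
y = np.array([accuracy_proxy(p) for p in X])
gp = GaussianProcessRegressor(normalize_y=True)
for _ in range(20):
    gp.fit(X, y)
    cand = rng.uniform(0, 0.6, size=(256, 3))
    cand = cand[cand.sum(axis=1) >= 0.8]        # toy FLOPs-budget constraint
    x = cand[np.argmax(expected_improvement(gp, cand, y.max()))]
    X, y = np.vstack([X, x]), np.append(y, accuracy_proxy(x))
print("best policy (head, neuron, seq):", X[np.argmax(y)].round(3))
```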
ISBN (print): 9798350301298
Movie story analysis requires understanding characters' emotions and mental states. Towards this goal, we formulate emotion understanding as predicting a diverse, multi-label set of emotions at the level of a movie scene and for each character. We propose EmoTx, a multimodal Transformer-based architecture that ingests video, multiple characters, and dialog utterances to make joint predictions. Leveraging annotations from the MovieGraphs dataset [72], we aim to predict classic emotions (e.g., happy, angry) and other mental states (e.g., honest, helpful). We conduct experiments on the 10 and 25 most frequently occurring labels, and on a mapping that clusters 181 labels into 26. Ablation studies and comparisons against adapted state-of-the-art emotion recognition approaches show the effectiveness of EmoTx. Analyzing EmoTx's self-attention scores reveals that expressive emotions often attend to character tokens, while other mental states rely on video and dialog cues.
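A rough sketch of the multimodal, multi-label setup described above: concatenate video, character, and dialog tokens, encode them jointly with a Transformer, and read independent per-emotion probabilities off a classification token. Token layout and sizes here are assumptions, not the EmoTx architecture:

```python
# Multi-label emotion tagging over concatenated multimodal token streams.
import torch
import torch.nn as nn

class MultimodalEmotionTagger(nn.Module):
    def __init__(self, dim=256, n_emotions=25, n_layers=2):
        super().__init__()
        layer = nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)
        self.cls = nn.Parameter(torch.randn(1, 1, dim))  # scene-level query
        self.head = nn.Linear(dim, n_emotions)

    def forward(self, video, chars, dialog):
        # Prepend a classification token to the concatenated modality tokens
        # and read the multi-label logits off that token after encoding.
        b = video.shape[0]
        tokens = torch.cat([self.cls.expand(b, -1, -1), video, chars, dialog], 1)
        return self.head(self.encoder(tokens)[:, 0])

video = torch.randn(2, 30, 256)   # toy clip features
chars = torch.randn(2, 8, 256)    # toy character-track features
dialog = torch.randn(2, 12, 256)  # toy utterance features
logits = MultimodalEmotionTagger()(video, chars, dialog)
probs = torch.sigmoid(logits)     # independent per-emotion probabilities
print(probs.shape)                # torch.Size([2, 25])
```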