Search Results - Inner Mongolia University Library

Document type

8,905 conference papers
43 journal articles
18 books

Holdings

8,965 electronic documents
1 print holding

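Subject classification

4,564 Engineering
- 4,024 Computer Science and Technology...
- 2,182 Software Engineering
- 1,241 Optical Engineering
- 558 Control Science and Engineering
- 433 Information and Communication Engineering
- 430 Mechanical Engineering
- 294 Electrical Engineering
- 288 Instrument Science and Technology
- 179 Bioengineering
- 159 Biomedical Engineering (degree may be awarded in...
- 119 Electronic Science and Technology (degree may be...
- 64 Safety Science and Engineering
- 58 Architecture
- 58 Chemical Engineering and Technology
- 52 Civil Engineering
- 52 Transportation Engineering
- 40 Mechanics (degree may be awarded in engineering, science...
2,066 Science
- 1,382 Physics
- 1,198 Mathematics
- 420 Statistics (degree may be awarded in science, ...
- 238 Biology
- 55 Chemistry
- 36 Systems Science
266 Management
- 182 Library, Information and Archives Management...
- 92 Management Science and Engineering (degree may be...
- 47 Business Administration
223 Medicine
- 222 Clinical Medicine
- 39 Basic Medicine (degree may be awarded in medicine...
205 Art Studies
- 205 Design (degree may be awarded in art studies...
45 Law
- 43 Sociology
21 Agriculture
14 Education
9 Economics
6 Military Science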

Topic

3,414 computer vision
1,216 pattern recognit...
946 cameras
908 conferences
765 computer science
674 image segmentati...
618 layout
598 training
548 shape
518 robustness
451 feature extracti...
448 humans
445 face recognition
405 computational mo...
402 object detection
365 visualization
356 computer archite...
336 application soft...
304 lighting
257 image reconstruc...

Institution

41 microsoft resear...
30 department of co...
25 department of co...
23 institute for co...
22 department of co...
22 school of comput...
20 university of sc...
20 swiss fed inst t...
19 tsinghua univers...
19 institute of com...
18 swiss fed inst t...
17 the robotics ins...
17 carnegie mellon ...
17 computer vision ...
17 department of co...
16 institute of inf...
16 school of comput...
15 school of comput...
15 carnegie mellon ...
14 national laborat...

Author

57 timofte radu
25 huang thomas s.
24 van gool luc
23 s.k. nayar
22 nayar shree k.
22 t. kanade
21 jain anil k.
20 luc van gool
19 t.s. huang
18 xiaoou tang
18 murino vittorio
18 horst bischof
17 a.k. jain
17 t. darrell
16 g. healey
16 bowyer kevin w.
16 bischof horst
15 m.j. black
15 li stan z.
15 m. shah

Language

8,904 English
53 Other
8 Chinese
1 Turkish

Search query: "Any field = IEEE-Computer-Society Conference on Computer Vision and Pattern Recognition Workshops"

8,966 records in total; showing records 951-960.

EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Zhuoyang Zhang; Han Cai; Song Han (Tsinghua University; NVIDIA; MIT)

ISBN (digital): 9798350365474

ISBN (print): 9798350365481

We present EfficientViT-SAM, a new family of accelerated segment anything models. We retain SAM’s lightweight prompt encoder and mask decoder while replacing the heavy image encoder with EfficientViT. For the training, we begin with the knowledge distillation from the SAM-ViT-H image encoder to EfficientViT. Subsequently, we conduct end-to-end training on the SA-1B dataset. Benefiting from EfficientViT’s efficiency and capacity, EfficientViT-SAM delivers 48.9× measured TensorRT speedup on A100 GPU over SAM-ViT-H without sacrificing performance. Our code and pre-trained models are released at https://***/mit-han-lab/efficientvit.

Keywords: Training; Image segmentation; Computer vision; Codes; Conferences; Computational modeling; Graphics processing units

Selective Replay Enhances Learning in Online Continual Analogical Reasoning

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Hayes, Tyler L.; Kanan, Christopher (Rochester Inst Technol, Rochester NY 14623 USA; Paige, Rochester NY USA; Cornell Tech, New York NY USA)

ISBN (print): 9781665448994

In continual learning, a system learns from non-stationary data streams or batches without catastrophic forgetting. While this problem has been heavily studied in supervised image classification and reinforcement learning, continual learning in neural networks designed for abstract reasoning has not yet been studied. Here, we study continual learning of analogical reasoning. Analogical reasoning tests such as Raven's Progressive Matrices (RPMs) are commonly used to measure non-verbal abstract reasoning in humans, and recently offline neural networks for the RPM problem have been proposed. In this paper, we establish experimental baselines, protocols, and forward and backward transfer metrics to evaluate continual learners on RPMs. We employ experience replay to mitigate catastrophic forgetting. Prior work using replay for image classification tasks has found that selectively choosing the samples to replay offers little, if any, benefit over random selection. In contrast, we find that selective replay can significantly outperform random selection for the RPM task.

Keywords: Measurement; Protocols; Neural networks; Reinforcement learning; Streaming media; Cognition; Pattern recognition
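
The contrast between random and selective replay in the abstract above can be illustrated with a small buffer. This is only a sketch: the record does not say how samples are scored or selected, so the `priority` value, the FIFO eviction, and the names `ReplayBuffer`, `sample_random` and `sample_selective` are all assumptions made for illustration.

```python
import random


class ReplayBuffer:
    """Fixed-capacity store of past samples, each with a priority score."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.items = []  # list of (sample, priority) pairs

    def add(self, sample, priority):
        # Evict the oldest entry once the buffer is full (FIFO is an assumption).
        if len(self.items) >= self.capacity:
            self.items.pop(0)
        self.items.append((sample, priority))

    def sample_random(self, k, rng=random):
        # Random replay: every stored sample is equally likely to be drawn.
        chosen = rng.sample(self.items, min(k, len(self.items)))
        return [sample for sample, _ in chosen]

    def sample_selective(self, k):
        # Selective replay: return the k samples with the highest priority.
        ranked = sorted(self.items, key=lambda item: item[1], reverse=True)
        return [sample for sample, _ in ranked[:k]]
```

In a continual learner, the priority could be any per-sample statistic (for example a loss or confidence value); which statistic works best is exactly the kind of question the paper's experiments address.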

On the Robustness and Generalizability of Face Synthesis Detection Methods

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Sabel, Johan; Johansson, Fredrik (Swedish Def Res Agcy FOI, Stockholm, Sweden)

ISBN (print): 9781665448994

In recent years, significant progress has been made within human face synthesis. It is now possible, and easy for anyone, to generate credible high-resolution images of non-existing people. This calls for effective detection methods. In this paper, three state-of-the-art deep learning-based methods are evaluated with respect to their robustness and generalizability, which are two factors that must be taken into consideration for methods intended to be deployed in the wild. The robustness experiments show that it is possible to achieve near-perfect performance when discriminating between real and synthetic facial images that have been post-processed heavily with various perturbation techniques, especially when similar perturbations are incorporated during training of the detection models. The generalization experiments show that already trained detection models can achieve high performance on images from sources not known during training, provided that the models are fine-tuned on such images. One model achieved an average accuracy of 96.8% after being fine-tuned on 3 training images from each unknown source considered (one real and one synthetic source). However, additional images were required when fine-tuning using a different approach aimed at preventing catastrophic forgetting. Furthermore, in general, no method generalized well without fine-tuning. Hence, the limited generalization capability remains a shortcoming that must be overcome before the detection methods can be utilized in the wild.

Keywords: Training; Learning systems; Computer vision; Perturbation methods; Face recognition; Conferences; Robustness

A Bop and Beyond: A Second Order Optimizer for Binarized Neural Networks

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Daniel Suarez-Ramirez, Cuauhtemoc; Gonzalez-Mendoza, Miguel; Chang, Leonardo; Ochoa-Ruiz, Gilberto; Alberto Duran-Vega, Mario (Tecnol Monterrey, Sch Engn & Sci, Dept Comp Sci, Monterrey, NL, Mexico)

ISBN (print): 9781665448994

The optimization of Binary Neural Networks (BNNs) relies on approximating the real-valued weights with their binarized representations. Current techniques for weight-updating use the same approaches as traditional Neural Networks (NNs) with the extra requirement of using an approximation to the derivative of the sign function (as it is the Dirac delta function) for back-propagation; thus, efforts are focused on adapting full-precision techniques to work on BNNs. In the literature, only one previous effort has tackled the problem of directly training the BNNs with bit-flips by using the first raw moment estimate of the gradients and comparing it against a threshold for deciding when to flip a weight (Bop). In this paper, we take an approach parallel to Adam which also uses the second raw moment estimate to normalize the first raw moment before doing the comparison with the threshold; we call this method Bop2ndOrder. We present two versions of the proposed optimizer: a biased one and a bias-corrected one, each with its own applications. Also, we present a complete ablation study of the hyperparameter space, as well as the effect of using schedulers on each of them. For these studies, we tested the optimizer on CIFAR10 using the BinaryNet architecture. Also, we tested it on ImageNet 2012 with the XnorNet and BiRealNet architectures for accuracy. On both datasets, our approach converged faster, was robust to changes in the hyperparameters, and achieved better accuracy values.

Keywords: Training; Computer vision; Conferences; Computer architecture; Artificial neural networks; Pattern recognition; Optimization
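
The update described in the abstract above (a first raw moment normalized by the second raw moment, then compared with a threshold to decide a bit-flip) might look roughly like the following. This is a sketch of the biased variant only, written from the abstract's wording: the function name `bop2nd_step`, the parameter names `gamma`, `sigma`, `tau` and `eps`, their default values, and the rule that a flip also requires the signal to share the weight's sign are assumptions, not details confirmed by the record.

```python
import math


def bop2nd_step(weights, grads, m, v, gamma=0.5, sigma=0.25, tau=0.5, eps=1e-10):
    """One bit-flip update over lists of binary weights (+1.0 / -1.0).

    m and v hold running first and second raw moment estimates and are
    updated in place. All hyperparameter defaults are illustrative only.
    """
    for i, g in enumerate(grads):
        # Exponential moving averages of the gradient and squared gradient.
        m[i] = (1.0 - gamma) * m[i] + gamma * g
        v[i] = (1.0 - sigma) * v[i] + sigma * g * g
        # Normalize the first moment by the root of the second, as in Adam.
        signal = m[i] / (math.sqrt(v[i]) + eps)
        # Flip when the signal is strong enough and points the same way as
        # the weight (sign-agreement condition is an assumption here).
        if abs(signal) > tau and signal * weights[i] > 0:
            weights[i] = -weights[i]
    return weights, m, v
```

A bias-corrected variant, which the abstract also mentions, would rescale `m` and `v` before the comparison; it is omitted here because the record gives no details of the correction.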

Multi Model Ensemble for Compound Expression Recognition

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Jun Yu; Jichao Zhu; Wangyuan Zhu; Zhongpeng Cai; Gongpeng Zhao; Zhihong Wei; Guochen Xie; Zerui Zhang; Qingsong Liu; Jiaen Liang (University of Science and Technology of China; Unisound AI Technology Co. Ltd.)

ISBN (digital): 9798350365474

ISBN (print): 9798350365481

Compound Expression Recognition (CER) plays a crucial role in interpersonal interactions. Due to the complexity of human emotional expressions, which leads to the existence of compound expressions, it is necessary to consider both local and global facial expressions comprehensively for recognition. In this paper, to address this issue, we propose a solution for compound expression recognition based on ensemble learning methods. Specifically, our task is classification. We trained three expression classification models based on convolutional networks (ResNet50), Vision Transformers, and multi-scale local attention networks, respectively. Then, using late fusion, we integrate the outputs of the three models to predict the final result, leveraging the strengths of the different models. Our method achieves high accuracy on RAF-DB and, in the sixth Affective Behavior Analysis in-the-wild (ABAW) Challenge, achieves an F1 score of 0.224 on the test set of C-EXPR-DB.

Keywords: Computer vision; Emotion recognition; Face recognition; Predictive models; Transformers; Feature extraction; Compounds
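
Late fusion, as mentioned in the abstract above, combines the per-class outputs of separately trained models at prediction time. The record does not state the fusion rule, so the (optionally weighted) averaging of class probabilities below is an assumption, and `late_fusion` is a hypothetical helper name.

```python
def late_fusion(prob_lists, weights=None):
    """Fuse per-class probabilities from several models.

    prob_lists: one list of class probabilities per model.
    weights: optional per-model weights; defaults to a plain average.
    Returns (predicted class index, fused probabilities).
    """
    n_models = len(prob_lists)
    n_classes = len(prob_lists[0])
    if weights is None:
        weights = [1.0 / n_models] * n_models
    # Weighted sum of each class probability across the models.
    fused = [
        sum(w * probs[c] for w, probs in zip(weights, prob_lists))
        for c in range(n_classes)
    ]
    predicted = max(range(n_classes), key=fused.__getitem__)
    return predicted, fused
```

For instance, three models that individually favor different classes can still yield a clear fused decision when two of them agree on the runner-up.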

Detecting and Matching Related Objects with One Proposal Multiple Predictions

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liu, Yang; Hafemann, Luiz G.; Jamieson, Michael; Javan, Mehrsan (Sportlogiq, Montreal PQ Canada)

ISBN (print): 9781665448994

Tracking players in sports videos is commonly done in a tracking-by-detection framework, first detecting players in each frame, and then performing association over time. While for some sports tracking players is sufficient for game analysis, sports like hockey, tennis and polo may require additional detections that include the object the player is holding (e.g. racket, stick). The baseline solution for this problem involves detecting these objects as separate classes, and matching them to player detections based on the intersection over union (IoU). This approach, however, leads to poor matching performance in crowded situations, as it does not model the relationship between players and objects. In this paper, we propose a simple yet efficient way to detect and match players and related objects at once without extra cost, by considering an implicit association for prediction of multiple objects through the same proposal box. We evaluate the method on a dataset of broadcast ice hockey videos, and also a new public dataset we introduce called COCO +Torso. On the ice hockey dataset, the proposed method boosts matching performance from 57.1% to 81.4%, while also improving the mean AP of player+stick detections from 68.4% to 88.3%. On the COCO +Torso dataset, we see matching improving from 47.9% to 65.2%. The COCO +Torso dataset, code and pre-trained models will be released at https://***/foreverYoungGitHub/detectand-match-related-objects.

Keywords: Computer vision; Conferences; Games; Detectors; Ice; Proposals; Pattern matching
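
The baseline that the abstract above argues against, matching separately detected objects to players by intersection over union, can be sketched as follows. The box format `(x1, y1, x2, y2)`, the greedy highest-IoU assignment, and the names `iou` and `match_objects_to_players` are assumptions for illustration; the record does not describe the baseline in more detail.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def match_objects_to_players(players, objects, min_iou=0.0):
    """Assign each object box to the player box it overlaps most.

    Returns a dict mapping object index to player index, or to None when
    no player overlaps the object by more than min_iou.
    """
    matches = {}
    for oi, obj in enumerate(objects):
        best, best_iou = None, min_iou
        for pi, player in enumerate(players):
            score = iou(player, obj)
            if score > best_iou:
                best, best_iou = pi, score
        matches[oi] = best
    return matches
```

Because each object simply goes to the player with the largest overlap, nearby players in a crowded scene can easily claim the wrong stick or racket, which is the failure mode the abstract describes.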

A Cortically-inspired Architecture for Event-based Visual Motion Processing: From Design Principles to Real-world Applications

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Peveri, Francesca; Testa, Simone; Sabatini, Silvio P. (Univ Genoa, Dept Informat Bioengn Robot & Syst Engn, Via Opera Pia 11a, Genoa, Italy)

ISBN (print): 9781665448994

We developed and tested the architecture of a bio-inspired Spiking Neural Network for motion estimation. The computation performed by the retina is emulated by the neuromorphic event-based image sensor DAVIS346 which constitutes the input of our network. We obtained neurons highly tuned to spatial frequency and orientation of the stimulus through a combination of feed-forward excitatory connections modeled as an elongated Gaussian kernel and recurrent inhibitory connections from two clusters of neurons within the same cortical layers. Sums over adjacent nodes weighted by time-variable synapses are used to attain Gabor-like spatio-temporal V1 receptive fields with selectivity to the stimulus' motion. In order to gain the invariance to the stimulus phase, the two polarities of the events provided by the neuromorphic sensor were exploited, which allowed us to build two pairs of quadrature filters from which we obtain Motion Energy detectors as described in [2]. Finally, a decoding stage allows us to compute optic flow from the Motion Detector layers. We tested the proposed approach with both synthetic and natural stimuli.

Keywords: Visualization; Neuromorphics; Neurons; Neural networks; Computer architecture; Detectors; Spatial filters

Explainable Deep Classification Models for Domain Generalization

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Zunino, Andrea; Bargal, Sarah Adel; Volpi, Riccardo; Sameki, Mehrnoosh; Zhang, Jianming; Sclaroff, Stan; Murino, Vittorio; Saenko, Kate (Huawei Ireland Res Ctr, Dublin, Ireland; Boston Univ, Dept Comp Sci, 111 Cummington St, Boston MA 02215 USA; Naver Labs Europe, Meylan, France; Microsoft, Redmond WA USA; Adobe Res, San Jose CA USA; Ist Italiano Tecnol, Pattern Anal & Comp Vis, Genoa, Italy; Univ Verona, Verona, Italy)

ISBN (print): 9781665448994

Conventionally, AI models are thought to trade off explainability for lower accuracy. We develop a training strategy that not only leads to a more explainable AI system for object classification, but as a consequence, suffers no perceptible accuracy degradation. Explanations are defined as regions of visual evidence upon which a deep classification network makes a decision. This is represented in the form of a saliency map conveying how much each pixel contributed to the network's decision. Our training strategy enforces a periodic saliency-based feedback to encourage the model to focus on the image regions that directly correspond to the ground-truth object. We quantify explainability using an automated metric, and using human judgement. We propose explainability as a means for bridging the visual-semantic gap between different domains where model explanations are used as a means of disentangling domain-specific information from otherwise relevant features. We demonstrate that this leads to improved generalization to new domains without hindering performance on the original domain.

Keywords: Training; Degradation; Measurement; Visualization; Computer vision; Conferences; Computational modeling

EVSRNet: Efficient Video Super-Resolution with Neural Architecture Search

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liu, Shaoli; Zheng, Chengjian; Lu, Kaidi; Gao, Si; Wang, Ning; Wang, Bofei; Zhang, Diankai; Zhang, Xiaofeng; Xu, Tianyu (ZTE Corp, State Key Lab Mobile Network & Mobile Multimedia, Shenzhen, Peoples R China)

ISBN (print): 9781665448994

With the development of convolutional neural networks (CNNs), the super-resolution results of CNN-based methods have far surpassed those of traditional methods. In particular, the CNN-based single image super-resolution method has achieved excellent results. Video sequences contain more abundant information compared with images, but there are few video super-resolution methods that can be applied to mobile devices due to the requirement of heavy computation, which limits the application of video super-resolution. In this work, we propose the Efficient Video Super-Resolution Network (EVSRNet) with neural architecture search for real-time video super-resolution. Extensive experiments show that our method achieves a good balance between quality and efficiency. Finally, we achieve a competitive result of 7.36 where the PSNR is 27.85 dB and the inference time is 11.3 ms/f on the target Snapdragon 865 SoC, resulting in 2nd place in the Mobile AI (MAI) 2021 real-time video super-resolution challenge. Notably, our method is the fastest and significantly outperforms other competitors by large margins.

Keywords: Visualization; Superresolution; Video sequences; Neural networks; Computer architecture; Streaming media; Real-time systems

Combining Weight Pruning and Knowledge Distillation For CNN Compression

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Aghli, Nima; Ribeiro, Eraldo (Florida Inst Technol, 150 W Univ Blvd, Melbourne FL 32901 USA)

ISBN (print): 9781665448994

Complex deep convolutional neural networks such as ResNet require expensive hardware such as powerful GPUs to achieve real-time performance. This problem is critical for applications that run on low-end embedded GPU or CPU systems with limited resources. As a result, model compression for deep neural networks becomes an important research topic. Popular compression methods such as weight pruning remove redundant neurons from the CNN without affecting the network's output accuracy. While these pruning methods work well on simple networks such as VGG or AlexNet, they are not suitable for compressing current state-of-the-art networks such as ResNets because of these networks' complex architectures with dimensionality dependencies. This dependency results in filter pruning breaking the structure of ResNets leading to an untrainable network. In this paper, we first use the weight pruning method only on a selective number of layers in the ResNet architecture to avoid breaking the network structure. Second, we introduce a knowledge distillation architecture and a loss function to compress the untouched layers during the pruning. We test our method on both image-based regression and classification networks for head-pose estimation and image classification. Our compression method reduces the models' size significantly while maintaining the accuracy very close to the baseline model.

Keywords: Image coding; Neurons; Estimation; Graphics processing units; Computer architecture; Real-time systems; Convolutional neural networks
