Search Results - Inner Mongolia University Library

Search query: "Any field = IEEE Conference on Computer Vision and Pattern Recognition"

53,104 records in total; showing records 4981-4990

Expression Transfer Using Flow-based Generative Models

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Valenzuela, Andrea; Segura, Carlos; Diego, Ferran; Gomez, Vicenc
Affiliations: Univ Pompeu Fabra, Barcelona, Catalonia, Spain; Tel Res, Barcelona, Catalonia, Spain

ISBN: (print) 9781665448994

Among the different deepfake generation techniques, flow-based methods appear as natural candidates. Due to the property of invertibility, flow-based methods eliminate the necessity of person-specific training and are able to reconstruct any input image almost perfectly to human perception. We present a method for deepfake generation based on facial expression transfer using flow-based generative models. Our approach relies on simple latent vector operations akin to the ones used for attribute manipulation, but for transferring expressions between identity source-target pairs. We show the feasibility of this approach using a pre-trained Glow model and small sets of source and target images, not necessarily considered during prior training. We also provide an evaluation pipeline of the generated images in terms of similarities between identities and Action Units encoding the expression to be transferred. Our results show that an efficient expression transfer is feasible by using the proposed approach setting up a first precedent in deepfake content creation, and its evaluation, independently of the training identities.

Keywords: Training; Computer vision; Image coding; Conferences; Computational modeling; Pipelines; Pattern recognition
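
The abstract above does not spell out its latent vector operations. As a rough illustration only, the sketch below shows the generic attribute-manipulation arithmetic that such flow-based work alludes to: a direction is taken as the difference between two mean latent codes and added to a target code. Everything here is an assumption rather than the authors' method: latents are treated as flat lists of floats already produced by some invertible encoder (not shown), and `mean_vector`, `expression_direction`, `transfer`, and the strength parameter `alpha` are hypothetical names.

```python
def mean_vector(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def expression_direction(z_with, z_without):
    """Difference of mean latents: codes showing the expression minus codes without it."""
    m_with = mean_vector(z_with)
    m_without = mean_vector(z_without)
    return [a - b for a, b in zip(m_with, m_without)]


def transfer(z_target, direction, alpha=1.0):
    """Shift a target latent along the direction; alpha is a hypothetical strength knob."""
    return [z + alpha * d for z, d in zip(z_target, direction)]


# Toy 2-D latents standing in for real encoder outputs (assumed, not from the paper).
direction = expression_direction(
    z_with=[[1.0, 2.0], [3.0, 4.0]],
    z_without=[[0.0, 0.0], [2.0, 2.0]],
)
shifted = transfer([10.0, 10.0], direction, alpha=0.5)
```

In a real pipeline the shifted latent would be passed back through the inverse of the flow to obtain an image; that step depends on the model and is omitted.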

Variational AutoEncoder for Reference based Image Super-Resolution

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liu, Zhi-Song; Siu, Wan-Chi; Wang, Li-Wen
Affiliations: Caritas Inst Higher Educ, Hong Kong, Peoples R China; Hong Kong Polytech Univ, Hong Kong, Peoples R China

ISBN: (print) 9781665448994

In this paper, we propose a novel reference based image super-resolution approach via Variational AutoEncoder (RefVAE). Existing state-of-the-art methods mainly focus on single image super-resolution which cannot perform well on large upsampling factors, e.g., 8x. We propose a reference based image super-resolution, for which any arbitrary image can act as a reference for super-resolution. Even using random map or low-resolution image itself the proposed RefVAE can transfer the knowledge from the reference to the super-resolved images. Depending upon different references, the proposed method can generate different versions of super-resolved images from a hidden super-resolution space. Besides using different datasets for some standard evaluations with PSNR and SSIM, we also took part in the NTIRE2021 SR Space challenge [21] and have provided results of the randomness evaluation of our approach. Compared to other state-of-the-art methods, our approach achieves higher diverse scores.

Keywords: Computer vision; Quantization (signal); Conferences; Computational modeling; Superresolution; Space exploration; Pattern recognition

Collaborative Image and Object Level Features for Image Colourisation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Pucci, Rita; Micheloni, Christian; Martinel, Niki
Affiliations: Univ Udine, Udine, Italy

ISBN: (print) 9781665448994

Image colourisation is an ill-posed problem, with multiple correct solutions which depend on the context and object instances present in the input datum. Previous approaches attacked the problem either by requiring intense user-interactions or by exploiting the ability of convolutional neural networks (CNNs) in learning image-level (context) features. However, obtaining human hints is not always feasible and CNNs alone are not able to learn entity-level semantics, unless multiple models pre-trained with supervision are considered. In this work, we propose a single network, named UCapsNet, that takes into consideration the image-level features obtained through convolutions and entity-level features captured by means of capsules. Then, by skip connections over different layers, we enforce collaboration between the convolutional and entity factors to produce a high-quality and plausible image colourisation. We pose the problem as a classification task that can be addressed by a fully unsupervised approach, and thus requires no human effort. Experimental results on three benchmark datasets show that our approach outperforms existing methods on standard quality metrics and achieves state-of-the-art performance on image colourisation. A large scale user study shows that our method is preferred over existing solutions. Code available at https://***/Riretta/Image_Colourisation_WiCV_2021.

Keywords: Convolutional codes; Computer vision; Semantics; Collaboration; Training data; Feature extraction; Pattern recognition

Dictionary-guided Scene Text Recognition

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Nguyen Nguyen; Thu Nguyen; Vinh Tran; Minh-Triet Tran; Thanh Duc Ngo; Thien Huu Nguyen; Minh Hoai
Affiliations: VinAI Res, Hanoi, Vietnam; VNU HCM Univ Informat Technol, Hanoi, Vietnam; VNU HCM Univ Sci, Hanoi, Vietnam; Vietnam Natl Univ, Ho Chi Minh City, Vietnam; Univ Oregon, Eugene, OR 97403, USA; SUNY Stony Brook, Stony Brook, NY 11794, USA

ISBN: (print) 9781665445092

Language prior plays an important role in the way humans detect and recognize text in the wild. Current scene text recognition methods do use lexicons to improve recognition performance, but their naive approach of casting the output into a dictionary word based purely on the edit distance has many limitations. In this paper, we present a novel approach to incorporate a dictionary in both the training and inference stage of a scene text recognition system. We use the dictionary to generate a list of possible outcomes and find the one that is most compatible with the visual appearance of the text. The proposed method leads to a robust scene text recognition model, which is better at handling ambiguous cases encountered in the wild, and improves the overall performance of state-of-the-art scene text spotting frameworks. Our work suggests that incorporating language prior is a potential approach to advance scene text detection and recognition methods. Besides, we contribute VinText, a challenging scene text dataset for Vietnamese, where some characters are equivocal in the visual form due to accent symbols. This dataset will serve as a challenging benchmark for measuring the applicability and robustness of scene text detection and recognition algorithms.

Keywords: Training; Visualization; Computer vision; Casting; Dictionaries; Codes; Text recognition
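
To make the baseline in the abstract above concrete, here is a minimal sketch of the naive lexicon step it criticises: snapping a raw prediction to the dictionary word with the smallest edit distance. With `k > 1` the same function yields a candidate list, which is roughly where the paper's approach starts; the visual-compatibility scoring that then picks among candidates is not shown. The names `edit_distance` and `nearest_words` and the toy dictionary are illustrative assumptions, not taken from the paper.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed with a rolling one-row table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def nearest_words(prediction: str, dictionary: list[str], k: int = 1) -> list[str]:
    """Return the k dictionary words closest to the prediction (ties broken alphabetically)."""
    ranked = sorted(dictionary, key=lambda w: (edit_distance(prediction, w), w))
    return ranked[:k]


toy_dictionary = ["house", "horse", "mouse"]
best = nearest_words("hcuse", toy_dictionary)             # naive cast to one word
candidates = nearest_words("hcuse", toy_dictionary, k=3)  # candidate list for re-ranking
```

Relying on distance alone ignores what the image actually shows, which is the limitation the abstract points to: two dictionary words at the same distance are indistinguishable without visual evidence.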

DISCO: Dynamic and Invariant Sensitive Channel Obfuscation for Deep Neural Networks

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Singh, Abhishek; Chopra, Ayush; Garza, Ethan; Zhang, Emily; Vepakomma, Praneeth; Sharma, Vivek; Raskar, Ramesh
Affiliations: MIT, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Harvard Med Sch, Boston, MA 02115, USA

ISBN: (print) 9781665445092

Recent deep learning models have shown remarkable performance in image classification. While these deep learning systems are getting closer to practical deployment, the common assumption made about data is that it does not carry any sensitive information. This assumption may not hold for many practical cases, especially in the domain where an individual's personal information is involved, like healthcare and facial recognition systems. We posit that selectively removing features in this latent space can protect the sensitive information and provide better privacy-utility trade-off. Consequently, we propose DISCO which learns a dynamic and data driven pruning filter to selectively obfuscate sensitive information in the feature space. We propose diverse attack schemes for sensitive inputs & attributes and demonstrate the effectiveness of DISCO against state-of-the-art methods through quantitative and qualitative evaluation. Finally, we also release an evaluation benchmark dataset of 1 million sensitive representations to encourage rigorous exploration of novel attack and defense schemes at https://***/splitlearning/InferenceBenchmark.

Keywords: Deep learning; Privacy; Computer vision; Face recognition; Collaboration; Medical services; Benchmark testing

Cross-modal Prominent Fragments Enhancement Aligning Network for Image-text Retrieval

IEEE International Conference on Multimedia and Expo (ICME)

Authors: Zhang, Yang; Zhou, Yue; Yang, Zonghao; Chen, Ao
Affiliations: Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai, Peoples R China

ISBN: (print) 9798350390155; 9798350390162

Image-text retrieval is a widely studied topic in the field of computer vision due to the exponential growth of multimedia data, whose core concept is to measure the similarity between images and text. However, most existing retrieval methods heavily rely on cross-attention mechanisms for cross-modal fine-grained alignment, which takes into account excessive irrelevant regions and treats prominent and non-significant words equally. This paper aims to investigate an alignment approach that reduces the involvement of non-significant fragments in images and text while enhancing the alignment of prominent fragments. For this purpose, we introduce the Cross-Modal Prominent Fragments Enhancement Aligning Network (CPFEAN). In practice, we first design a novel intra-modal fragments relationship reasoning method, and subsequently employ our proposed alignment mechanism to compute the similarity between images and text. Extensive quantitative comparative experiments on MS-COCO and Flickr30K datasets demonstrate that our approach outperforms state-of-the-art methods.

Keywords: Image-text retrieval; fine-grained alignment; cross-modal learning; prominent fragments enhancement

Group Leakage Overestimates Performance: A Case Study in Keystroke Dynamics

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Ayotte, Blaine; Banavar, Mahesh K.; Hou, Daqing; Schuckers, Stephanie
Affiliations: Clarkson Univ, Dept Elect & Comp Engn, 8 Clarkson Ave, Potsdam, NY 13699, USA

ISBN: (print) 9781665448994

Keystroke dynamics is a powerful behavioral biometric capable of user authentication based on typing patterns. As larger keystroke datasets become available, machine learning and deep learning algorithms are becoming popular. Knowledge of every possible impostor is not known during training which means that keystroke dynamics is an open set recognition problem. Treating open set recognition problems as closed set (assuming samples from all impostors are present) can cause models to incur data leakage, which can provide unrealistic overestimates of performance. It is a common problem in machine learning and can cause models to report higher accuracies than would be expected in the real world. In this paper, we outline open set recognition and discuss how, if not handled properly, it can lead to data leakage. The performance of common machine learning methods, such as SVM and MLP are investigated with and without leakage to clearly demonstrate the differences in performance. A synthetic dataset and a publicly available keystroke dynamics fixed-text dataset are used for research transparency and reproducibility.

Keywords: Training; Support vector machines; Deep learning; Computer vision; Machine learning algorithms; Heuristic algorithms; Conferences
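
The leakage described in the abstract above arises when samples from the same person land on both sides of a train/test split. One common way to avoid it, sketched here under stated assumptions, is to split by group (user) rather than by sample, so every held-out user is entirely unseen during training. The function name `group_split`, the `(user_id, features)` tuple layout, and the toy data are hypothetical; the paper's own protocol may differ.

```python
import random
from collections import defaultdict


def group_split(samples, test_fraction=0.25, seed=0):
    """Split (group_id, features) pairs so that no group appears on both sides."""
    by_group = defaultdict(list)
    for group_id, features in samples:
        by_group[group_id].append((group_id, features))

    groups = sorted(by_group)              # deterministic order before shuffling
    random.Random(seed).shuffle(groups)    # seeded shuffle for reproducibility
    n_test = max(1, round(len(groups) * test_fraction))

    test = [s for g in groups[:n_test] for s in by_group[g]]
    train = [s for g in groups[n_test:] for s in by_group[g]]
    return train, test


# Toy data: 8 users with 3 typing samples each (feature values are placeholders).
toy_samples = [(f"user{u}", [u, k]) for u in range(8) for k in range(3)]
train_set, test_set = group_split(toy_samples)
train_users = {g for g, _ in train_set}
test_users = {g for g, _ in test_set}
```

A plain random shuffle over `toy_samples` would almost always place some of each user's samples in both sets, which is the closed-set situation the abstract warns can inflate reported accuracy.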

Unsupervised Visual Representation Learning by Tracking Patches in Video

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wang, Guangting; Zhou, Yizhou; Luo, Chong; Xie, Wenxuan; Zeng, Wenjun; Xiong, Zhiwei
Affiliations: Univ Sci & Technol China, Hefei, Anhui, Peoples R China; Microsoft Res Asia, Beijing, Peoples R China

ISBN: (print) 9781665445092

Inspired by the fact that human eyes continue to develop tracking ability in early and middle childhood, we propose to use tracking as a proxy task for a computer vision system to learn the visual representations. Modelled on the Catch game played by the children, we design a Catch-the-Patch (CtP) game for a 3D-CNN model to learn visual representations that would help with video-related tasks. In the proposed pretraining framework, we cut an image patch from a given video and let it scale and move according to a pre-set trajectory. The proxy task is to estimate the position and size of the image patch in a sequence of video frames, given only the target bounding box in the first frame. We discover that using multiple image patches simultaneously brings clear benefits. We further increase the difficulty of the game by randomly making patches invisible. Extensive experiments on mainstream benchmarks demonstrate the superior performance of CtP against other video pretraining methods. In addition, CtP-pretrained features are less sensitive to domain gaps than those trained by a supervised action recognition task. When both trained on Kinetics-400, we are pleasantly surprised to find that CtP-pretrained representation achieves much higher action classification accuracy than its fully supervised counterpart on Something-Something dataset.

Keywords: Visualization; Computer vision; Computational modeling; Training data; Games; Trajectory; Pattern recognition

Research on Moving Object Real-time Recognition based on Deep Neural Network

2024 International Conference on Machine Learning, Pattern Recognition and Automation Engineering, MLPRAE 2024

Authors: Xia, Tongtong
Affiliations: Computer Science and Control Systems, Bauman Moscow State Technical University, Russia

ISBN: (print) 9798400709876

Object recognition represents a significant area of investigation within the field of computer vision, with applications spanning industrial detection, traffic supervision, remote sensing, biomedicine and numerous other domains. As information science and technology advance, the accuracy and speed of object recognition continue to improve. In this work, a real-time moving object recognition model based on an improved convolutional neural network (CNN) was proposed. By optimising the structure of the convolutional layer and the pooling layer, the model demonstrates enhanced capabilities for the detection and classification of moving objects within the video frame. A multi-scale feature extraction and attention mechanism is introduced to enhance the model's object recognition performance at different scales and dynamic backgrounds. The extraction of features at multiple scales enables the model to capture both large-scale and small-scale object features, thereby enhancing its ability to detect objects of varying sizes. The attention mechanism enables the model to allocate more precise attention to crucial features in the presence of intricate backgrounds and motion blurring, through the dynamic adjustment of the relative importance of each feature region. This enhances the resilience and precision of the recognition process. The combination of these two techniques enables the model to process a diverse range of complex scenarios in real-world applications with greater efficiency and accuracy. The experimental results demonstrate that the enhanced model exhibits superior performance compared to the established benchmark model on publicly available datasets, exhibiting enhanced accuracy and real-time processing capabilities. © 2024 ACM.

Keywords: Convolutional neural networks

Rotation Coordinate Descent for Fast Globally Optimal Rotation Averaging

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Parra, Alvaro; Chng, Shin-Fang; Chin, Tat-Jun; Eriksson, Anders; Reid, Ian
Affiliations: Univ Adelaide, Sch Comp Sci, Adelaide, SA, Australia; Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia

ISBN: (print) 9781665445092

Under mild conditions on the noise level of the measurements, rotation averaging satisfies strong duality, which enables global solutions to be obtained via semidefinite programming (SDP) relaxation. However, generic solvers for SDP are rather slow in practice, even on rotation averaging instances of moderate size, thus developing specialised algorithms is vital. In this paper, we present a fast algorithm that achieves global optimality called rotation coordinate descent (RCD). Unlike block coordinate descent (BCD) which solves SDP by updating the semidefinite matrix in a row-by-row fashion, RCD directly maintains and updates all valid rotations throughout the iterations. This obviates the need to store a large dense semidefinite matrix. We mathematically prove the convergence of our algorithm and empirically show its superior efficiency over state-of-the-art global methods on a variety of problem configurations. Maintaining valid rotations also facilitates incorporating local optimisation routines for further speed-ups. Moreover, our algorithm is simple to implement.

Keywords: Computer vision; Coordinate measuring machines; Convex functions; Pattern recognition; Noise measurement; Velocity measurement; Rotation measurement

235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
