Search Results - Inner Mongolia University Library

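Document type

99 records: Conference papers
89 records: Journal articles

Collection scope

188 records: Electronic documents
0 titles: Print holdings

Discipline classification

130 records: Engineering
- 88 records: Computer Science and Technology...
- 83 records: Software Engineering
- 33 records: Information and Communication Engineering
- 27 records: Bioengineering
- 21 records: Optical Engineering
- 16 records: Mechanical Engineering
- 9 records: Control Science and Engineering
- 8 records: Biomedical Engineering (degree may be awarded in...
- 6 records: Chemical Engineering and Technology
- 4 records: Instrument Science and Technology
- 3 records: Electrical Engineering
- 3 records: Electronic Science and Technology (may...
- 3 records: Architecture
- 2 records: Materials Science and Engineering (may...
- 2 records: Civil Engineering
- 2 records: Transportation Engineering
- 2 records: Safety Science and Engineering
- 1 record: Mechanics (degree may be awarded in Engineering, ...
- 1 record: Metallurgical Engineering
- 1 record: Power Engineering and Engineering Thermo...
85 records: Science
- 43 records: Physics
- 29 records: Mathematics
- 27 records: Biology
- 7 records: Statistics (degree may be awarded in Science, ...
- 6 records: Chemistry
32 records: Management
- 22 records: Library, Information and Archives Manage...
- 12 records: Management Science and Engineering (may...
4 records: Law
- 4 records: Sociology
2 records: Medicine
- 2 records: Clinical Medicine
2 records: Art
- 2 records: Design (degree may be awarded in Art...
1 record: Agriculture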

Topic

13 records: convolution
12 records: feature extracti...
10 records: image edge detec...
10 records: image reconstruc...
9 records: semantics
8 records: image segmentati...
8 records: computer vision
6 records: three-dimensiona...
6 records: pixels
6 records: training
5 records: generative adver...
5 records: writing
5 records: face
5 records: image color anal...
4 records: distillation
4 records: face recognition
4 records: optical resolvin...
4 records: text recognition
4 records: biological syste...
4 records: mathematical mod...

Institution

40 records: university of ch...
40 records: shenzhen key lab...
31 records: national key lab...
28 records: computer vision ...
26 records: shenzhen key lab...
22 records: faculty of compu...
21 records: siat branch shen...
19 records: shanghai ai labo...
16 records: sensetime resear...
16 records: shenzhen key lab...
14 records: xiamen key labor...
11 records: shanghai artific...
10 records: department of co...
8 records: shanghai ai lab
8 records: the chinese univ...
7 records: department of st...
7 records: the university o...
6 records: shanghai jiao to...
6 records: shenzhen key lab...
6 records: fujian key labor...

Author

59 records: qiao yu
27 records: dong chao
26 records: yu qiao
17 records: wang yali
17 records: pal umapada
17 records: lu tong
16 records: umapada pal
16 records: tong lu
16 records: palaiahnakote sh...
15 records: shivakumara pala...
11 records: chao dong
10 records: he junjun
9 records: chen xiangyu
9 records: gu jinjin
9 records: peng xiaojiang
8 records: chen shifeng
8 records: ren jimmy s.
7 records: blumenstein mich...
7 records: zhou zhipeng
7 records: liu yihao

Language

186 records: English
2 records: Other

Search condition: "Institution=Xiamen Key Lab of Computer Vision and Pattern Recognition"

188 records in total; records 41-50 are shown below

Conditional Sequential Modulation for Efficient Global Image Retouching

16th European Conference on Computer Vision, ECCV 2020

Authors: He, Jingwen; Liu, Yihao; Qiao, Yu; Dong, Chao

Affiliations: ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Beijing, China; SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society, Shenzhen, China; University of Chinese Academy of Sciences, Beijing, China

ISBN: (print) 9783030586003

Photo retouching aims to enhance the aesthetic visual quality of images that suffer from photographic defects such as over- or under-exposure, poor contrast, and inharmonious saturation. In practice, photo retouching can be accomplished by a series of image processing operations. In this paper, we investigate some commonly used retouching operations and show mathematically that these pixel-independent operations can be approximated or formulated by multi-layer perceptrons (MLPs). Based on this analysis, we propose an extremely lightweight framework, Conditional Sequential Retouching Network (CSRNet), for efficient global image retouching. CSRNet consists of a base network and a condition network. The base network acts like an MLP that processes each pixel independently, and the condition network extracts the global features of the input image to generate a condition vector. To realize retouching operations, we modulate the intermediate features using Global Feature Modulation (GFM), whose parameters are transformed from the condition vector. Thanks to its use of 1 × 1 convolutions, CSRNet contains fewer than 37k trainable parameters, which is orders of magnitude smaller than existing learning-based methods. Extensive experiments show that our method achieves state-of-the-art performance on the benchmark MIT-Adobe FiveK dataset, both quantitatively and qualitatively. Code is available at https://***/hejingwenhejingwen/CSRNet. © 2020, Springer Nature Switzerland AG.

Keywords: Pixels
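
The abstract above describes a base network that processes each pixel independently and a modulation step driven by a condition vector. The following is a minimal sketch of that idea, not the CSRNet code: it assumes the modulation takes a per-channel scale-and-shift form (the abstract does not state the form), and every function name and parameter value is hypothetical. In the actual method the scale and shift would be produced by the condition network; here they are fixed by hand.

```python
# Illustrative sketch only; all names and values are hypothetical.

def pixelwise_linear(pixel, weights, bias):
    """Apply one linear map to a single pixel (what a 1 x 1 convolution does)."""
    return [sum(w * x for w, x in zip(row, pixel)) + b
            for row, b in zip(weights, bias)]


def modulate(features, scale, shift):
    """Assumed modulation form: per-channel scale and shift."""
    return [s * f + t for f, s, t in zip(features, scale, shift)]


def retouch(image, weights, bias, scale, shift):
    """Process every pixel independently with the same parameters."""
    return [[modulate(pixelwise_linear(p, weights, bias), scale, shift)
             for p in row]
            for row in image]


identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
zero = [0.0, 0.0, 0.0]
image = [[[0.25, 0.5, 0.75], [0.0, 0.5, 1.0]]]  # a 1 x 2 RGB image

# A global brightness gain of 2, expressed purely through the modulation.
brighter = retouch(image, identity, zero, scale=[2.0, 2.0, 2.0], shift=zero)
```

Because no pixel looks at its neighbours, the parameter count depends only on the number of channels, which is consistent with the small model size the abstract reports.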

Adaptive Pyramid Context Network for Semantic Segmentation

IEEE/CVF Conference on Computer Vision and Pattern Recognition

Authors: Junjun He; Zhongying Deng; Lei Zhou; Yali Wang; Yu Qiao

Affiliations: Shenzhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences

ISBN: (print) 9781728132945

Recent studies have shown that context features can significantly improve the performance of deep semantic segmentation networks. Current context-based segmentation methods differ from each other in how they construct context features, and they perform differently in practice. This paper first introduces three desirable properties of context features in the segmentation task. In particular, we find that Global-guided Local Affinity (GLA) can play a vital role in constructing effective context features, although this property has been largely ignored in previous work. Based on this analysis, this paper proposes the Adaptive Pyramid Context Network (APCNet) for semantic segmentation. APCNet adaptively constructs multi-scale contextual representations with multiple well-designed Adaptive Context Modules (ACMs). Specifically, each ACM uses a global image representation as guidance to estimate the local affinity coefficients for each sub-region, and then calculates a context vector with these affinities. We empirically evaluate APCNet on three semantic segmentation and scene parsing datasets: PASCAL VOC 2012, Pascal-Context, and ADE20K. Experimental results show that APCNet achieves state-of-the-art performance on all three benchmarks and sets a new record of 84.2% on the PASCAL VOC 2012 test set without MS COCO pre-training or any post-processing.

Keywords: Semantics subregion image representation Pascal TEST SETS

New Moments Based Fuzzy Similarity Measure for Text Detection in Distorted Social Media Images

5th Asian Conference on Pattern Recognition, ACPR 2019

Authors: Roy, Soumyadip; Shivakumara, Palaiahnakote; Pal, Umapada; Lu, Tong; Blumenstein, Michael

Affiliations: Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India; Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia; National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China; Faculty of Engineering and Information Technology, University of Technology Sydney, Australia

ISBN: (print) 9783030414030

Capturing images with a cellphone and sharing them on social media has become part and parcel of day-to-day human activity. When an image is forwarded several times on social media, it may become heavily distorted as it passes through different devices. This work deals with text detection in such distorted images. We consider images that pass through three mobile devices on WhatsApp, which results in four images (including the original image). Unlike existing methods that aim at developing new approaches, we use the results detected by existing ones to improve performance. The proposed method extracts Hu moments and fuzzy-logic features from the detected texts of the images. The similarity between the text detection results given by three existing text detection methods is studied to determine the best pair of texts. The same similarity estimation is then used in a novel way to remove extra background or non-text regions and to restore missing text information. Experimental results on our own dataset and on benchmark datasets of natural scene images, namely MSRA-TD500, ICDAR2017-MLT, Total-Text, CTW1500 and COCO, show that the proposed method outperforms existing methods. © Springer Nature Switzerland AG 2020.

Keywords: Fuzzy logic
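
The abstract above relies on Hu moments as shape descriptors. As a rough illustration of why they suit comparing the same text region across differently framed detections, the sketch below computes the first two Hu invariants in plain Python and applies them to a shape and a translated copy of it. It covers only the standard moment formulas; the fuzzy similarity measure of the paper is not reproduced, and the function name and the toy shapes are hypothetical.

```python
def hu_first_two(image):
    """Return the first two Hu moment invariants of a 2D list of intensities."""
    m00 = m10 = m01 = 0.0
    for y, row in enumerate(image):
        for x, v in enumerate(row):
            m00 += v
            m10 += x * v
            m01 += y * v
    if m00 == 0:
        raise ValueError("image has no mass")
    cx, cy = m10 / m00, m01 / m00  # centroid

    # Second-order central moments (computed relative to the centroid).
    mu20 = mu02 = mu11 = 0.0
    for y, row in enumerate(image):
        for x, v in enumerate(row):
            mu20 += (x - cx) ** 2 * v
            mu02 += (y - cy) ** 2 * v
            mu11 += (x - cx) * (y - cy) * v

    # Scale normalization: for order p + q = 2 the divisor is m00 ** 2.
    eta20, eta02, eta11 = mu20 / m00 ** 2, mu02 / m00 ** 2, mu11 / m00 ** 2
    phi1 = eta20 + eta02
    phi2 = (eta20 - eta02) ** 2 + 4 * eta11 ** 2
    return phi1, phi2


shape_a = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]
# The same shape moved one pixel right and two pixels down.
shape_b = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 1, 1, 1],
    [0, 0, 1, 0, 0],
]
hu_a = hu_first_two(shape_a)
hu_b = hu_first_two(shape_b)
```

Both calls yield the same pair of values up to floating-point rounding, since central moments do not depend on where the shape sits in the image.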

Anomaly Handwritten Text Detection for Automatic Descriptive Answer Evaluation

11th International Conference on Computing and Pattern Recognition, ICCPR 2022

Authors: Chatterjee, Nilanjana; Shivakumara, Palaiahnakote; Pal, Umapada; Lu, Tong; Lu, Yue

Affiliations: Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, India; Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia; National Key Lab for Novel Software Technology, Nanjing University, Nanjing, China; Shanghai Key Laboratory of Multidimensional Information Processing, East China Normal University, Shanghai, China

ISBN: (print) 9781450397056

Although there are advanced technologies for character recognition, automatic descriptive answer evaluation remains an open challenge for the document image analysis community because of the great diversity of handwritten text and of answers to a question. This paper presents a novel method for detecting anomalous handwritten text in students' written responses to questions. The method is based on the observation that students who are confident in their answers usually write legibly and neatly, whereas students who are not confident write sloppily, which may make the text hard for the reader to understand. To detect such anomalous handwritten text, we explore a new combination of the Fourier transform and a deep learning model for detecting edges. The result preserves the structure of the handwritten text. To extract features for classifying anomalous and normal text, the proposed method studies the behavior of the writing style, especially the variation at ascenders and descenders. To this end, the proposed work draws the principal axis of the edge image, which is invariant to rotation and scaling and, to some extent, to distortion. With respect to the principal axis, the proposed method draws the medial axis using the uppermost and lowermost points. The distances between the points of the medial axis and the principal axis are used as the feature vector. The feature vector is then passed to an artificial neural network to classify anomalous text. The proposed method is evaluated on our own dataset, a standard dataset for gender identification (IAM), and a handwritten forgery detection dataset (ACPR 2019). The results on the different datasets show that the proposed work outperforms existing methods. © 2022 ACM.

Keywords: Students

UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning

10th International Conference on Learning Representations, ICLR 2022

Authors: Li, Kunchang; Wang, Yali; Gao, Peng; Song, Guanglu; Liu, Yu; Li, Hongsheng; Qiao, Yu

Affiliations: ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, China; University of Chinese Academy of Sciences, China; Shanghai AI Laboratory, Shanghai, China; SenseTime Research; The Chinese University of Hong Kong, Hong Kong

It is a challenging task to learn rich and multi-scale spatiotemporal semantics from high-dimensional videos, due to large local redundancy and complex global dependency between video frames. Recent advances in this research have been mainly driven by 3D convolutional neural networks and vision transformers. Although 3D convolution can efficiently aggregate local context to suppress local redundancy from a small 3D neighborhood, it lacks the capability to capture global dependency because of its limited receptive field. Alternatively, vision transformers can effectively capture long-range dependency through the self-attention mechanism, but they are limited in reducing local redundancy because they blindly compare similarity among all the tokens in each layer. Based on these observations, we propose a novel Unified transFormer (UniFormer), which seamlessly integrates the merits of 3D convolution and spatiotemporal self-attention in a concise transformer format and achieves a preferable balance between computation and accuracy. Unlike traditional transformers, our relation aggregator can tackle both spatiotemporal redundancy and dependency by learning local and global token affinity in shallow and deep layers, respectively. We conduct extensive experiments on popular video benchmarks, e.g., Kinetics-400, Kinetics-600, and Something-Something V1&V2. With only ImageNet-1K pretraining, our UniFormer achieves 82.9%/84.8% top-1 accuracy on Kinetics-400/Kinetics-600, while requiring 10× fewer GFLOPs than other state-of-the-art methods. For Something-Something V1 and V2, our UniFormer achieves new state-of-the-art performances of 60.9% and 71.2% top-1 accuracy, respectively. Code is available at https://***/Sense-X/UniFormer. © 2022 ICLR 2022 - 10th International Conference on Learning Representations. All rights reserved.

Keywords: Redundancy

Robust text line detection in equipment nameplate images

2019 IEEE International Conference on Robotics and Biomimetics, ROBIO 2019

Authors: Lai, Jiangyu; Guo, Lanqing; Qiao, Yu; Chen, Xiaolong; Zhang, Zhengfu; Liu, Canping; Li, Ying; Fu, Bin

Affiliations: Guangzhou Power Supply Bureau Co. Ltd., Guangzhou, China; ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China; SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society, China

ISBN: (print) 9781728163215

Scene text detection for equipment nameplates in the wild is important for equipment inspection robots, since it enables a robot to take specific actions for different equipment. Although text detection in images has made great progress in recent years, detection for equipment nameplates faces several challenges, such as extreme illumination and distortion, which significantly decrease detection performance. In this paper, we propose a deep text detection model, Robust Text Line Detection (RTLD), for locating word-level text instances in equipment cards. Specifically, the proposed model first employs a corner detection module to determine the four corner points of each nameplate, and then a carefully designed image transformation module transforms the irregular nameplate region into a rectangular region. Finally, a text detection module locates every word-level text instance in the transformed images. We conduct extensive experiments to examine our proposed method on real equipment nameplate images. Our model achieves 91.2% precision and 92.6% recall on the Equipment Nameplate Dataset. The experimental results demonstrate the effectiveness of our model. © 2019 IEEE.

Keywords: Nameplates

Modulating Image Restoration with Continual Levels via Adaptive Feature Modification Layers

IEEE/CVF Conference on Computer Vision and Pattern Recognition

Authors: Jingwen He; Chao Dong; Yu Qiao

Affiliations: ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences

ISBN: (print) 9781728132945

In image restoration tasks such as denoising and super-resolution, continual modulation of restoration levels is of great importance for real-world applications, but most existing deep learning based image restoration methods fail to provide it. Because they learn from discrete and fixed restoration levels, deep models cannot be easily generalized to data with continuous and unseen levels. This topic is rarely touched on in the literature, owing to the difficulty of modulating well-trained models with certain hyper-parameters. We take a step forward by proposing a unified CNN framework that has few additional parameters compared with a single-level model yet can handle arbitrary restoration levels between a start and an end level. The additional module, namely the AdaFM layer, performs channel-wise feature modification and can adapt a model to another restoration level with high accuracy. By simply tweaking an interpolation coefficient, the intermediate model, AdaFM-Net, can generate smooth and continuous restoration effects without artifacts. Extensive experiments on three image restoration tasks demonstrate the effectiveness of both model training and modulation testing. In addition, we carefully investigate the properties of AdaFM layers, providing detailed guidance on the usage of the proposed method.

Keywords: image restoration technique restoration Noise reduction
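
The abstract above describes a channel-wise modification whose strength is set by an interpolation coefficient. The sketch below illustrates that mechanism under loud simplifying assumptions: each channel is modified by a single scalar gain and bias, and the interpolation between the unmodified start level and the fully adapted end level is linear. Neither assumption is confirmed by the abstract, and the function name and numbers are hypothetical.

```python
def adafm(features, gains, biases, coeff):
    """Channel-wise feature modification with an interpolation coefficient.

    coeff = 0 leaves the features unchanged (start level);
    coeff = 1 applies the full modification (end level).
    """
    if not 0.0 <= coeff <= 1.0:
        raise ValueError("coeff must lie in [0, 1]")
    out = []
    for f, g, b in zip(features, gains, biases):
        g_interp = 1.0 + coeff * (g - 1.0)  # between identity gain and g
        b_interp = coeff * b                # between zero bias and b
        out.append(g_interp * f + b_interp)
    return out


features = [1.0, 2.0, 4.0]   # one value per channel, for brevity
gains = [0.5, 1.5, 1.0]      # hypothetical learned gains
biases = [0.0, 1.0, -2.0]    # hypothetical learned biases

start = adafm(features, gains, biases, 0.0)
middle = adafm(features, gains, biases, 0.5)
end = adafm(features, gains, biases, 1.0)
```

Sweeping `coeff` between 0 and 1 moves the output continuously from the start-level behaviour to the end-level behaviour without retraining, which is the property the abstract emphasizes.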

Tensor Low-Rank Reconstruction for Semantic Segmentation

16th European Conference on Computer Vision, ECCV 2020

Authors: Chen, Wanli; Zhu, Xinge; Sun, Ruoqi; He, Junjun; Li, Ruiyu; Shen, Xiaoyong; Yu, Bei

Affiliations: The Chinese University of Hong Kong, New Territories, Hong Kong; Shanghai Jiao Tong University, Shanghai, China; ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Beijing, China; SmartMore, Shenzhen, China

ISBN: (digital) 9783030585204

ISBN: (print) 9783030585198

Context information plays an indispensable role in the success of semantic segmentation. Recently, non-local self-attention based methods have proved effective for collecting context information. Since the desired context consists of spatial-wise and channel-wise attentions, a 3D representation is an appropriate formulation. However, these non-local methods describe 3D context information based on a 2D similarity matrix, where space compression may cause channel-wise attention to be lost. An alternative is to model the contextual information directly without compression. However, this effort confronts a fundamental difficulty, namely the high-rank property of context information. In this paper, we propose a new approach to model the 3D context representations, which not only avoids the space compression but also tackles the high-rank difficulty. Inspired by tensor canonical-polyadic decomposition theory (i.e., a high-rank tensor can be expressed as a combination of rank-1 tensors), we design a low-rank-to-high-rank context reconstruction framework (i.e., RecoNet). Specifically, we first introduce the tensor generation module (TGM), which generates a number of rank-1 tensors to capture fragments of the context feature. We then use these rank-1 tensors to recover the high-rank context features through our proposed tensor reconstruction module (TRM). Extensive experiments show that our method achieves state-of-the-art results on various public datasets. Additionally, our proposed method has more than 100 times less computational cost than conventional non-local-based methods. © 2020, Springer Nature Switzerland AG.

Keywords: Tensors
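
The abstract above builds on the canonical-polyadic idea that a higher-rank tensor can be written as a combination of rank-1 tensors. The sketch below shows only that arithmetic in plain Python: each rank-1 term is the outer product of a channel vector, a height vector and a width vector, and the terms are summed with weights. It is not the TGM or TRM of the paper; the function names, vectors and weights are hypothetical.

```python
def rank1(c_vec, h_vec, w_vec):
    """Outer product of three vectors: a rank-1 tensor of shape C x H x W."""
    return [[[c * h * w for w in w_vec] for h in h_vec] for c in c_vec]


def reconstruct(triples, weights):
    """Weighted sum of rank-1 tensors built from (channel, height, width) vectors."""
    c_len, h_len, w_len = (len(v) for v in triples[0])
    total = [[[0.0] * w_len for _ in range(h_len)] for _ in range(c_len)]
    for (c_vec, h_vec, w_vec), lam in zip(triples, weights):
        term = rank1(c_vec, h_vec, w_vec)
        for i in range(c_len):
            for j in range(h_len):
                for k in range(w_len):
                    total[i][j][k] += lam * term[i][j][k]
    return total


triples = [
    ([1.0, 0.0], [1.0, 2.0], [1.0, 0.0, 1.0]),
    ([0.0, 1.0], [1.0, 1.0], [0.0, 2.0, 0.0]),
]
context = reconstruct(triples, weights=[1.0, 0.5])
```

Each rank-1 term is described by C + H + W numbers rather than C × H × W, which is where the savings of a factored representation come from at realistic feature-map sizes.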

MetaCleaner: Learning to Hallucinate Clean Representations for Noisy-labeled Visual Recognition

IEEE/CVF Conference on Computer Vision and Pattern Recognition

Authors: Weihe Zhang; Yali Wang; Yu Qiao

Affiliations: Shenzhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences

ISBN: (print) 9781728132945

Deep Neural Networks (DNNs) have achieved remarkable success in large-scale visual recognition. However, they often suffer from overfitting under noisy labels. To alleviate this problem, we propose a conceptually simple but effective MetaCleaner, which can learn to hallucinate a clean representation of an object category from a small noisy subset of the same category. Specifically, MetaCleaner consists of two flexible submodules. The first submodule, namely Noisy Weighting, estimates the confidence scores of all the images in the noisy subset by analyzing their deep features jointly. The second submodule, namely Clean Hallucinating, generates a clean representation from the noisy subset by summarizing the noisy images with their confidence scores. Via MetaCleaner, DNNs can strengthen their robustness to noisy labels and enhance their generalization capacity with richer data diversity. Moreover, MetaCleaner can be easily integrated into the standard training procedure of DNNs, which promotes its value for real-life applications. We conduct extensive experiments on two popular benchmarks in noisy-labeled recognition, i.e., Food-101N and Clothing1M. On both datasets, MetaCleaner significantly outperforms the baselines and achieves state-of-the-art performance.

Keywords: Noise Hallucinations confidence training procedure submodule recognition (Psychology) Dataset
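
The two steps named in the abstract above, scoring each sample in a noisy subset and then summarizing the subset with those scores, can be sketched as a confidence-weighted average. In the paper the weighting is learned; the scoring rule below (agreement of each feature with the subset mean, passed through a softmax) is a hypothetical stand-in chosen only to make the example runnable, and all names and numbers are illustrative.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def confidence_scores(features):
    """Hypothetical scoring: dot product of each feature with the subset mean."""
    dim = len(features[0])
    mean = [sum(f[d] for f in features) / len(features) for d in range(dim)]
    return softmax([sum(a * b for a, b in zip(f, mean)) for f in features])


def clean_representation(features):
    """Summarize the subset as a confidence-weighted sum of its features."""
    weights = confidence_scores(features)
    dim = len(features[0])
    rep = [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]
    return rep, weights


# Three similar samples and one outlier standing in for a mislabeled image.
subset = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.1], [-1.0, 1.0]]
rep, weights = clean_representation(subset)
```

The outlier receives the smallest weight, so the summary stays closer to the three consistent samples than a plain mean would.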

Deformation Robust Text Spotting with Geometric Prior

IEEE International Conference on Image Processing

Authors: Xixuan Hao; Aozhong Zhang; Xianze Meng; Bin Fu

Affiliations: ShenZhen Key Lab of Computer Vision and Pattern Recognition, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences; The University of Hong Kong

The goal of text spotting is to perform text detection and recognition simultaneously. Although the diversity of luminosity and orientation in scene texts has been widely studied, the font diversity and shape variance of the same character have been ignored in recent works, since most characters in natural images are rendered in standard fonts. To solve this problem, we present a Chinese artistic dataset, termed ARText, which contains 33,000 artistic images with rich shape deformation and font diversity. Based on this dataset, we develop a deformation-robust text spotting method (DR TextSpotter) to solve the problem of recognizing characters with complex deformation in different fonts. Specifically, we propose a geometric prior module to highlight the important features based on an unsupervised landmark detection sub-network. A graph convolution network is further constructed to fuse the character features and landmark features, and then performs semantic reasoning to enhance the discrimination between different characters. Experiments are conducted on the ARText and IC19-ReCTS datasets. Our results demonstrate the effectiveness of our proposed method. The datasets and models will become publicly available after publication.
