检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

709 篇 会议
278 篇 期刊文献
14 册 图书

馆藏范围

1,001 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

592 篇 工学
- 381 篇 计算机科学与技术...
- 338 篇 软件工程
- 131 篇 信息与通信工程
- 99 篇 生物工程
- 97 篇 机械工程
- 97 篇 光学工程
- 75 篇 控制科学与工程
- 67 篇 生物医学工程（可授...
- 36 篇 仪器科学与技术
- 35 篇 化学工程与技术
- 33 篇 电气工程
- 25 篇 电子科学与技术（可...
- 17 篇 建筑学
- 16 篇 土木工程
- 12 篇 安全科学与工程
- 9 篇 力学（可授工学、理...
343 篇 理学
- 145 篇 数学
- 130 篇 物理学
- 106 篇 生物学
- 42 篇 统计学（可授理学、...
- 35 篇 化学
- 10 篇 系统科学
137 篇 管理学
- 93 篇 图书情报与档案管...
- 51 篇 管理科学与工程(可...
- 17 篇 工商管理
25 篇 医学
- 24 篇 临床医学
- 22 篇 基础医学(可授医学...
- 18 篇 药学(可授医学、理...
18 篇 艺术学
- 18 篇 设计学（可授艺术学...
16 篇 法学
- 16 篇 社会学
5 篇 经济学
3 篇 农学
2 篇 文学
1 篇 教育学

主题

125 篇 feature extracti...
113 篇 pattern recognit...
100 篇 computer vision
85 篇 image segmentati...
76 篇 training
71 篇 support vector m...
68 篇 handwriting reco...
68 篇 character recogn...
48 篇 shape
47 篇 optical characte...
41 篇 accuracy
37 篇 histograms
33 篇 databases
31 篇 testing
30 篇 cameras
30 篇 robustness
28 篇 image edge detec...
28 篇 writing
27 篇 hidden markov mo...
27 篇 kernel

机构

204 篇 computer vision ...
43 篇 computer vision ...
42 篇 university of ch...
40 篇 shenzhen key lab...
31 篇 national key lab...
30 篇 pattern analysis...
28 篇 faculty of compu...
26 篇 shenzhen key lab...
21 篇 siat branch shen...
19 篇 pattern analysis...
19 篇 shanghai ai labo...
18 篇 department of st...
17 篇 computer vision ...
16 篇 sensetime resear...
16 篇 computer vision ...
16 篇 shenzhen key lab...
14 篇 school of comput...
13 篇 pattern analysis...
12 篇 pattern analysis...
12 篇 computer vision ...

作者

113 篇 umapada pal
105 篇 pal umapada
59 篇 qiao yu
54 篇 vittorio murino
39 篇 b.b. chaudhuri
32 篇 michael blumenst...
32 篇 palaiahnakote sh...
31 篇 alessio del bue
30 篇 blumenstein mich...
28 篇 murino vittorio
28 篇 shivakumara pala...
27 篇 yu qiao
26 篇 dong chao
25 篇 chaudhuri b.b.
23 篇 u. pal
19 篇 liu xin
18 篇 lu tong
17 篇 wang yali
17 篇 tong lu
16 篇 chanda sukalpa

语言

978 篇 英文
19 篇 其他
4 篇 中文

检索条件"机构=Computer Vision and Pattern"

共 1001 条记录，以下是11-20 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Spatio-temporal Attention Graph Convolutions for Skeleton-based Action Recognition 23rd

Spatio-temporal Attention Graph Convolutions for Skeleton-b...

引用

22nd Scandinavian Conference on Image Analysis, SCIA 2023

作者： Le, Cuong Liu, Xin Computer Vision and Pattern Recognition Laboratory School of Engineering Science Lappeenranta-Lahti University of Technology LUT Lappeenranta Finland Computer Vision Laboratory Department of Electrical Engineering Linköping University Linköping Sweden

ISBN: (纸本)9783031314346

In skeleton-based action recognition, graph convolutional networks (GCN) have been applied to extract features based on the dynamic of the human body and the method has achieved excellent results recently. However, GCN-based techniques only focus on the spatial correlations between human joints and often overlook the temporal relationships. In an action sequence, the consecutive frames in a neighborhood contain similar poses and using only temporal convolutions for extracting local features limits the flow of useful information into the calculations. In many cases, the discriminative features can present in long-range time steps and it is important to also consider them in the calculations to create stronger representations. We propose an attentional graph convolutional network, which adapts self-attention mechanisms to respectively model the correlations between human joints and between every time steps for skeleton-based action recognition. On two common datasets, the NTU-RGB+D60 and the NTU-RGB+D120, the proposed method achieved competitive classification results compared to state-of-the-art methods. The project’s GitHub page: STA-GCN. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

Low Resource Degraded Quality Document Image Binarization - Domain Adaptation is the Way 13

Low Resource Degraded Quality Document Image Binarization - ...

引用

13th Indian Conference on computer vision, Graphics, and Image Processing, ICVGIP 2022

作者： Kundu, Ahana Bhattacharya, Ujjwal Computer Vision and Pattern Recognition Indian Statistical Institute West Bengal Kolkata India

ISBN: (纸本)9781450398237

Usually, image binarization plays a crucial role in automatic analysis of degraded documents from their captured images. However, this binarization task is often difficult due to a number of reasons including the high similarity between noisy background and faded foreground pixels. The study presented here is particularly focused on binarization of images of low-resource degraded quality documents based on a set of recently collected image samples of several rare, ancient and severely degraded quality printed documents of Bangla, the 2nd and 5th most popular script of India and the world respectively. This new collection of degraded document image samples will henceforth be referred as 'ISIDDI2' and it consists of 139 images of Bangla old document pages. Samples of 'ISIDDI', another existing database of degraded Bangla document image samples, have also been used in the present study. A novel deep architecture based on attention UNET++ with dilated convolution operation is proposed for this binarization task. The model is optimized using human vision perceptible distance reciprocal distortion (DRD) loss. Since the binarization ground truth of samples of both 'ISIDDI2' and 'ISIDDI' are not available, the proposed network has been trained using samples of DIBCO and H-DIBCO datasets and an unsupervised domain adaptation (DA) module is employed for adaptation of the proposed architecture to the degradation patterns of 'ISIDDI2' or 'ISIDDI' samples. The proposed binarization strategy includes certain post-processing operation based on a modified k-neighbourhood based approach for recovery of broken characters. Results of our extensive experimentation show that the proposed binarization strategy has improved the binarization output of state-of-the-art methods on both ISIDDI2 and ISIDDI datasets. Also, its performance on well-known DIBCO samples is satisfactory. © 2022 ACM.

关键词： Network architecture

来源：评论

学校读者我要写书评

暂无评论

Bootstrap Diffusion Model Curve Estimation for High Resolution Low-Light Image Enhancement 20th

Bootstrap Diffusion Model Curve Estimation for High Resolut...

引用

20th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2023

作者： Huang, Jiancheng Liu, Yifan Chen, Shifeng ShenZhen Key Lab of Computer Vision and Pattern Recognition Shenzhen Institute of Advanced Technology Chinese Academy of Sciences Shenzhen China University of Chinese Academy of Sciences Beijing China

ISBN: (纸本)9789819970247

Learning-based methods have attracted a lot of research attention and led to significant improvements in low-light image enhancement. However, most of them still suffer from two main problems: expensive computational cost in high resolution images and unsatisfactory performance in simultaneous enhancement and denoising. To address these problems, we propose BDCE, a bootstrap diffusion model that exploits the learning of the distribution of the curve parameters instead of the normal-light image itself. Specifically, we adopt the curve estimation method to handle the high-resolution images, where the curve parameters are estimated by our bootstrap diffusion model. In addition, a denoise module is applied in each iteration of curve adjustment to denoise the intermediate enhanced result of each iteration. We evaluate BDCE on commonly used benchmark datasets, and extensive experiments show that it achieves state-of-the-art qualitative and quantitative performance. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd 2024.

关键词： Image enhancement

来源：评论

学校读者我要写书评

暂无评论

PCGAUNet: Pixel Correlation and Gaussian Attention Driven Network for Text Segmentation 27th

PCGAUNet: Pixel Correlation and Gaussian Attention Driven Ne...

引用

27th International Conference on pattern Recognition, ICPR 2024

作者： Roy, Ayush Palaiahnakote, Shivakumara Pal, Umapada Antonacopoulos, Apostolos Ramachandra, Raghavendra Computer Vision and Pattern Recognition Indian Statistical Institute Kolkata India School of Science Engineering and Environment University of Salford Manchester United Kingdom Norwegian University of Science and Technology Trondheim Norway

ISBN: (纸本)9783031784460

Text-line segmentation is still considered challenging for complex background scene images. The success of text detection and recognition depends on the success of the text segmentation. This study presents a new method for text segmentation to facilitate reliable detection and recognition. Therefore, we introduce a new model called Pixel Correlation and Gaussian Attention Driven Network (PCGAUNet) for text segmentation. To extract pixel correlation, we modified the MultiResUnet architecture, which leverages pixel-wise correlation to effectively highlight foreground pixels. In addition, the proposed model utilizes the prior spatial statistics of bottleneck features to create a learnable Gaussian distribution, which guides the decoder for accurate text segmentation. Experimental results on three standard scene text segmentation datasets, ICDAR13 FST, Total Text, and COCO-TS, show that the proposed model outperforms existing methods. Furthermore, the results for the underwater dataset UTS-55 show that our model is robust and generic. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Image segmentation

来源：评论

学校读者我要写书评

暂无评论

XLSI: A New Xception and Log Polar Transform Based Approach for Scene Text Script Identification 27th

XLSI: A New Xception and Log Polar Transform Based Approach ...

引用

27th International Conference on pattern Recognition, ICPR 2024

作者： Roy, Ayush Palaiahnakote, Shivakumara Pal, Umapada Antonacopoulos, Apostolos Blumenstein, Michael Computer Vision and Pattern Recognition Indian Statistical Institute Kolkata Kolkata India School of Science Engineering and Environment University of Salford Manchester United Kingdom University of Technology Sydney Sydney Australia

ISBN: (纸本)9783031784941

Script identification of text in natural scene images is challenging due to complex backgrounds, arbitrary orientations, different-sized characters, varying fonts, and multiple styles. Most existing methods are not effective in the presence of the above challenges. This paper introduces a new approach based on the Xception architecture and employing the log-polar transformed original image as an additional input, enabling the extraction of cues that are invariant to rotation, scaling, but are sensitive to script. The rationale behind the proposed work is that the combination of global features with text style features makes a significant difference in discriminating between different scripts. To combine the features extracted by Xception from the input image and log the polar transform of the input image, the proposed method introduces a style-enhanced fusion block. In addition, to further improve the performance of script identification, the proposed approach uses a new receptive channel selective focal attention module. Comparative evaluation results on three benchmark datasets, namely CVSI 2015, SIW-13, and MLe2e show that the proposed method outperforms the state-of-the-art in terms of classification rate. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

A New Contrastive Learning Based Model for Estimating Degree of Multiple Personality Traits Using Social Media Posts 1

引用

7th Asian Conference on pattern Recognition, ACPR 2023

作者： Biswas, Kunal Shivakumara, Palaiahnakote Pal, Umapada Sarkar, Ram Jadavpur University Kolkata India Faculty of Computer Science and Information Technology University of Malaya Kula Lumpur Malaysia Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India

ISBN: (数字)9783031476372

ISBN: (纸本)9783031476365

Estimating the degree of multiple personality traits in a single image is challenging due to the presence of multiple people, occlusion, poor quality etc. Unlike existing methods which focus on the classification of a single personality using images, this work focuses on estimating different personality traits using a single image. We believe that when the image contains multiple persons and modalities, one can expect multiple emotions and expressions. This work separates given input images into different faces of people, recognized text, meta-text and background information using face segmentation, text recognition and scene detection techniques. Contrastive learning is explored to extract features from each segmented region based on clustering. The proposed work fuses textual and visual features extracted from the image for estimating the degree of multiple personality traits. Experimental results on our benchmark datasets show that the proposed model is effective and outperforms the existing methods. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.

关键词： Image segmentation

来源：评论

学校读者我要写书评

暂无评论

Can an Image Tell the Tale: Looking Beyond the Haze to Determine PM2.5 Concentration

Can an Image Tell the Tale: Looking Beyond the Haze to Deter...

引用

International Joint Conference on Neural Networks (IJCNN)

作者： Sagarnil Chakraborty Sarbani Palit Harsh Bhandari Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India

ISBN: (数字)9798350359312

ISBN: (纸本)9798350359329

In the past few decades, due to rapid growth in industrialization, there has been a steady decline of the air quality along with an increase in the concentration of PM2.5. It is well known that a high PM2.5 concentration adversely affects the environment and has hazardous impact on public health. Therefore, it is important to monitor the PM2.5 concentration at geographic locations where air quality monitoring stations are presently unavailable, especially in remote areas. Unfortunately, installation of such monitoring stations requires expensive instruments and constant maintenance. This paper presents a novel, low-cost and portable alternative to such measurement apparatus, where PM2.5 concentration is estimated based on image input obtained from a camera. The novelty of the present work lies in its hitherto unique attempt to capture information regarding PM2.5 content from visibility degradation caused by the pollutant which is further supplemented by important knowledge regarding seasonal and diurnal variation of it. The latter has a crucial role in the prevention of confounding effects arising from the presence of other weather and atmospheric elements. Another important highlight is the use of a full reference image metric as a feature, for which a powerful, dehazing algorithm has been employed. The results obtained are extremely promising, providing a close to accurate estimation of PM2.5 concentration with R 2 values far higher than reported in the literature. To summarize, the construction of a unique feature set, together with an appropriate machine learning algorithm, lead to an extremely reliable, stand-alone approach, deployable on a hand-held device such as a mobile and is a very significant contribution indeed of the proposed approach.

关键词： Machine learning algorithms Prevention and mitigation Instruments Neural networks Air quality Pollution measurement Maintenance

来源：评论

学校读者我要写书评

暂无评论

DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly

DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D ...

引用

Conference on computer vision and pattern Recognition (CVPR)

作者： Gianluca Scarpellini Stefano Fiorini Francesco Giuliari Pietro Morerio Alessio Del Bue Pattern Analysis and Computer Vision (PAVIS) Istituto Italiano di Tecnologia (IIT)

ISBN: (数字)9798350353006

ISBN: (纸本)9798350353013

Reassembly tasks play a fundamental role in many fields and multiple approaches exist to solve specific reassembly problems. In this context, we posit that a general unified model can effectively address them all, irrespective of the input data type (images, 3D, etc.). We introduce DiffAssemble, a Graph Neural Network (GNN)-based architecture that learns to solve reassembly tasks using a diffusion model formulation. Our method treats the elements of a set, whether pieces of 2D patch or 3D object fragments, as nodes of a spatial graph. Training is performed by introducing noise into the position and rotation of the elements and iteratively denoising them to reconstruct the coherent initial pose. DiffAssemble achieves state-of-the-art (SOTA) results in most 2D and 3D reassembly tasks and is the first learning-based approach that solves 2D puzzles for both rotation and translation. Furthermore, we highlight its remarkable reduction in run-time, performing 11 times faster than the quickest optimization-based method for puzzle solving. Code available at https://***/IIT-PAVIS/DiffAssemble.

关键词： Training Solid modeling Three-dimensional displays Noise reduction Noise Diffusion models Graph neural networks

来源：评论

学校读者我要写书评

暂无评论

Aquaformer: Underwater Image Enhancement via Adaptive Transformer

Aquaformer: Underwater Image Enhancement via Adaptive Transf...

引用

International Joint Conference on Neural Networks (IJCNN)

作者： Harsh Bhandari Sarbani Palit Computer Vision and Pattern Recognition Unit Indian Statistical Institute Kolkata India

ISBN: (数字)9798350359312

ISBN: (纸本)9798350359329

Water causes degradation of quality in optical images captured underwater due to its physical properties of absorption and scattering. This degradation is further aggravated by the increase in water depth and the presence of contaminated water. Transformers in the vision domain have made a quantum leap in many vision tasks such as detection, and segmentation but yet to make any progress in enhancing degraded underwater images. We propose a transformer-based model named “Aquaformer” which makes four major contributions: an adaptive layer normalization, replacement of masked cyclic shift with symmetric padding in window partitioning, a novel aggregation mechanism, and an adjustable fusion approach. These succeed in making the model a very powerful one, producing significantly better performance compared to the latest state-of-the-art methods. Testing on multiple benchmark datasets, employing both quantitative and qualitative metrics, establishes its supremacy.

关键词： Degradation Water Adaptation models Scattering Benchmark testing Transformers Water pollution

来源：评论

学校读者我要写书评

暂无评论

Dynamic Feature Queue for Surveillance Face Anti-spoofing via Progressive Training

Dynamic Feature Queue for Surveillance Face Anti-spoofing vi...

引用

2023 IEEE/CVF Conference on computer vision and pattern Recognition Workshops, CVPRW 2023

作者： Wang, Keyao Huang, Mouxiao Zhang, Guosheng Yue, Haixiao Zhang, Gang Qiao, Yu China Chinese Academy of Sciences ShenZhen Key Lab of Computer Vision and Pattern Recognition Shenzhen Institute of Advanced Technology China University of Chinese Academy of Sciences China

ISBN: (纸本)9798350302493

In recent years, face recognition systems have faced increasingly security threats, making it essential to employ Face Anti-spoofing (FAS) to protect against various types of attacks in traditional scenarios like phone unlocking, face payment and self-service security inspection. However, further exploration is required to fully secure FAS in long-distance settings. In this paper, we propose two contributions to enhance the security of face recognition systems: Dynamic Feature Queue (DFQ) and Progressive Training Strategy (PTS). DFQ converts the conventional binary classification task into a multi-classification task. It treats live samples as a closed set and attack samples as an open set by using a dynamic queue that stores the features of spoofing samples and updates them. On the other hand, PTS targets difficult samples and iteratively adds them in batches for training. The proposed PTS divides the entire training set into blocks, trains only a small portion of the data, and gradually increases the training data with each stage while also incorporating low-scoring positive samples and high-scoring spoof samples from the test set. These two contributions complement each other by enhancing the model's ability to generalize and defend against various types of attacks, making the face recognition system more secure and reliable. Our proposed methods have achieved top performance on ACER metric with 4.73% on the SuHiFiMask dataset [11] and won the first prize in Surveillance Face Anti-spoofing track of the Challenge@CVPR 2023. © 2023 IEEE.

关键词： Iterative methods

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共101页 << < 1 2 3 4 5 6 7 8 9 10 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：