Video Action Recognition (VAR) is a challenging task due to its inherent complexities. Though different approaches have been explored in the literature, designing a unified framework to recognize a large number of hum...
Image denoising is a fundamental task in computer vision and image processing, crucial for improving the visual quality and interpretability of images captured in noisy environments. In this research, we propose a qua...
ISBN: (Print) 9798400716256
This paper introduces a novel technique of computational art with mandala—an iconic heritage of Indian folk art. Its novelty lies in several fundamental steps. The first one is fixing the asymmetries and the imperfections in a hand-drawn piece of art based on the notion of a primitive map. The primitive map is described using a novel concept of geometric salience—a set of well-defined salient points on the frontier polygon of a primitive—characterizing the concavities and convexities present in the primitive. The primitive map is also used for the vectorization of a mandala and its succinct representation as a mandala sector graph (MSG), which eventually results in efficient graph operations on an existing artwork to create a new piece of art. The use of frontier polygons in different steps of the algorithm makes it robust and efficient. Experimental results on various datasets demonstrate the potential and versatility of the proposed technique.
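As a minimal illustration of the convexity/concavity idea underlying geometric salience, the sketch below labels each vertex of an ordered, counter-clockwise frontier polygon as convex or concave using the sign of a cross product. The function names and the simple test itself are illustrative assumptions, not the paper's exact primitive-map construction.

```python
# Hypothetical sketch: classify vertices of a counter-clockwise frontier
# polygon as convex or concave from the turn direction at each vertex.
from typing import List, Tuple

Point = Tuple[float, float]

def turn(prev_pt: Point, cur: Point, nxt: Point) -> float:
    """z-component of the cross product of the incoming and outgoing edges."""
    ax, ay = cur[0] - prev_pt[0], cur[1] - prev_pt[1]
    bx, by = nxt[0] - cur[0], nxt[1] - cur[1]
    return ax * by - ay * bx

def label_vertices(polygon: List[Point]) -> List[Tuple[int, str]]:
    """Label each vertex of a counter-clockwise polygon as convex or concave;
    collinear vertices are skipped."""
    labels = []
    n = len(polygon)
    for i in range(n):
        z = turn(polygon[i - 1], polygon[i], polygon[(i + 1) % n])
        if z > 0:
            labels.append((i, "convex"))
        elif z < 0:
            labels.append((i, "concave"))
    return labels

if __name__ == "__main__":
    # An arrow-like polygon with a single concavity at vertex 3.
    poly = [(0, 0), (4, 0), (4, 3), (2, 1), (0, 3)]
    print(label_vertices(poly))  # vertex 3 -> 'concave', the rest 'convex'
```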
Satellite image super resolution is an important task that generates high resolution satellite images from low resolution inputs. Multi-frame super resolution utilizes multiple low-resolution images to generate a sing...
To address the slow detection speed and low detection accuracy of existing fatigue driving detection algorithms, a fatigue driving detection algorithm based on YOLOv5 is proposed. In order to improve the fe...
ISBN: (Print) 9798400716256
Knowledge Distillation is a transfer learning and compression technique that aims to transfer hidden knowledge from a teacher model to a student model. However, this transfer often leads to poor calibration in the student model. This can be problematic for high-risk applications that require well-calibrated models to capture prediction uncertainty. To address this issue, we propose a simple and novel technique that enhances the calibration of the student network by using an ensemble of well-calibrated teacher models. We train multiple teacher models using various data-augmentation techniques such as cutout, mixup, CutMix, and AugMix and use their ensemble for knowledge distillation. We evaluate our approach on different teacher-student combinations using CIFAR-10 and CIFAR-100 datasets. Our results demonstrate that our technique improves calibration metrics (such as expected calibration and overconfidence errors) while also increasing the accuracy of the student network.
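A minimal sketch of the distillation step described above, assuming the augmented teacher models have already been trained: the teachers' temperature-softened probabilities are averaged into an ensemble target and combined with a hard-label cross-entropy term. The loss weights, temperature, and function names are illustrative, not the paper's exact settings.

```python
# Hypothetical sketch of distilling from an ensemble of teachers.
# Teacher training with cutout/mixup/CutMix/AugMix is assumed to have
# happened already; only the per-batch loss computation is shown.
import torch
import torch.nn.functional as F

def ensemble_distillation_loss(student_logits, teacher_logits_list, targets,
                               temperature=4.0, alpha=0.5):
    """alpha weighs the soft ensemble term against the hard-label CE term."""
    with torch.no_grad():
        # Average the teachers' softened probabilities into one target.
        soft_targets = torch.stack(
            [F.softmax(t / temperature, dim=1) for t in teacher_logits_list]
        ).mean(dim=0)
    soft_student = F.log_softmax(student_logits / temperature, dim=1)
    kd = F.kl_div(soft_student, soft_targets,
                  reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits, targets)
    return alpha * kd + (1.0 - alpha) * ce
```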
ISBN: (Print) 9798400716256
We focus on domain and class generalization problems in analyzing optical remote sensing images, using the large-scale pre-trained vision-language model (VLM), CLIP. While contrastively trained VLMs show impressive zero-shot generalization performance, their effectiveness is limited when dealing with diverse domains during training and testing. Existing prompt learning techniques overlook the importance of incorporating domain and content information into the prompts, which results in a drop in performance while dealing with such multi-domain data. To address these challenges, we propose a solution that ensures domain-invariant prompt learning while enhancing the expressiveness of visual features. We observe that CLIP’s vision encoder struggles to identify contextual image information, particularly when image patches are jumbled up. This issue is especially severe in optical remote sensing images, where land-cover classes exhibit well-defined contextual appearances. To this end, we introduce C-SAW, a method that complements CLIP with a self-supervised loss in the visual space and a novel prompt learning technique that emphasizes both visual domain and content-specific features. We keep the CLIP backbone frozen and introduce a small set of projectors for both the CLIP encoders to train C-SAW contrastively. Experimental results demonstrate the superiority of C-SAW across multiple remote sensing benchmarks and different generalization tasks.
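A minimal sketch of the frozen-backbone idea, assuming CLIP image and text encoders that emit fixed-dimensional embeddings: small trainable projectors are placed on top of the frozen encoders and optimised with a symmetric contrastive loss. C-SAW's prompt learner and its self-supervised objective over jumbled image patches are not reproduced here; all module names and dimensions are assumptions.

```python
# Hypothetical sketch: trainable projectors over frozen CLIP encoders,
# trained with a symmetric InfoNCE loss over matched image/text pairs.
import torch
import torch.nn as nn
import torch.nn.functional as F

class Projector(nn.Module):
    def __init__(self, dim_in=512, dim_out=256):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim_in, dim_out),
                                 nn.ReLU(inplace=True),
                                 nn.Linear(dim_out, dim_out))

    def forward(self, x):
        # L2-normalise so the dot product below is a cosine similarity.
        return F.normalize(self.net(x), dim=-1)

def contrastive_loss(img_emb, txt_emb, temperature=0.07):
    """Symmetric InfoNCE: matched pairs sit on the diagonal of the logits."""
    logits = img_emb @ txt_emb.t() / temperature
    labels = torch.arange(logits.size(0), device=logits.device)
    return 0.5 * (F.cross_entropy(logits, labels) +
                  F.cross_entropy(logits.t(), labels))

# Usage (placeholders): freeze the CLIP backbone, optimise only projectors.
# for p in clip_model.parameters(): p.requires_grad_(False)
# img_proj, txt_proj = Projector(), Projector()
# loss = contrastive_loss(img_proj(clip_model.encode_image(images).float()),
#                         txt_proj(clip_model.encode_text(tokens).float()))
```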
Domain shifts are a common problem in computer vision. As a result, a classifier trained on a source domain cannot perform well on a target domain. Due to this, a source classifier trained to differentiate based on a ...
ISBN: (Print) 9798400716256
Recent developments in the field of Visual Question Answering (VQA) have witnessed promising improvements in performance through contributions in attention-based networks. Most such approaches focus on unidirectional attention, applying attention from the textual domain (the question) over the visual space. This work proposes a multistage co-attention framework in which attention is computed over both the image and the text. The co-attention mechanism is repeated across multiple stages, and different stages may capture significant and distinct features for learning better contextual information. The attention outputs are therefore aggregated to preserve the information from the different stages. Because the resulting multi-stage network could suffer from vanishing or exploding gradients, a loss is computed at each stage. Extensive experiments and analysis validate the effects of aggregated attention and stage-wise loss.
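A minimal sketch of a multistage co-attention stack with stage-wise supervision and aggregation of the per-stage outputs, under assumed feature dimensions and a placeholder answer classifier; it illustrates the structure rather than the paper's exact architecture.

```python
# Hypothetical sketch: repeated co-attention stages, a loss at every stage,
# and a simple aggregation of the per-stage predictions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class CoAttentionStage(nn.Module):
    def __init__(self, dim=512, heads=8):
        super().__init__()
        self.img_from_txt = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.txt_from_img = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, img, txt):
        # Image features attend to the question, and vice versa.
        img_att, _ = self.img_from_txt(img, txt, txt)
        txt_att, _ = self.txt_from_img(txt, img, img)
        return img + img_att, txt + txt_att

class MultiStageVQA(nn.Module):
    def __init__(self, num_stages=3, dim=512, num_answers=1000):
        super().__init__()
        self.stages = nn.ModuleList(CoAttentionStage(dim)
                                    for _ in range(num_stages))
        self.classifier = nn.Linear(2 * dim, num_answers)

    def forward(self, img, txt, answers=None):
        logits_per_stage, total_loss = [], 0.0
        for stage in self.stages:
            img, txt = stage(img, txt)
            fused = torch.cat([img.mean(dim=1), txt.mean(dim=1)], dim=-1)
            logits = self.classifier(fused)
            logits_per_stage.append(logits)
            if answers is not None:
                # Stage-wise loss to keep gradients flowing to every stage.
                total_loss = total_loss + F.cross_entropy(logits, answers)
        # One simple way to aggregate information from the different stages.
        return torch.stack(logits_per_stage).mean(dim=0), total_loss
```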
ISBN: (Print) 9798400716256
A neuromorphic camera is an image sensor that emulates the human eye by capturing only changes in local brightness levels. Such sensors are widely known as event cameras, silicon retinas, or dynamic vision sensors (DVS). A DVS records asynchronous per-pixel brightness changes, resulting in a stream of events that encode the time, location, and polarity of each change. It consumes little power and captures a wider dynamic range, with no motion blur and higher temporal resolution than conventional frame-based cameras. Although event capture already yields a lower bit rate than conventional video capture, the resulting event streams remain highly compressible. Hence, we introduce a novel deep learning-based compression methodology tailored for event data. The proposed technique employs a deep belief network (DBN) to condense the high-dimensional event data into a latent representation, which is subsequently encoded utilising an entropy-based coding method. Notably, our proposed scheme represents one of the initial endeavours to integrate deep learning methodologies for event compression. It achieves a high compression ratio while maintaining good reconstruction quality, outperforming state-of-the-art event data coders and other lossless benchmark techniques.
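A minimal sketch of one restricted Boltzmann machine layer, the building block of a deep belief network, trained with a single contrastive-divergence step; stacking such layers would yield the latent representation mentioned above. The event preprocessing and the entropy coder are omitted, and all names and hyperparameters here are assumptions.

```python
# Hypothetical sketch: a single RBM layer with a CD-1 update, operating on
# binarised event-count vectors. A DBN would train several such layers
# greedily, feeding each layer's hidden activations to the next.
import torch

class RBM:
    def __init__(self, n_visible, n_hidden, lr=1e-3):
        self.W = torch.randn(n_visible, n_hidden) * 0.01
        self.b_v = torch.zeros(n_visible)
        self.b_h = torch.zeros(n_hidden)
        self.lr = lr

    def sample_h(self, v):
        p = torch.sigmoid(v @ self.W + self.b_h)
        return p, torch.bernoulli(p)

    def sample_v(self, h):
        p = torch.sigmoid(h @ self.W.t() + self.b_v)
        return p, torch.bernoulli(p)

    def cd1_step(self, v0):
        """One contrastive-divergence update on a batch of binary vectors."""
        ph0, h0 = self.sample_h(v0)
        pv1, v1 = self.sample_v(h0)
        ph1, _ = self.sample_h(v1)
        batch = v0.size(0)
        self.W += self.lr * (v0.t() @ ph0 - v1.t() @ ph1) / batch
        self.b_v += self.lr * (v0 - v1).mean(dim=0)
        self.b_h += self.lr * (ph0 - ph1).mean(dim=0)
        # The hidden activations serve as the latent code for the next layer
        # (and, after the final layer, as input to the entropy coder).
        return ph0
```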