检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

1,851 篇 会议
120 册 图书
40 篇 期刊文献

馆藏范围

2,011 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

1,517 篇 工学
- 1,317 篇 计算机科学与技术...
- 750 篇 软件工程
- 430 篇 电气工程
- 252 篇 信息与通信工程
- 251 篇 光学工程
- 195 篇 生物工程
- 118 篇 生物医学工程（可授...
- 117 篇 控制科学与工程
- 95 篇 电子科学与技术（可...
- 87 篇 机械工程
- 68 篇 化学工程与技术
- 59 篇 安全科学与工程
- 37 篇 仪器科学与技术
- 37 篇 交通运输工程
- 34 篇 材料科学与工程（可...
- 22 篇 建筑学
- 18 篇 农业工程
620 篇 理学
- 324 篇 物理学
- 232 篇 数学
- 209 篇 生物学
- 71 篇 化学
- 62 篇 统计学（可授理学、...
- 28 篇 系统科学
345 篇 医学
- 333 篇 临床医学
- 61 篇 基础医学(可授医学...
- 48 篇 药学(可授医学、理...
174 篇 管理学
- 121 篇 图书情报与档案管...
- 66 篇 管理科学与工程(可...
24 篇 农学
- 22 篇 作物学
23 篇 法学
- 21 篇 社会学
12 篇 教育学
8 篇 经济学
5 篇 军事学
2 篇 文学

主题

232 篇 computer vision
101 篇 image processing
92 篇 deep learning
88 篇 artificial intel...
84 篇 image processing...
73 篇 image segmentati...
72 篇 computer graphic...
68 篇 feature extracti...
63 篇 pattern recognit...
41 篇 object detection
41 篇 convolutional ne...
40 篇 machine learning
37 篇 computational mo...
36 篇 image enhancemen...
34 篇 image reconstruc...
33 篇 computer imaging...
32 篇 graphics process...
32 篇 image classifica...
31 篇 face recognition
31 篇 visualization

机构

19 篇 indian statistic...
15 篇 indian institute...
12 篇 indian inst tech...
9 篇 indian institute...
9 篇 indian institute...
9 篇 indian inst tech...
8 篇 indian institute...
8 篇 indian institute...
8 篇 indian inst tech...
8 篇 indian institute...
7 篇 indian stat inst...
7 篇 indian inst tech...
7 篇 indian inst tech...
7 篇 indian inst tech...
7 篇 indian inst tech...
7 篇 indian inst sci ...
6 篇 iit kharagpur kh...
6 篇 indian inst tech...
6 篇 faculty of elect...
6 篇 department of co...

作者

31 篇 chaudhury santan...
25 篇 mukherjee jayant...
25 篇 chaudhuri subhas...
21 篇 das sukhendu
16 篇 lall brejesh
15 篇 babu r. venkates...
15 篇 raman shanmugana...
13 篇 das partha prati...
13 篇 mukherjee dipti ...
13 篇 harit gaurav
13 篇 mishra deepak
11 篇 chanda bhabatosh
11 篇 mukherjee snehas...
10 篇 biswas soma
10 篇 raman balasubram...
10 篇 jawahar c.v.
10 篇 biswas prabir ku...
10 篇 sur arijit
9 篇 banerjee biplab
9 篇 balasubramanian ...

语言

1,970 篇 英文
37 篇 其他
6 篇 中文
2 篇 俄文

检索条件"任意字段=6th Indian Conference on Computer Vision, Graphics and Image Processing"

共 2011 条记录，以下是71-80 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Cross View and Cross Walking Gait Recognition Using a Convolutional Neural Network 1

引用

8th International conference on computer vision and image processing (CVIP)

作者： Nahar, Sonam Narsingani, Sagar Patel, Yash Pandit Deendayal Energy Univ Gandhinagar Gujarat India

ISBN: (数字)9783031581816

ISBN: (纸本)9783031581809;9783031581816

In this paper, we propose a gait recognition method using a convolutional neural network (CNN). A CNN architecture is designed and trained to learn an efficient representation with which walking patterns i.e., gait can be disentangled from the visual appearance of the subjects caused by covariate factors such as variation in view angles, clothing and carrying conditions. Since dynamic areas contain the most informative part of the human gait and are insensitive to changes in various covariate conditions, we feed the gait entropy images as input to CNN model to capture mostly the motion information. the learned gait features from CNN are then fed into a K-NN classifier to identify individuals based on their unique gait patterns. Experiments are carried out for cross-view and cross-walking gait recognition using the CASIA-B dataset. Our experimental results demonstrate the effectiveness of the proposed method.

关键词： Gait Recognition CNN Cross View Cross Walking

来源：评论

学校读者我要写书评

暂无评论

Identity Preserved Expressive Talking Faces with Synchrony 8th

Identity Preserved Expressive Talking Faces with Synchrony

引用

8th International conference on computer vision and image processing (CVIP)

作者： Abhijeet, Karumuri Meher Ali, Arshad Guha, Prithwijit Indian Inst Technol Guwahati Dept Elect & Elect Engn Gauhati India

ISBN: (纸本)9783031581809;9783031581816

this work proposes a novel approach to talking face generation using driving audio. the driving audio and a single image of the target person are provided as input to the proposed model. the model generates a realistic video of the target person uttering the driving audio. Recent works in this domain have focused on either one of expressions or lip-sync or identity preservation. this model provides supervision over photo realism, expression fulfilment, identity preservation and audio-visual synchrony which are crucial factors in synthesizing a realistic video. the proposed system is end-to-end trainable and the learning is performed with six losses. this method can generate photo realistic, expressive and audio-synced talking faces while preserving the identity of the target person. this work proposes a discriminator network to impose audio-visual synchrony in the generated video. the proposed model is trained on RAVDESS dataset containing 24 professional actors (12 female and 12 male), uttering two statements in a neutral North American accent with disgust, sad, angry, happy, surprise, fearful and calm emotions. this work is benchmarked on the VID-TIMIT dataset against three baseline models.

关键词： Talking Faces GANs Face Animation Action-Unit

来源：评论

学校读者我要写书评

暂无评论

Feature Fusion and Multi-head Attention Based Hindi Captioner 8th

Feature Fusion and Multi-head Attention Based Hindi Captione...

引用

8th International conference on computer vision and image processing (CVIP)

作者： Meghwal, Virendra Kumar Mittal, Namita Singh, Girdhari Malaviya Natl Inst Technol Jaipur Rajasthan India

ISBN: (纸本)9783031581809;9783031581816

Deep learning-based methods are extensively used in image captioning, but most of these methods depend on features from a single encoder for generating captions. Different encoders capture different features of an image, and thus, using features from multiple encoders may help improve the models' performance. Moreover, there needs to be more research on Hindi caption generation on large datasets such asMSCOCO. Recently, transformers have performed well on tasks such as image classification and object detection. One such transformer is the Swin Transformer. It captures both local as well as global information present in the image. A Faster RCNN, on the other hand, captures only local (object-level) information but does not capture global details. Using a single image feature generation method might sometimes result in incorrect feature generation, or some important objects may be missed while generating the feature vector. this problem can be mitigated by combining features from different methods. Furthermore, as local features-based models have produced better results in different domains, utilizing both Swin Transformer and Faster RCNN may result in better captioning models. this work proposes to use Swin Transformer-based image features along with Faster RCNN-based image features to generate Hindi captions for images. A decoder with two GRUs and Multi-head Attention uses these image features to build Hindi captions. Experiments demonstrate that the proposed method can generate high-quality captions while improving the performance of automatic evaluation metrics, establishing the method's efficacy.

关键词： image Captioning Feature Fusion computer vision Natural Language processing

来源：评论

学校读者我要写书评

暂无评论

Road Line and Direction Detection Methods Based on Vanishing Point for an Electric Vehicle 6

Road Line and Direction Detection Methods Based on Vanishing...

引用

6th Global Power, Energy and Communication conference (GPECOM)

作者： Yildirim, Merve Karaduman, Ozgur Kurum, Hasan Firat Univ Dept Elect & Elect Engn Elazig Turkiye Firat Univ Dept Software Engn Elazig Turkiye Univ Biruni Dept Elect & Elect Engn Istanbul Turkiye

ISBN: (纸本)9798350351088;9798350351095

image road detection is a significant problem in the applications of autonomous vehicles and mobile robots. the bend direction and angle are the main subjects for the inclined roads. To move safely on the road, it is very important to determine the road lines and the direction of the bend. For this reason, a Vanishing Point (VP) detection algorithm is proposed to estimate the road boundaries for an Electric Vehicle (EV). However, it is challenging to effectively estimate the VP from a video image of the inclined roads. therefore, a reliable VP estimate approach is suggested that makes use of the junction points of the line segments taken from an image and a probabilistic voting process. Besides, the OpenCV library, which has widely used computer vision algorithms and the Python software language, is utilized in the study. As a result, a Driver Assistance System (DAS), which is an important step in autonomous driving, is developed, and safe driving is provided by this work.

关键词： electric vehicle image processing python OpenCV road line detection direction detection vanishing point

来源：评论

学校读者我要写书评

暂无评论

TaPaSe: Tanjore Paddy Seed Dataset 8th

TaPaSe: Tanjore Paddy Seed Dataset

引用

8th National conference on computer vision, Pattern Recognition, image processing and graphics-NCVPRIPG

作者： Sasithradevi, A. Vijayalakshmi, M. Varsini, S. R. Gabriel, Joshua R. Roomi, S. Mohamed Mansoor Prakash, P. Vellore Inst Technol Ctr Adv Data Sci Chennai Tamil Nadu India Vellore Inst Technol Sch Elect Engn Chennai Tamil Nadu India Vellore Inst Technol Sch Comp Sci & Engn Chennai Tamil Nadu India Thiagarajar Coll Engn Dept Elect & Commun Engn Madurai Tamil Nadu India Madras Inst Technol Dept Elect Engn Chennai Tamil Nadu India

ISBN: (纸本)9789819752119;9789819752126

Rice is a popular staple diet in India, and its demand has recently increased. thanjavur, located in the Cauvery Delta region, is known as the rice granary of South India. Due to recent technological advancements, digital farming and globalization have significantly impacted the agricultural industry. It is crucial to differentiate between types of rice grains to prevent fraudulent labeling during import and export. To achieve this, a dataset, namely "TaPaSe Dataset", comprising five varieties of rice, including MTU 1010, MTU 1290, Narmadha, Pacha Ponni, and Sonna Masur, which are mainly cultivated in thanjavur, has been collected. We designed an image acquisition system to capture the aforementioned varieties in real time. the captured paddy rice images are highly challenging in the sense that all the images are captured under illumination and scale variations. We evaluated existing deep learning models to understand their ability to classify paddy seed varieties. the existing pre-trained models attain remarkable recognition rates on the proposed paddy seed varieties dataset.

关键词： image classification Deep learning Pre-trained models

来源：评论

学校读者我要写书评

暂无评论

TRAQID - Traffic-Related Air Quality image Dataset 24

TRAQID - Traffic-Related Air Quality Image Dataset

引用

15th indian conference on computer vision graphics and image processing

作者： Kathalkar, Om Rajendra Nilesh, Nitin Chaudhari, Sachin Namboodiri, Anoop Int Inst Informat Technol Signal Proc & Commun Res Ctr Hyderabad Telangana India Int Inst Informat Technol Ctr Visual Informat Technol CVIT Hyderabad Telangana India

ISBN: (纸本)9798400710759

Air quality estimation through sensor-based methods is widely used. Nevertheless, their frequent failures and maintenance challenges constrain the scalability of air pollution monitoring efforts. Recently, it has been demonstrated that air quality estimation can be done using image-based methods. these methods offer several advantages including ease of use, scalability, and low cost. However, the accuracy of these methods hinges significantly on the diversity and magnitude of the dataset utilized. the advancement of air quality estimation through image analysis has been limited due to the lack of available datasets. Addressing this gap, we present TRAQID - Traffic-Related Air Quality image Dataset, a novel dataset capturing 26,678 front and rear images of traffic alongside co-located weather parameters, multiple levels of Particulate Matters (PM) and Air Quality Index (AQI) values. Spanning over multiple seasons, with over 70 hours of data collection in the twin cities of Hyderabad and Secunderabad, India, the TRAQID offers diverse day and night imagery amid unstructured traffic conditions, encompassing six AQI categories ranging from "Good" to "Severe". State-of-the-art air quality estimation techniques, which were trained on a smaller and less-diverse dataset, showed poor results on the dataset presented in this paper. TRAQID models various uncertainty types, including seasonal changes, unstructured traffic patterns, and lighting conditions. the information from the two views (front and rear) of the traffic can be combined to improve the estimation performance in such challenging conditions. As such, the TRAQID serves as a benchmark for image-based air quality estimation tasks and AQI prediction, given its diversity and magnitude. Dataset Link

关键词： image Dataset Air Quality Estimation Vehicle-Induced Pollution Environmental Data

来源：评论

学校读者我要写书评

暂无评论

An Integrated Model for Text to Text, image to Text and Audio to Text Linguistic Conversion using Machine Learning Approach 6

An Integrated Model for Text to Text, Image to Text and Audi...

引用

6th International conference on Information Systems and computer Networks, ISCON 2023

作者： Singh, Aman Raj Bhardwaj, Diwakar Dixit, Mridul Kumar, Lalit GLA University Department of Cea Mathura India Amity University Noida India

ISBN: (纸本)9798350346961

this paper presents an integrated model that uses machine learning techniques to perform text-to-text, image-to-text, and audio-to-text conversions, with particularly focus on indian languages. the proposed model which can translate text, image, and voice has been tested on large datasets of various indian languages and utilizes state-of-the-art techniques such as machine learning, computer vision, and speech recognition to accurately transcribe and translate the input data. the results obtained from the experiments demonstrate the effectiveness of the model by accurately converting text, images, and audio to text, and the potential applications of our proposed model range from language learning, accessibility for non-verbal or non-hearing individuals to cross-language communication. the proposed model is intended to bridge the language gap and facilitate communication among people from different linguistic backgrounds.. © 2023 IEEE.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Variational Distribution and Experience Replay for 3D Reconstruction in a Continual Learning Framework 24

Variational Distribution and Experience Replay for 3D Recons...

引用

15th indian conference on computer vision graphics and image processing

作者： Palit, Sanchar Biswas, Sandika Indian Inst Technol Elect Engn Mumbai Maharashtra India Monash Univ Melbourne Vic Australia

ISBN: (纸本)9798400710759

Single-image 3D reconstruction is a research challenge focused on predicting 3D object shapes from single-view images, requiring all training data for all objects to be available from the start. In dynamic environments, it's impractical to gather data for all objects at once;data becomes available in phases with restrictions on past data access. therefore, the model must reconstruct new objects while retaining the ability to reconstruct previous objects without accessing prior data. Additionally, existing 3D reconstruction methods in continual learning fail to reproduce previous shapes accurately, as they are not designed to manage changing shape information in dynamic scenes. To this end, we propose a continual learning-based 3D reconstruction method. Our goal is to design a model that can accurately reconstruct previously seen classes even after training on new ones, ensuring faithful reconstruction of both current and previous objects. To achieve this, we propose using variational distribution from the latent space, which represent abstract shapes and effectively retain shape information within a simplified code structure that requires minimal memory. Additionally, saliency maps preserve object attributes, capturing both local minor shape details and the overall shape structure. We employ experience replay to leverage these saliency maps effectively. Both methods ensure that the shape is faithfully reconstructed, preserving all minor details from the previous dataset. this is vital due to resource constraints in storing extensive training data. thorough experiments show competitive results compared to established methods, both quantitatively and qualitatively.

关键词： 3D reconstruction Continual Learning Variational Inference image saliency

来源：评论

学校读者我要写书评

暂无评论

Adapting Spatial Transformer Networks Across Diverse Hardware Platforms: A Comprehensive Implementation Study 6

Adapting Spatial Transformer Networks Across Diverse Hardwar...

引用

6th International conference on AI Circuits and Systems (AICAS)

作者： Bettayeb, Meriem Hassan, Eman Khan, Muhammad Umair Halawani, Yasmin Saleh, Hani Mohammad, Baker Khalifa Univ Syst On Chip SoC Lab Comp & Commun Engn Dept Abu Dhabi U Arab Emirates Univ Dubai Coll Engn & IT Dubai U Arab Emirates

ISBN: (纸本)9798350383638;9798350383645

the field of artificial intelligence (AI) holds a variety of algorithms designed with the goal of achieving high accuracy at low computational cost and latency. One popular algorithm is the vision transformer (ViT), which excels at various computer vision tasks for its ability to capture long-range dependencies effectively. this paper analyzes a computing paradigm, namely, spatial transformer networks (STN), in terms of accuracy and hardware complexity for image classification tasks. the paper reveals that for 2D applications, such as image recognition and classification, STN is a great backbone for AI algorithms for its efficiency and fast inference time. this framework offers a promising solution for efficient and accurate AI for resource-constrained Internet of things (IoT) and edge devices. the comparative analysis of STN implementations on the central processing unit (CPU), Raspberry Pi (RPi), and Resistive Random Access Memory (RRAM) architectures reveals nuanced performance variations, providing valuable insights into their respective computational efficiency and energy utilization.

关键词： Spatial Transformer Network image Classification vision transformer raspberry Pi hardware platforms artificial intelligence

来源：评论

学校读者我要写书评

暂无评论

A computer vision Approach for Autonomous Cars to Drive Safe at Construction Zone 6

A Computer Vision Approach for Autonomous Cars to Drive Safe...

引用

6th IEEE International conference on image processing, Applications and Systems, IPAS 2025

作者： Ahammed, Abu Shad Shahi Amran Hossain, Md Obermaisser, Roman University of Siegen Chair of Embedded Systems Siegen Germany

ISBN: (纸本)9798331506520

To build a smarter and safer city, a secure, efficient, and sustainable transportation system is a key requirement. the autonomous driving system (ADS) plays an important role in the development of smart transportation and is considered one of the major challenges facing the automotive sector in recent decades. A car equipped with an autonomous driving system (ADS) comes with various cutting-edge functionalities such as adaptive cruise control, collision alerts, automated parking, and more. A primary area of research within ADAS involves identifying road obstacles in construction zones regardless of the driving environment. this paper presents an innovative and highly accurate road obstacle detection model utilizing computer vision technology that can be activated in construction zones and functions under diverse drift conditions, ultimately contributing to build a safer road transportation system. the model developed with the YOLO framework achieved a mean average precision exceeding 94% and demonstrated an inference time of 1.6 milliseconds on the validation dataset, underscoring the robustness of the methodology applied to mitigate hazards and risks for autonomous vehicles. © 2025 IEEE.

关键词： Autonomous vehicles

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共202页 << < 4 5 6 7 8 9 10 11 12 13 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：