ISBN (digital): 9798350370249
ISBN (print): 9798350370270
Generative Adversarial Networks (GANs) are a class of generative models widely used in machine learning, computer vision, and natural language processing (NLP). GANs employ two neural networks, a generator and a discriminator, that are trained together to generate realistic-looking data such as images, audio clips, or 3-D scenes. In this work, we focus on 3-D scene reconstruction using GAN-based methods. We aim to generate 3-D scenes from given 2-D images, allowing us to gain insight into the structure and layout of complex 3-D environments. To this end, we propose leveraging a technique known as inverse rendering to improve the accuracy of the reconstructed 3-D scenes. We evaluate our method on synthetic and real-world images, and the results demonstrate the efficacy of our approach. Finally, we discuss potential future directions for 3-D scene reconstruction using GAN-based methods.
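To make the adversarial setup concrete, here is a minimal sketch of the generator/discriminator training loop the abstract describes. The network shapes, latent dimension, and optimizer settings are illustrative placeholders, not the paper's architecture.

```python
# Minimal GAN training step (sketch); sizes below are hypothetical.
import torch
import torch.nn as nn

latent_dim, data_dim = 64, 784  # placeholder dimensions
G = nn.Sequential(nn.Linear(latent_dim, 256), nn.ReLU(), nn.Linear(256, data_dim))
D = nn.Sequential(nn.Linear(data_dim, 256), nn.ReLU(), nn.Linear(256, 1))

opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()  # D outputs raw logits

def train_step(real_batch):
    b = real_batch.size(0)
    fake = G(torch.randn(b, latent_dim))

    # Discriminator: push real samples toward 1, generated samples toward 0.
    opt_d.zero_grad()
    loss_d = bce(D(real_batch), torch.ones(b, 1)) + \
             bce(D(fake.detach()), torch.zeros(b, 1))
    loss_d.backward()
    opt_d.step()

    # Generator: try to make the discriminator predict 1 on fakes.
    opt_g.zero_grad()
    loss_g = bce(D(fake), torch.ones(b, 1))
    loss_g.backward()
    opt_g.step()
    return loss_d.item(), loss_g.item()
```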
ISBN (print): 9781450366151
In monocular camera-based end-to-end driving, vehicle driving parameters such as steering angle and speed are estimated directly from the camera images using deep learning. In traditional autonomous driving, by contrast, these parameters are estimated using multiple modules for sensing, behaviour generation, path planning and control. Owing to its ability to directly estimate the driving parameters, the end-to-end driving framework has received significant attention from the research community. In this paper, we present a novel stereo-based deep learning framework for end-to-end driving, where the depth and appearance information generated using the stereo camera are integrated to improve the steering angle prediction accuracy, especially under varying illumination conditions. Validation of the proposed algorithm is performed using multiple sequences of pre-defined driving routes with an expert driver. Each pre-defined driving route is acquired over multiple days with varying illumination conditions. Using the acquired dataset, we show that the steering angle prediction accuracy of stereo-based end-to-end driving is better than that of monocular camera-based end-to-end driving.
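A minimal sketch of how depth and appearance streams might be fused for steering prediction, assuming a simple late-fusion design; the branch layers, feature sizes, and fusion point are assumptions, not the paper's network.

```python
# Two-branch late fusion of appearance (RGB) and stereo depth (sketch).
import torch
import torch.nn as nn

class StereoSteeringNet(nn.Module):
    def __init__(self):
        super().__init__()
        def branch(in_ch):  # small conv encoder, one per modality
            return nn.Sequential(
                nn.Conv2d(in_ch, 16, 5, stride=2), nn.ReLU(),
                nn.Conv2d(16, 32, 5, stride=2), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.rgb = branch(3)    # appearance from the left camera
        self.depth = branch(1)  # disparity/depth map from stereo matching
        self.head = nn.Sequential(nn.Linear(64, 32), nn.ReLU(), nn.Linear(32, 1))

    def forward(self, rgb, depth):
        # Concatenate per-modality features, then regress the steering angle.
        f = torch.cat([self.rgb(rgb), self.depth(depth)], dim=1)
        return self.head(f)
```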
ISBN (print): 9781450366151
Zero-shot learning (ZSL) for visual recognition aims at identifying previously unseen class samples, given a model trained on the labeled visual samples of seen classes and additional class-level semantic side information for all classes. Often ZSL is tackled by learning an embedding function from the visual to the semantic space or vice versa. However, learning this mapping often results in a loss of the discriminative property of the learned embedding space, severely compromising recognition performance on test samples. To ensure improved discrimination in the embedding space, we introduce a ZSL framework that leverages the intuitive idea of cross-domain triplet-based metric learning for learning such a space. Additionally, we introduce a novel graph-Laplacian-based regularizer which aligns the graph structures of the visual and semantic spaces in the learned embedding space. Simultaneously optimizing both criteria results in a compact, discriminative, and meaningful embedding space, which is experimentally found to be superior to most of its existing counterparts in both the standard ZSL (AwA and CUB) and the challenging generalized ZSL (AwA1, AwA2, CUB) settings.
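A minimal sketch of a cross-domain triplet criterion of the kind the abstract describes, assuming the anchor is an embedded visual sample and the positive/negative are embedded semantic vectors of the correct class and of a wrong class; the margin value is an assumption.

```python
# Cross-domain triplet loss (sketch): pull a visual embedding toward its
# class's semantic embedding, push it away from a wrong class's embedding.
import torch
import torch.nn.functional as F

def cross_domain_triplet(v_emb, s_pos, s_neg, margin=0.5):
    """v_emb: embedded visual features (B, d);
    s_pos / s_neg: embedded semantic vectors of the correct / wrong class."""
    d_pos = F.pairwise_distance(v_emb, s_pos)  # visual-to-correct-semantic
    d_neg = F.pairwise_distance(v_emb, s_neg)  # visual-to-wrong-semantic
    return F.relu(d_pos - d_neg + margin).mean()
```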
ISBN (print): 9781450366151
Deep learning models trained on natural images are commonly used for different classification tasks in the medical domain. Generally, very high dimensional medical images are down-sampled using interpolation techniques before being fed to deep learning models that are ImageNet compliant and accept only low-resolution images of size 224 x 224 px. This popular practice may lead to the loss of key information and thus hamper the classification, since significant pathological features in medical images are typically small in size and are therefore strongly affected by down-sampling. To combat this problem, we introduce a convolutional neural network (CNN) based classification approach which learns to reduce the resolution of the image using an autoencoder and at the same time classify it using another network, with both tasks trained jointly. This algorithm guides the model to learn essential representations from high-resolution images for classification along with reconstruction. We have used a publicly available dataset of chest X-rays to evaluate this approach and have outperformed the state-of-the-art on test data. Besides, we have experimented with the effects of different augmentation approaches on this dataset and report baselines using some well-known ImageNet-class CNNs.
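A minimal sketch of the joint training objective described above: an encoder learns the down-sampling, a decoder reconstructs the high-resolution input, and a classifier operates on the low-resolution output, with both losses optimized together. The module interfaces and the weighting term `alpha` are assumptions.

```python
# Jointly trained learned-downsampling + classification (sketch).
import torch.nn as nn

class JointDownsampleClassifier(nn.Module):
    def __init__(self, encoder, decoder, classifier, alpha=1.0):
        super().__init__()
        self.encoder, self.decoder = encoder, decoder  # autoencoder halves
        self.classifier = classifier                   # e.g. an ImageNet-style CNN
        self.alpha = alpha                             # reconstruction weight
        self.mse = nn.MSELoss()
        self.ce = nn.CrossEntropyLoss()

    def forward(self, x_hr, label):
        x_lr = self.encoder(x_hr)      # learned down-sampling to e.g. 224x224
        recon = self.decoder(x_lr)     # reconstruction branch
        logits = self.classifier(x_lr) # classification branch on the LR image
        loss = self.ce(logits, label) + self.alpha * self.mse(recon, x_hr)
        return logits, loss
```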
ISBN (print): 9781450366151
The presence of haze in the atmospheric medium degrades the quality of videos captured by camera sensors. The removal of haze, referred to as dehazing, is typically performed subject to a physical degradation model, which involves the solution of an ill-posed inverse problem. A few efforts have been made toward image dehazing, whereas video dehazing still remains an unexplored area of research. This paper proposes an approach for video dehazing combining the concepts of single image dehazing, optical flow estimation and Markov Random Fields (MRF). The proposed method enhances the temporal and spatial coherence of the hazy video. Assuming that the dark channel of the haze-free image is zero, we obtain the raw transmission map. In the proposed approach, we refine the raw transmission map obtained from the dark channel prior using a guided filter. We estimate the forward and backward optical flows between neighboring frames to locate individual pixels using Linear Discriminant Analysis. The colors of the haze-free pixels in the frames are approximated by a few hundred discrete colors, which form tight clusters in color space. The pixels belonging to a given cluster are spread across the frame, and their values after haze removal can be predicted by analyzing the forward and backward optical flows. The Large Margin Nearest Neighbor (LMNN) algorithm is applied to obtain a smooth transmission map of the foggy frames of the video and to approximate the pixel values in RGB space. The flow fields are utilized in an augmented MRF model on the obtained transmission map to enhance the temporal and spatial coherence of the transmission. The proposed method is compared against the state-of-the-art on both real and synthetic videos and is shown to preserve the information optimally.
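A minimal sketch of the dark-channel-prior step mentioned in the abstract, computing the raw transmission map t = 1 - omega * dark_channel(I/A); the patch size and omega follow common defaults from the dark channel prior literature, not necessarily the paper's settings.

```python
# Raw transmission map from the dark channel prior (sketch).
import numpy as np
from scipy.ndimage import minimum_filter

def dark_channel(img, patch=15):
    # Per-pixel minimum over the RGB channels, then a local minimum filter.
    return minimum_filter(img.min(axis=2), size=patch)

def raw_transmission(img, atmosphere, omega=0.95, patch=15):
    """img: float HxWx3 hazy frame in [0, 1]; atmosphere: length-3 airlight A."""
    normed = img / np.maximum(atmosphere, 1e-6)  # normalize by the airlight
    return 1.0 - omega * dark_channel(normed, patch)
```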
ISBN (print): 9781450366151
Face Recognition (FR) using Convolutional Neural Network (CNN) based models has achieved considerable success in constrained environments. These models, however, fail to perform well in unconstrained scenarios, especially when the images are captured using surveillance cameras. Such probe samples suffer from degradations such as noise, poor illumination, low resolution, blur and aliasing, when compared to the rich training (gallery) set, comprising mostly of mugshot images captured in laboratory settings. The images in the training (gallery) set are crisp and have high contrast compared to the probe samples. To cope with this scenario, we propose a novel dual-pathway generative adversarial network (DP-GAN) which maps low-resolution images captured using surveillance cameras to their corresponding high-resolution, gallery-like images, using a novel combination of multi-scale reconstruction and a Jensen-Shannon divergence based loss. The images thus obtained are then used to train a deep domain adaptation (deep-DA) network to perform the task of FR. The proposed network achieves superior results (>90%) on four benchmark surveillance face datasets, evident from the rank-1 recognition rates when compared with recent state-of-the-art CNN-based techniques.
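A minimal sketch of the kind of combined objective the abstract suggests: multi-scale L1 reconstruction against the gallery-like target plus an adversarial term (the standard GAN loss, whose optimum relates to the Jensen-Shannon divergence). `G`, `D`, the scale set and the weight `lam` are illustrative assumptions, not the paper's exact formulation.

```python
# Generator objective: multi-scale reconstruction + adversarial term (sketch).
import torch
import torch.nn.functional as F

def generator_loss(G, D, lr_face, hr_face, scales=(1.0, 0.5, 0.25), lam=0.01):
    sr = G(lr_face)  # hallucinated gallery-like high-resolution face
    recon = 0.0
    for s in scales:  # L1 reconstruction at several spatial scales
        size = [max(1, int(hr_face.shape[-2] * s)),
                max(1, int(hr_face.shape[-1] * s))]
        recon = recon + F.l1_loss(F.interpolate(sr, size=size),
                                  F.interpolate(hr_face, size=size))
    logits = D(sr)  # adversarial term: fool the discriminator
    adv = F.binary_cross_entropy_with_logits(logits, torch.ones_like(logits))
    return recon + lam * adv
```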
ISBN (print): 9781450366151
In this paper we present a novel methodology for recognizing human activity in egocentric video based on a Bag of Visual Features. The proposed technique is based on the assumption that only a portion of the whole video can be sufficient to identify an activity. We further argue that, for activity recognition in egocentric videos, the proposed approach performs better than deep learning based methods: in egocentric videos the person wearing the sensor often remains static for a long time, or moves his head frequently, and in both cases it becomes difficult to learn the spatio-temporal pattern of the video during an action. The proposed approach divides the video into smaller video segments called Video Units. Spatio-temporal features extracted from the units are clustered to construct a dictionary of Action Units (AUs). The AUs are ranked based upon their likeliness scores. The scores are obtained by constructing a weighted graph with the AUs as vertices and edge weights calculated from the frequencies of occurrence of the AUs during the activity. The less significant AUs are pruned from the dictionary, and the revised dictionary of key AUs is used for activity classification. We test our approach on a benchmark egocentric dataset and achieve good accuracy.
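A minimal sketch of the dictionary construction and pruning steps: cluster unit features into Action Units, score each AU in a co-occurrence graph, and keep the top-ranked AUs. Scoring by weighted degree and the consecutive-unit edge weights are assumptions; the paper's exact ranking scheme may differ.

```python
# Build an AU dictionary, rank AUs on a weighted graph, keep the key AUs (sketch).
import numpy as np
from sklearn.cluster import KMeans

def key_action_units(unit_features, n_aus=50, keep=30):
    """unit_features: (num_units, d) features of consecutive Video Units."""
    km = KMeans(n_clusters=n_aus, n_init=10).fit(unit_features)
    labels = km.labels_
    # Edge weight = how often two AUs occur in consecutive video units.
    W = np.zeros((n_aus, n_aus))
    for a, b in zip(labels[:-1], labels[1:]):
        if a != b:
            W[a, b] += 1
            W[b, a] += 1
    scores = W.sum(axis=1)                   # weighted degree as likeliness score
    return np.argsort(scores)[::-1][:keep]   # indices of the retained key AUs
```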
ISBN (print): 9781450366151
Studies of object detection and localization, particularly pedestrian detection, have received considerable attention in recent times due to several prospective applications such as surveillance, driving assistance and autonomous cars. A significant trend in recent research on related problems is the use of sophisticated deep learning based approaches to improve benchmark performance on various standard datasets. A trade-off between speed (number of video frames processed per second) and detection accuracy has often been reported in the existing literature. In this article, we present a new but simple deep learning based strategy for pedestrian detection that improves this trade-off. Since training similar models on publicly available sample datasets failed to improve the detection performance to a significant extent, particularly for instances of pedestrians of smaller sizes, we have developed a new dataset consisting of more than 80K annotated pedestrian figures in videos recorded under varying traffic conditions. Performance of the proposed model has been obtained on the test samples of the new dataset and two other existing datasets, namely the Caltech Pedestrian Dataset (CPD) and the CityPerson Dataset (CD). Our proposed system shows nearly 16% improvement over the existing state-of-the-art result.
ISBN (print): 9781450366151
In this paper, we attempt to advance the research work done in human action recognition to a rather specialized application, namely Indian Classical Dance (ICD) classification. The variation in such dance forms in terms of hand and body postures, facial expressions or emotions, and head orientation makes pose estimation an extremely challenging task. To circumvent this problem, we construct a pose-oblivious shape signature which is fed to a sequence learning framework. The pose signature representation is done in a two-fold process. First, we represent the person's pose in the first frame of a dance video using symmetric Spatial Transformer Networks (STN) to extract good person object proposals and a CNN-based parallel single person pose estimator (SPPE). Next, the pose bases are converted to pose flows by assigning a similarity score between successive poses, followed by non-maximal suppression. Instead of feeding a simple chain of joints to the sequence learner, which generally hinders network performance, we constitute a feature vector of normalized distance vectors, flow, and angles between anchor joints, which captures the adjacency configuration of the skeletal pattern. Thus, the kinematic relationship among the body joints across frames, obtained via pose estimation, helps in better establishing the spatio-temporal dependencies. We present an exhaustive empirical evaluation of state-of-the-art deep network based methods for dance classification on an ICD dataset.
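A minimal sketch of the per-frame part of such a pose signature: normalized pairwise distances and angles between a few anchor joints. The anchor-joint indices and the torso-length normalizer are illustrative assumptions; the paper's signature additionally includes flow features across frames.

```python
# Per-frame pose signature from anchor joints (sketch).
import numpy as np

ANCHORS = [0, 1, 4, 7, 10, 13]  # hypothetical joint indices (head, torso, limbs)

def pose_signature(joints):
    """joints: (J, 2) array of 2-D joint coordinates for one frame."""
    ref = np.linalg.norm(joints[1] - joints[0]) + 1e-6  # torso length as scale
    feats = []
    for i, a in enumerate(ANCHORS):
        for b in ANCHORS[i + 1:]:
            v = joints[b] - joints[a]
            feats.append(np.linalg.norm(v) / ref)  # scale-normalized distance
            feats.append(np.arctan2(v[1], v[0]))   # angle between the joint pair
    return np.asarray(feats)
```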
ISBN (print): 9781450366151
The last decade has witnessed rapid growth in the popularity of Convolutional Neural Networks (CNNs) for detecting and classifying objects. The self-trainable nature of CNNs makes them a strong candidate as both classifier and feature extractor. However, many of the existing CNN architectures fail to recognize texts or objects under input rotation and scaling. This paper introduces an elegant approach, Scale and Rotation Corrected CNN (SRC-CNN), for scale and rotation invariant text recognition, exploiting the concept of the principal component of characters. Prior to training and testing with a baseline CNN, SRC-CNN maps each character image to a reference orientation and scale, both derived from the character image itself. SRC-CNN is capable of recognizing characters in a document even when they differ greatly in orientation and scale. The proposed method does not demand any training with scaled or rotated samples. The performance of the proposed approach is validated on different character datasets such as MNIST, MNIST_rot_12k and English alphabets, and compared with state-of-the-art rotation invariant classification networks. SRC-CNN is a generalized approach and can be extended to rotation and scale invariant classification of many other datasets as well, choosing any appropriate baseline CNN. We also demonstrate the generality of the proposed SRC-CNN on the MNIST Fashion dataset and find that it performs well in rotation and scale invariant classification of objects. This paper demonstrates how basic PCA-based rotation and scale invariant image recognition can be integrated into a CNN to achieve better rotational and scale invariance in classification.
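A minimal sketch of a PCA-based rotation correction of the kind SRC-CNN performs before training and testing: estimate the character's principal axis from its foreground pixel coordinates and rotate the image to a reference orientation. The foreground threshold and the vertical reference orientation are assumptions.

```python
# PCA-based rotation correction of a character image (sketch).
import numpy as np
from scipy.ndimage import rotate

def correct_rotation(img, fg_thresh=0.5):
    """img: float HxW grayscale character image, foreground bright."""
    ys, xs = np.nonzero(img > fg_thresh)           # foreground pixel coordinates
    coords = np.stack([xs, ys], axis=1).astype(float)
    coords -= coords.mean(axis=0)                  # center the pixel cloud
    cov = coords.T @ coords / len(coords)          # 2x2 covariance matrix
    vals, vecs = np.linalg.eigh(cov)               # eigen-decomposition (PCA)
    major = vecs[:, np.argmax(vals)]               # principal (dominant) axis
    angle = np.degrees(np.arctan2(major[1], major[0]))
    # Rotate so the principal axis aligns with the vertical reference.
    return rotate(img, angle - 90.0, reshape=True)
```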