检索结果-内蒙古大学图书馆

International Joint Conference on Neural Networks (IJCNN)

作者： Lu, Yaping Zhang, Li Wang, Bangjun Yang, Jiwen Suzhou Univ Sch Comp Sci & Technol Suzhou 215006 Jiangsu Peoples R China Suzhou Univ Prov Key Lab Comp Informat Proc Technol Suzhou 215006 Jiangsu Peoples R China

ISBN: (纸本)9781479914845

Deep networks are well known for their powerful function approximations. To train a deep network efficiently, greedy layer-wise pre-training and fine tuning are required. Typically, pre-training, aiming to initialize a deep network, is implemented via unsupervised feature learning, with multiple feature representations generated. However, in general only the last layer representation is to be employed because of its abstraction and compactness being the best with comparisons to the ones of lower layers. To make full use of the representations of all layers, this paper proposes a feature ensemble learning method based on sparse autoencoders for image classification. Specifically, we train three softmax classifiers by using the representations of different layers, instead of one classifier trained by applying the last layer representation. Of the three softmax classifiers, two are obtained by training stacked auto encoders with fine tuning, and the other one is obtained by directly using a concatenation of two representations. To improve accuracy and stability of a single softmax classifier, the ensemble of multiple classifiers is considered, and some Naive Bayes combination rules are introduced to integrate the three classifiers. Experimental results on the MNIST and COIL datasets are presented, with comparisons to other classification methods.

关键词： deep network feature representation feature ensemble autoencoder softmax Naive Bayes

来源：评论

学校读者我要写书评

暂无评论

R~2FP: Rich and Robust Feature Pooling for Mining Visual Data

R~2FP: Rich and Robust Feature Pooling for Mining Visual Dat...

引用

IEEE International Conference on Data Mining

作者： Wei Xiong Bo Du Lefei Zhang Ruimin Hu Wei Bian Jialie Shen Dacheng Tao School of Computer Science Wuhan University National Engineering Research Center for MultimediaSoftware Luojiashan Centre for Quantum Computation & Intelligent Systems University of Technology School of Information Systems Singapore Management University

ISBN: (纸本)9781467395052

The human visual system proves smart in extracting both global and local features. Can we design a similar way for unsupervised feature learning? In this paper, we propose a novel pooling method within an unsupervised feature learning framework, named Rich and Robust Feature Pooling (R~2FP), to better explore rich and robust representation from sparse feature maps of the input data. Both local and global pooling strategies are further considered to instantiate such a method and intensively studied. The former selects the most conductive features in the sub-region and summarizes the joint distribution of the selected features, while the latter is utilized to extract multiple resolutions of features and fuse the features with a feature balancing kernel for rich representation. Extensive experiments on several image recognition tasks demonstrate the superiority of the proposed techniques.

关键词： Pooling autoencoder Representation learning

来源：评论

学校读者我要写书评

暂无评论

Speech Separation based on Deep Belief Network

Speech Separation based on Deep Belief Network

引用

2015 International Industrial Informatics and Computer Engineering Conference(IIICEC 2015)

作者： Wu Haijia Zhang Xiongwei Zhang Liangliang Zou Xia College of Command Information and Systems PLA University of Science and Technology

Thanks to its hierarchical and generative nature,Deep Belief Network（DBN） is effective to feature representation and extraction in signal *** this paper,DBN is investigated and implemented to monaural speech ***,two separate DBNs are trained to extract features from mixed noisy signals and target clean speech ***,the two types of extracted features are associated together by training a BP neural network to obtain a mapping from the features of mixed signals to the features of target ***,by performing DBN and the above mapping neural network,target speech can be estimated from the input mixed *** are conducted on different kinds of mixed signals including female/male speech mixtures,human-speech/Gaussian-noise audio mixtures,and human-speech/music audio *** PESQ scores of the extracted speech are 3.32,2.59,and 3.42 respectively,which illustrates that the model performs well on speech separation tasks,especially on the mixed signals where the inference signals have obvious spectral structures.

关键词： speech separation deep learning deep belief network restricted Boltzmann machine autoencoder

来源：评论

学校读者我要写书评

暂无评论

A Novel Method Based on Data Visual Autoencoding for Time-Series Classification

A Novel Method Based on Data Visual Autoencoding for Time-Se...

引用

2015年中国智能自动化学术会议

作者： Chen Qian Yan Wang Lei Guo School of Automation Science and Electrical Engineering Beihang University

A variety of techniques based on numerical characteristics are currently presented for mining time-series data. However, we find that time-series data generally contain curves sharing some set of visual characteristics and *** characteristics offer a deeper understanding of time-series data, and open up a potential new technique for time-series analysis. Particularly beneficial from recent advances in deep neural networks, representations and features can be automatically learnt by deep learning architectures such as autoencoders. Based on that, our work proposes a novel method, named time-series visualization(TSV), to efficiently detect visual characteristics from curves of time-series data and use these characteristics for intelligent analysis. Architecture and algorithm of TSV based on stacked autoencoders are introduced in this paper. Further, important factors affecting the performance of TSV are discussed based on empirical results. Through empirical evaluation, it is demonstrated that TSV has better efficiency and higher classification accuracy on analyzing the datasets with significant curve feature.

关键词： Time series autoencoder Classification Input dropout TSV

来源：评论

学校读者我要写书评

暂无评论

INCORPORATING IMAGE DEGENERATION MODELING WITH MULTITASK LEARNING FOR IMAGE SUPER-RESOLUTION

INCORPORATING IMAGE DEGENERATION MODELING WITH MULTITASK LEA...

引用

IEEE International Conference on Image Processing

作者： Yudong Liang Jinjun Wang Shizhou Zhang Yihong Gong Xi'an Jiaotong University Institute of Artificial Intelligence and Robotics

ISBN: (纸本)9781479983407

Learning the non-linear image upscaling process has previously been considered as a simple regression process, where various models have been utilized to describe the correlations between high-resolution (HR) and low-resolution (LR) images/patches. In this paper, we present a multitask learning framework based on deep neural network for image super-resolution, where we jointly consider the image super-resolution process and the image degeneration process. By sharing parameters between the two highly relevant tasks, the proposed framework could effectively improve the obtained neural network based mapping model between HR and L-R image patches. Experimental results have demonstrated clear visual improvement and high computational efficiency, especially with large magnification factors.

关键词： Super-resolution Multitask learning autoencoder Degeneration modeling super-resolution images degeneration Neural network Imagery (Psychotherapy) Image Heart Rate Computational efficiency Modeling Learning

来源：评论

学校读者我要写书评

暂无评论

高光谱图像的数据压缩与分类算法研究

高光谱图像的数据压缩与分类算法研究

引用

作者：郭智西安电子科技大学

学位级别：硕士

高光谱图像是一种特征维度大、像素点众多的图像数据集,目前对其主要研究工作包括了特征选择、特征提取、模式分类等等。由于高光谱图像的数据量较为庞大且存在冗余信息,因此对数据的特征学习与挖掘有效数据点是图像处理的关键。目前主... 详细信息

高光谱图像是一种特征维度大、像素点众多的图像数据集,目前对其主要研究工作包括了特征选择、特征提取、模式分类等等。由于高光谱图像的数据量较为庞大且存在冗余信息,因此对数据的特征学习与挖掘有效数据点是图像处理的关键。目前主要的特征学习算法包括PCA、LDA等传统特征学习算法,以及新兴且越来越流行的基于深度学习的算法;而对于关注度较低的图像数据压缩领域,主要的压缩方法包括基于kNN的筛选算法以及利用Nystrom的数据约减算法。本文以深度学习为基础,结合了基于kNN的筛选算法、神经网络分类器算法对高光谱图像进行数据压缩,找到有代表性的少数数据点,对其进行标记并训练,相比于实际操作中的随机选点进行标记和训练,可提供更有效地指导模型的训练,提高后续分类操作的准确性;而最后一部分内容则是将广泛应用于自然图像识别的卷积神经网络算法进行模型的归纳与推广,应用于高维度的高光谱图像分类中。主要工作概括如下：1.本文提出了一种基于多层网络架构的数据压缩与分类算法,并将其应用于高光谱图像的数据约减与分类,压缩数据的过程中算法可根据用户的需求决定每一次压缩的数据量,直到已筛选出的数据点满足能够良好表示原始图像中每个像素点的值为止。随后我们用筛选出的数据点构成训练样本集,利用SVM分类器对其余数据点构成的测试集进行有监督分类。2.本文提出了一种基于深度网络特征的数据压缩表示与分类算法,该方法涉及了流行的深度网络有关知识,以及神经网络的相关理论,能够对原始图像的特征得到更有效的表示,并将这个在新的特征空间下的图像数据利用神经网络分类器跟现有训练样本集进行拟合比对,把相似性较低的未标记样本加入到训练数据中,直到满足用户对训练数据个数的要求为止。随后利用筛选出的数据点以及前期已知的少量样本构成训练样本集,利用SVM分类器对其余数据点构成的测试集进行分类。3.本文提出了一种基于卷积神经网络的分类器算法,并将其应用到高光谱图像的分类中,该算法利用了多层卷积网络对原始高光谱图像的训练样本集进行特征学习,使用神经网络对这些特征和对应的类别标记进行训练建模,最后对其余数据点构成的测试集在此模型下进行分类。

关键词：数据压缩分类 Nystr(o|")m 深度学习 autoencoder softmax CNN 高光谱图像

来源：评论

学校读者我要写书评

暂无评论

Feature Ensemble Learning based on Sparse autoencoders for Image Classification

Feature Ensemble Learning based on Sparse Autoencoders for I...

引用

International Joint Conference on Neural Networks

作者： Yaping Lu Li Zhang Bangjun Wang Jiwen Yang School of Computer Science and Technology & Provincial Key Laboratory for Computer Information Processing Technology Soochow University

ISBN: (纸本)9781479914821

Deep networks are well known for their powerful function approximations. To train a deep network efficiently, greedy layer-wise pre-training and fine tuning are required. Typically, pre-training, aiming to initialize a deep network, is implemented via unsupervised feature learning, with multiple feature representations generated. However, in general only the last layer representation is to be employed because of its abstraction and compactness being the best with comparisons to the ones of lower layers. To make full use of the representations of all layers, this paper proposes a feature ensemble learning method based on sparse autoencoders for image classification. Specifically, we train three softmax classifiers by using the representations of different layers, instead of one classifier trained by applying the last layer representation. Of the three softmax classifiers, two are obtained by training stacked auto-encoders with fine tuning, and the other one is obtained by directly using a concatenation of two representations. To improve accuracy and stability of a single softmax classifier, the ensemble of multiple classifiers is considered, and some Naive Bayes combination rules are introduced to integrate the three classifiers. Experimental results on the MNIST and COIL datasets are presented, with comparisons to other classification methods.

关键词： Deep network Feature representation Feature ensemble autoencoder Softmax Naive Bayes

来源：评论

学校读者我要写书评

暂无评论

Construction and Reduction Methods of Vulnerability Index System in Power SCADA

引用

INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS 2014年第6期8卷 335-352页

作者： Li, Yuancheng Chu, Shengnan North China Elect Power Univ Sch Control & Comp Engn Beijing Peoples R China

Electric power SCADA (Supervisory Control and Data Acquisition) system gradually transforming from a separate private network to an open public network, seriously increases the vulnerability risk in electric power SCADA. In order to assess the vulnerability risk in electric power SCADA system, the paper firstly uses Delphi method and AHP (Analytic Hierarchy Process) to build an index system of vulnerability risk assessment, to fully represent the vulnerability of electric power SCADA system. As index data of vulnerability risk assessment in power SCADA is characterized by strong relation and high dimensionality, the method of autoencoder is proposed to reduce dimensionality of index data by representing high-dimensional data in a low dimensional space. Auto encoder method can obtain the optimal initial weight in pre-training and then back-propagate error derivatives adjusting weights with the initial weights to minimize the reconstruction error finally getting the best reconstructed results. The paper conducts simulation experiments about reconstruction error in pre-training and fine-tuning process in MATLAB experimental platform, and the experimental results show that dimensional code received by reducing dimensionality of data can basically fully represent high-dimensional data. The lowdimensional code as input can significantly reduce the complexity in the construction of model of vulnerability risk assessment in Electric power SCADA system in later work.

关键词： electric power SCADA system index system of vulnerability assessment autoencoder reducing dimensionality

来源：评论

学校读者我要写书评

暂无评论

VOICE CONVERSION USING DEEP NEURAL NETWORKS WITH SPEAKER-INDEPENDENT PRE-TRAINING

VOICE CONVERSION USING DEEP NEURAL NETWORKS WITH SPEAKER-IND...

引用

IEEE Workshop on Spoken Language Technology (SLT 2014)

作者： Mohammadi, Seyed Hamidreza Kain, Alexander Oregon Hlth & Sci Univ Ctr Spoken Language Understanding Portland OR 97201 USA

ISBN: (纸本)9781479971299

In this study, we trained a deep autoencoder to build compact representations of short-term spectra of multiple speakers. Using this compact representation as mapping features, we then trained an artificial neural network to predict target voice features from source voice features. Finally, we constructed a deep neural network from the trained deep autoencoder and artificial neural network weights, which were then fine-tuned using back-propagation. We compared the proposed method to existing methods using Gaussian mixture models and frame-selection. We evaluated the methods objectively, and also conducted perceptual experiments to measure both the conversion accuracy and speech quality of selected systems. The results showed that, for 70 training sentences, frame-selection performed best, regarding both accuracy and quality. When using only two training sentences, the pre-trained deep neural network performed best, regarding both accuracy and quality.

关键词： voice conversion pre-training deep neural network autoencoder

来源：评论

学校读者我要写书评

暂无评论

Modeling Video Dynamics with Deep Dynencoder

引用

13th European Conference on Computer Vision (ECCV)

作者： Yan, Xing Chang, Hong Shan, Shiguang Chen, Xilin Chinese Acad Sci Inst Comp Technol Key Lab Intelligent Informat Proc Beijing 100190 Peoples R China

ISBN: (纸本)9783319105932;9783319105925

Videos always exhibit various pattern motions, which can be modeled according to dynamics between adjacent frames. Previous methods based on linear dynamic system can model dynamic textures but have limited capacity of representing sophisticated nonlinear dynamics. Inspired by the nonlinear expression power of deep autoencoders, we propose a novel model named dynencoder which has an autoencoder at the bottom and a variant of it at the top (named as dynpredictor). It generates hidden states from raw pixel inputs via the autoencoder and then encodes the dynamic of state transition over time via the dynpredictor. Deep dynencoder can be constructed by proper stacking strategy and trained by layer-wise pre-training and joint fine-tuning. Experiments verify that our model can describe sophisticated video dynamics and synthesize endless video texture sequences with high visual quality. We also design classification and clustering methods based on our model and demonstrate the efficacy of them on traffic scene classification and motion segmentation. ...

关键词： Video Dynamics Deep Model autoencoder Time Series Dynamic Textures

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：