ISBN: (Print) 9781509055593
Bangla is one of the most widely used languages worldwide. This paper presents an application of image retrieval techniques to automatically judge the aesthetic quality of handwritten Bangla isolated characters. Retrieval techniques are also adapted to give improvement suggestions, with a plan to incorporate the methods into applications that assist in learning and teaching handwriting. The proposed method borrows key concepts from content-based image retrieval. Our method was tested on the BanglaLekha-Isolated dataset, which contains images of 84 Bangla characters, with nearly 2,000 samples per character. The dataset includes evaluations of the aesthetic quality of the handwriting, judged on a scale of 1-5. For this work, the dataset was partitioned, per Bangla character, into a test set of 400 images and a database set of approximately 1,600 images. Assuming that a scoring difference of 1 is acceptable, the proposed method achieves an accuracy of 77.39% when using features extracted by a convolutional neural network based autoencoder. Experiments were also done with the popular HOG feature; however, the autoencoder-based results were clearly superior to the HOG-based results. Our proposed method for improvement suggestions also shows that it is possible to present samples from the dataset that help users improve their handwriting while requiring only small changes to their own writing.
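The retrieval-based scoring idea described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the feature dimensionality, the neighborhood size k, and the random stand-ins for autoencoder features and human scores are all assumed.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins for autoencoder features: a database of 1600 feature
# vectors for one character, each with a human aesthetic score in 1..5.
db_feats = rng.normal(size=(1600, 64))
db_scores = rng.integers(1, 6, size=1600)

def predict_score(query_feat, k=5):
    """Score a query image by the mean score of its k nearest database
    samples (Euclidean distance in the learned feature space)."""
    dists = np.linalg.norm(db_feats - query_feat, axis=1)
    nearest = np.argsort(dists)[:k]
    return db_scores[nearest].mean()

query = rng.normal(size=64)
pred = predict_score(query)
# Under the paper's criterion, a prediction within +/-1 of the human
# score would count as correct.
print(pred)
```

The same nearest-neighbor machinery supports improvement suggestions: well-scored neighbors close to the query are, by construction, samples the writer could reach with small changes.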
ISBN: (Print) 9789897582479
We discuss how an autoencoder can detect system-level anomalies in a real-time gross settlement system by reconstructing a set of liquidity vectors. A liquidity vector is an aggregated representation of the underlying payment network of a settlement system for a particular time interval. Furthermore, we evaluate the performance of two autoencoders on real-world payment data extracted from the TARGET2 settlement system. We do this by generating different types of artificial bank runs in the data and determining how the autoencoders respond. Our experimental results show that the autoencoders are able to detect unexpected changes in the liquidity flows between banks.
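The reconstruction-error detection principle can be illustrated with a linear stand-in: an optimal linear autoencoder with tied weights learns the same subspace as PCA, so an SVD fit serves as a sketch of the idea. The liquidity vectors below are synthetic, not TARGET2 data, and the threshold rule is an assumption for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic "liquidity vectors": normal intervals follow a low-rank
# pattern; an anomalous interval (e.g. a simulated bank run) breaks it.
basis = rng.normal(size=(3, 20))
normal = rng.normal(size=(500, 3)) @ basis + 0.05 * rng.normal(size=(500, 20))

# Fit encoder/decoder as the top principal components of normal data
# (equivalent to the optimal tied-weight linear autoencoder).
mean = normal.mean(axis=0)
_, _, vt = np.linalg.svd(normal - mean, full_matrices=False)
components = vt[:3]                               # encoder weights

def recon_error(x):
    code = (x - mean) @ components.T              # encode
    recon = code @ components + mean              # decode
    return np.linalg.norm(x - recon)

# Flag intervals whose reconstruction error exceeds anything seen
# during normal operation (margin of 10% is an arbitrary choice).
threshold = max(recon_error(v) for v in normal) * 1.1
anomaly = 5.0 * rng.normal(size=20)               # off-pattern vector
print(recon_error(anomaly) > threshold)           # True
```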
ISBN: (Print) 9783319700878; 9783319700861
Malicious software is generated with ever more modifications of the very features that detection methods rely on. Automatic classification of malicious software is efficient because it does not need to store all of those characteristics. In this paper, we propose a transferred generative adversarial network (tGAN) for automatic classification and detection of zero-day attacks. Since the GAN is unstable in the training process, often resulting in a generator that produces nonsensical outputs, a method to pre-train the GAN with an autoencoder structure is proposed. We analyze the detector, and its performance is visualized by observing the clustering pattern of malicious software using the t-SNE algorithm. The proposed model achieves the best performance compared with conventional machine learning algorithms.
ISBN: (Print) 9783319525037; 9783319525020
Document classification is challenging due to the voluminous and highly non-linear data generated exponentially in the era of digitization. Proper representation of documents increases the efficiency and performance of classification, the ultimate goal being to retrieve information from a large corpus. Deep neural network models learn features for document classification, unlike engineered-feature approaches in which features are extracted or selected from the data. In this paper we investigate the performance of different classifiers based on features obtained using two approaches: we apply a deep autoencoder to learn features, while engineered features are extracted by exploiting semantic associations among the terms of the documents. Experimentally, we observe that classification based on the learned features consistently outperforms the proposed engineered-feature-based classifiers.
ISBN: (Print) 9783319618456; 9783319618449
The weather has a strong influence on food retailers' sales, as it affects customers' emotional state, drives their purchase decisions, and dictates how much they are willing to spend. In this paper, we introduce a deep learning based method which uses meteorological data to predict the sales of a Japanese supermarket chain. Specifically, our method combines a long short-term memory (LSTM) network and a stacked denoising autoencoder network, both of which learn how sales change with the weather from a large amount of historical data. We show that our method achieved initial success in predicting the sales of some weather-sensitive products such as drinks. In particular, our method outperforms traditional machine learning methods by 19.3%.
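The denoising autoencoder component can be sketched in a few lines: corrupt the input, then train the network to reconstruct the clean version. This is a single-layer tied-weight toy with manual gradients, not the paper's stacked model; the data, layer sizes, and learning rate are assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)

# Toy daily feature vectors; the paper's inputs would combine sales
# history with meteorological features.
X = rng.random((200, 10))

W = rng.normal(0.0, 0.1, size=(10, 6))   # encoder weights (decoder uses W.T)
b, c = np.zeros(6), np.zeros(10)
lr = 0.1
losses = []
for _ in range(300):
    noisy = X + 0.1 * rng.normal(size=X.shape)   # corruption step
    h = np.tanh(noisy @ W + b)                   # encode corrupted input
    out = h @ W.T + c                            # decode with tied weights
    err = out - X                                # target is the *clean* input
    losses.append((err ** 2).mean())
    dZ = (err @ W) * (1 - h ** 2)                # backprop through tanh
    W -= lr / len(X) * (err.T @ h + noisy.T @ dZ)
    b -= lr / len(X) * dZ.sum(axis=0)
    c -= lr / len(X) * err.sum(axis=0)

print(losses[0], "->", losses[-1])               # reconstruction error drops
```

Stacking repeats this recipe layer by layer, feeding each trained hidden representation to the next denoising layer.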
ISBN: (Print) 9781538627150
In an era when big data are becoming the norm, there is less concern with the quantity of data than with its quality and completeness. In many disciplines, data are collected from heterogeneous sources, resulting in multi-view or multi-modal datasets. The missing data problem has been challenging to address in multi-view data analysis. In particular, when certain samples miss an entire view of data, the missing view problem arises. Classic multiple-imputation or matrix-completion methods are hardly effective here, since the missing view offers no information on which to base imputation for such samples. The commonly used simple remedy of removing samples with a missing view can dramatically reduce the sample size, thus diminishing the statistical power of subsequent analyses. In this paper, we propose a novel approach for view imputation via generative adversarial networks (GANs), which we name VIGAN. This approach first treats each view as a separate domain and identifies domain-to-domain mappings via a GAN using randomly sampled data from each view, and then employs a multi-modal denoising autoencoder (DAE) to reconstruct the missing view from the GAN outputs based on paired data across the views. By optimizing the GAN and the DAE jointly, our model integrates knowledge of the domain mappings and the view correspondences to effectively recover the missing view. Empirical results on benchmark datasets validate the VIGAN approach against the state of the art, and an evaluation of VIGAN in a genetic study of substance use disorders further demonstrates the effectiveness and usability of this approach in the life sciences.
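The core imputation idea, learning a cross-view mapping from paired samples and applying it where one view is missing, can be reduced to a minimal linear sketch. A least-squares map stands in for the GAN and DAE stages here; the two-view data are synthetic, and all sizes are assumptions.

```python
import numpy as np

rng = np.random.default_rng(3)

# Two synthetic views generated from the same underlying samples.
latent = rng.normal(size=(300, 4))
view_a = latent @ rng.normal(size=(4, 8))
view_b = latent @ rng.normal(size=(4, 6))

# Learn a view-A -> view-B mapping from the first 250 paired samples.
# (VIGAN learns this mapping with a GAN and refines it with a
# multi-modal DAE; ordinary least squares is a linear stand-in.)
M, *_ = np.linalg.lstsq(view_a[:250], view_b[:250], rcond=None)

# Impute the "missing" view B for held-out samples from view A alone.
imputed = view_a[250:] @ M
err = np.abs(imputed - view_b[250:]).mean()
print(err < 1e-6)   # True: the linear toy recovers view B exactly
```

The linear toy succeeds because the views share an exactly linear latent structure; the point of the GAN/DAE combination is to handle the nonlinear, noisy correspondences found in real multi-modal data.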
ISBN: (Print) 9781509041176
We propose to use a feature representation obtained by pairwise learning in a low-resource language for query-by-example spoken term detection (QbE-STD). We assume that word pairs identified by humans are available in the low-resource target language. The word pairs are parameterized by a multi-lingual bottleneck feature (BNF) extractor that is trained using transcribed data in high-resource languages. The multi-lingual BNFs of the word pairs are used as an initial feature representation to train an autoencoder (AE). We extract features from an internal hidden layer of the pairwise-trained AE to perform acoustic pattern matching for QbE-STD. Our experiments on the TIMIT and Switchboard corpora show that pairwise learning brings 7.61% and 8.75% relative improvements in mean average precision (MAP), respectively, over the initial feature representation.
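The acoustic pattern matching step in QbE-STD is typically done with dynamic time warping (DTW) over frame-level feature sequences. A minimal sketch follows; the random sequences stand in for the AE bottleneck features, and the plain unconstrained DTW recursion is an assumption (real systems often use subsequence or segmental variants).

```python
import numpy as np

def dtw(q, d):
    """DTW distance between feature sequences q (m, k) and d (n, k),
    using Euclidean frame-to-frame distances."""
    m, n = len(q), len(d)
    cost = np.full((m + 1, n + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            frame = np.linalg.norm(q[i - 1] - d[j - 1])
            cost[i, j] = frame + min(cost[i - 1, j],      # insertion
                                     cost[i, j - 1],      # deletion
                                     cost[i - 1, j - 1])  # match
    return cost[m, n]

rng = np.random.default_rng(4)
query = rng.normal(size=(20, 32))       # spoken-query feature frames
match = np.repeat(query, 2, axis=0)     # same word, spoken twice as slowly
other = rng.normal(size=(40, 32))       # unrelated utterance
print(dtw(query, match) < dtw(query, other))   # True
```

Warping absorbs the tempo difference, so the slowed-down copy of the query scores far below the unrelated sequence.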
ISBN: (Print) 9781538625859
Selecting a set of features with the best discrimination is always a challenge in classification. In this paper we propose a method, named GLLC (General Locally Linear Combination), that extracts features using a deep autoencoder and reconstructs a sample from other samples in a low-dimensional space; the class with the minimum reconstruction error is then selected as the winner. Extracting features together with the discriminative characteristic of the sparse model creates a robust classifier that simultaneously reduces both samples and features. Although the main applications of GLLC are visual classification and face recognition, it can be used in other domains as well. We conduct extensive experiments demonstrating that the proposed algorithm attains high accuracy on various datasets and outperforms state-of-the-art methods.
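The minimum-reconstruction-error classification step can be sketched directly. The synthetic features below stand in for the deep-autoencoder output, and plain least squares replaces the sparse model of GLLC, so this illustrates only the decision rule, not the full method.

```python
import numpy as np

rng = np.random.default_rng(5)

# Two classes, each spanning its own low-dimensional subspace of the
# (assumed) autoencoder feature space.
basis0 = rng.normal(size=(3, 16))
basis1 = rng.normal(size=(3, 16))
class0 = rng.normal(size=(50, 3)) @ basis0
class1 = rng.normal(size=(50, 3)) @ basis1

def reconstruction_error(x, samples):
    """Residual of the best linear combination of one class's samples
    approximating x (GLLC would add a sparsity constraint here)."""
    coeffs, *_ = np.linalg.lstsq(samples.T, x, rcond=None)
    return np.linalg.norm(samples.T @ coeffs - x)

def classify(x):
    errs = [reconstruction_error(x, c) for c in (class0, class1)]
    return int(np.argmin(errs))        # class with minimum error wins

test = rng.normal(size=3) @ basis1     # a new class-1 sample
print(classify(test))                  # 1
```

A new sample lying in a class's subspace is reconstructed almost exactly by that class's samples, so its residual there is near zero while the other class's residual stays large.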
ISBN: (Print) 9781509021758
We tackle the problem of mobile visual search. The Moving Picture Experts Group (MPEG) has completed a standard named Compact Descriptors for Visual Search (CDVS) to provide a standardized syntax for image retrieval applications. CDVS applies principal component analysis (PCA) to reduce the dimension of the local feature descriptor used as input to the global descriptor pipeline, and uses the traditional Fisher vector as the aggregation algorithm for local feature descriptors. However, the descriptor components of SIFT and the Fisher vector (FV) have highly non-Gaussian statistics, and applying a single PCA transform can in fact hurt compression performance at high rates. We develop a net-based architecture that combines neural networks with an FV layer to obtain the Fisher vector. Our architecture has two advantages over the CDVS global descriptor pipeline: first, we employ autoencoder networks to reduce the dimensionality of the data; second, we exploit a trainable system that learns its parameters after the FV codebook is obtained. The experiments demonstrate a clear advantage of our proposed architecture on the CDVS retrieval task.
ISBN: (Print) 9781509063413
We aim to reduce the cost of sound monitoring for maintaining machinery by reducing the sampling rate, i.e., sub-Nyquist sampling. Monitoring based on sub-Nyquist sampling requires two sub-systems: an on-site sub-system for sampling machinery sounds at a low rate, and an off-site sub-system for detecting anomalies from the subsampled signal. This paper proposes a method for realizing both sub-systems. First, the proposed method uses non-uniform sampling to encode frequency components above the Nyquist frequency. Second, the method applies a long short-term memory (LSTM) based autoencoder network to detect anomalies. The novelty of the proposed network is that the subsampled time-domain signal is demultiplexed and received as input in an end-to-end manner, enabling anomaly detection directly from the subsampled signal. Experimental results indicate that our method is suitable for anomaly detection from the subsampled signal.