Feature selection is a crucial step in data preprocessing because feature selection reduces the dimensionality of data by eliminating irrelevant and redundant features. Since manual labeling is expensive, unsupervised...
详细信息
Feature selection is a crucial step in data preprocessing because feature selection reduces the dimensionality of data by eliminating irrelevant and redundant features. Since manual labeling is expensive, unsupervised feature selection has received increasing attention in recent years. However, existing unsupervised feature selection methods tend to prioritize selecting highly correlated features over exploring feature diversity. Thus, a regularized fractal autoencoder(RFAE) method is proposed to select informative features in an unsupervised way. Specifically, the fractal autoencoder network extends autoencoders to construct a correspondence neural network and a selection neural network. The correspondence neural network exploits interfeature correlations and the selection neural network selects the informative features. A redundancy regularization strategy consists of a redundancy elimination regularization term based on the dependency between features and a sparse regularization term based on the group lasso. The redundancy regularization strategy eliminates feature subset redundancy and enhances network generalization ability. Extensive experimental results on six publicly available datasets show that the proposed RFAE outperforms the compared methods regarding clustering accuracy and classification accuracy. Moreover, the proposed RFAE achieves acceptable computation efficiency.
暂无评论