Vertical data structure is very important for closed frequent itemset mining. All closed frequent itemsets-can be found by simply using the operations of AND/OR. However, it consumes a large amount of storage space, e...
详细信息
Vertical data structure is very important for closed frequent itemset mining. All closed frequent itemsets-can be found by simply using the operations of AND/OR. However, it consumes a large amount of storage space, especially in the case of large-size dataset. This paper proposes an algorithm for mining closed frequent itemsets based on a new vertical data structure. The proposed data structure is helpful to save storage space by using a multi-layer index. At the same time, numerous CPU and graphics processing unit can be employed in parallel to achieve high-efficiency computing. Especially when dealing with large datasets, the proposed algorithm can obtain a high-speed computing with the help of graphics processing unit. The improved vertical structure reduces the storage space of the data. The experimental results show that our proposed algorithm requires much less computation time than other related methods. Copyright (C) 2016 John Wiley & Sons, Ltd.
With the rapid development of China39;s urbanization, land use automatic monitoring is imperative for the efficient allocation of resources and environmental conservation, especially in a metropolitan area with incr...
详细信息
With the rapid development of China's urbanization, land use automatic monitoring is imperative for the efficient allocation of resources and environmental conservation, especially in a metropolitan area with increased ecological-environmental stress as a result of large high-density population. Qingpu, a district of Shanghai, is located in Upper Huangpu Catchment for fresh water supply and an area of considerable ecological value, but it has been also experiencing development pressures from urban sprawl. An automatic procedure was proposed by using remotely sensed mass data to reveal the land use change in Qingpu District. In this procedure, historical remotely sensed data were first registered into the same spatial resolution and radioactive level. Then, data were subset via the boundary of study area for facilitating the following processing. Forward principal components rotation(FPCR) and iterative self-organizing data analysis techniques algorithm(FPCR-ISOdata) was developed to decorrelate or denoise data and divided the data into k groups(k>the number of land use types). After merging the same connoted groups and correcting misclassified units with promising accuracy, the series of land use map are created. The results further elaborated on the spatial-temporal change of agricultural land in Qingpu along with other land use changes over the past two decades. The conversions between land uses were shown as bare land appearing in a state of uncompleted development or transition. The results were consistent with reality and the method was accurate and practical.
The evolution of modern approach in knowledge systems, decision support systems and clinical constraints estimation algorithms that formulate machine learning, softcomputing and data mining in presenting a new outloo...
详细信息
ISBN:
(纸本)9781467394178
The evolution of modern approach in knowledge systems, decision support systems and clinical constraints estimation algorithms that formulate machine learning, softcomputing and data mining in presenting a new outlook for health informatics domain. Health is then clearly understood as the essential part while describing a person's sense of well-being. The delivery of health care services therefore considers as higher proportion, and played an efficient role in information and communication technologies for its effective distribution mechanism. data mining in health informatics are developing into optimistic area for producing vision from diverse data set. data mining techniques are proved to be as a valuable resource for health care informatics. The main scope of writing this paper is to analyse the effectiveness of data mining techniques in health informatics and compare various techniques, approaches or methods and different tools used and its effect on the healthcare industry. The main motive of using data mining application in healthcare systems is to exploit a machine driven tool for identifying and circulating useful and relevant healthcare information.
This study proposed the application of data mining technique namely Adaptive Neuro-Fuzzy inference system (ANFIS) on forest fires hotspot data to develop classification models for hotspots occurrence in Central Kalima...
This study proposed the application of data mining technique namely Adaptive Neuro-Fuzzy inference system (ANFIS) on forest fires hotspot data to develop classification models for hotspots occurrence in Central Kalimantan. Hotspot is a point that is indicated as the location of fires. In this study, hotspot distribution is categorized as true alarm and false alarm. ANFIS is a softcomputing method in which a given inputoutput data set is expressed in a fuzzy inference system (FIS). The FIS implements a nonlinear mapping from its input space to the output space. The method of this study classified hotspots as target objects by correlating spatial attributes data using three folds in ANFIS algorithm to obtain the best model. The best result obtained from the 3rd fold provided low error for training (error = 0.0093676) and also low error testing result (error = 0.0093676). Attribute of distance to road is the most determining factor that influences the probability of true and false alarm where the level of human activities in this attribute is higher. This classification model can be used to develop early warning system of forest fire.
With the development of the research in data mining, cluster analysis has been widely used in several areas. Aiming at the issue that traditional clustering methods are not appropriate to high-dimensional data, a cuck...
详细信息
ISBN:
(纸本)9781785610899
With the development of the research in data mining, cluster analysis has been widely used in several areas. Aiming at the issue that traditional clustering methods are not appropriate to high-dimensional data, a cuckoo search fuzzy-weighting algorithm for subspace clustering is proposed on the basis of the existing soft subspace clustering algorithms. In the proposed algorithm, a novel objective function is firstly designed by considering the fuzzy weighting within-cluster compactness and the between-cluster separation, and loosening the constraints of dimension weight matrix. Then gradual membership and improved cuckoo search, a global search strategy, are introduced to optimize the objective function and search subspace clusters, giving novel learning rules for clustering. At last, the performance of the proposed algorithm on the clustering analysis of various low and high dimensional datasets is experimentally compared with that of several competitive subspace clustering algorithms. Experimental studies demonstrate that the proposed algorithm can obtain better performance than most of the existing soft subspace clustering algorithms.
Owing to the factors of cost and time limit, the number of samples is usually small in the early stages of manufacturing systems. When the number of available data is small, traditional statistic techniques have diffi...
详细信息
ISBN:
(纸本)9781467396431
Owing to the factors of cost and time limit, the number of samples is usually small in the early stages of manufacturing systems. When the number of available data is small, traditional statistic techniques have difficulty to obtain robust analyses. Therefore, based on a uni-modality distribution assumption, many researchers have proposed virtual sample generation methods to expand the training sample size to enhance the performance of small data set learning. In practice, small data may be following a multi-modality distribution. Therefore, in order to solve multi-modal small data sets, this study proposes a new approach to create multi-modality Weibull virtual samples, where we use the maximal p-value to estimate parameters of the Weibull distribution. In addition, the soft DBSCAN method is used to identify a suitable number of modalities. One data set is employed to check the performance of the proposed method, and comparisons are made by the prediction on root mean square error. The results using a paired t-test show that the proposed method has a superior prediction performance than that of the mega-trend-diffusion method using a uni-modality triangular membership function.
Support vector machine is a kind of machine learning method,which can be trained by using the nonlinear mapping algorithm to transform a low dimensional input space into a high dimensional feature space,so that the hi...
详细信息
Support vector machine is a kind of machine learning method,which can be trained by using the nonlinear mapping algorithm to transform a low dimensional input space into a high dimensional feature space,so that the high dimensional feature space can be used to analyze the nonlinear characteristics of the *** this paper,we use support vector machine to predict the stock price,and we discuss and experiment with how to select the appropriate historical data attributes of stock price as the training *** experimental results show that the correct data attributes can improve the accuracy of stock price forecasting,but inappropriate data attributes can make the prediction accuracy *** the same time,the training focused on the use of too many of the data attributes will increase the prediction time to a certain extent.
The development of electronic features for use in apparel has advanced rapidly in recent years, and applications in athletic wear have been particularly successful. However, 39;Smart Fashion39; has not yet been in...
详细信息
ISBN:
(数字)9783319076386
ISBN:
(纸本)9783319076386;9783319076379
The development of electronic features for use in apparel has advanced rapidly in recent years, and applications in athletic wear have been particularly successful. However, 'Smart Fashion' has not yet been integrated into everyday garments. In this paper we propose a new approach to the design of interfaces in Smart Fashion, which we refer to as the soft User Interface (SUI). The ways in which e-textiles physically convey information differs greatly from traditional ways in that information is communicated via graphical user interfaces on computers, smartphones or on WearComp devices. As a result of our research, we advocate the use of iconic and indexical signs for Smart Fashion as these are widely accessible and understood. As an extension to this new interface paradigm, we expect that the harvesting of biometric data, including bodily gestures, will significantly extend the possibilities of SUIs.
In this work we have studied the influence of applied DC electric field on the SHG conversion efficiency of poled organic polymeric films. Regression analysis is carried out to get the desired relationship between the...
详细信息
暂无评论