This paper presents a novel clustering algorithm based on clustering coefficient. It includes two steps: First, k-nearest-neighbor method and correlation convergence are employed for a preliminary clustering. Then, th...
详细信息
ISBN:
(纸本)9783319117492;9783319117485
This paper presents a novel clustering algorithm based on clustering coefficient. It includes two steps: First, k-nearest-neighbor method and correlation convergence are employed for a preliminary clustering. Then, the results are further split and merged according to intra-class and inter-class concentration degree based on clustering coefficient. The proposed method takes correlation between each other in a cluster into account, thereby improving the weakness existed in previous methods that consider only the correlation with center or core data element. Experiments show that our algorithm performs better in clustering compact data elements as well as forming some irregular shape clusters. It is more suitable for applications with little prior knowledge, e. g. hotspots discovery.
The current online English learning resource push methods have problems of poor customer satisfaction, low reliability of pushed resources and low recall rate of resource pushes. Therefore, this paper proposes an onli...
详细信息
The current online English learning resource push methods have problems of poor customer satisfaction, low reliability of pushed resources and low recall rate of resource pushes. Therefore, this paper proposes an online English learning resource push method based on Bayesian inference. Firstly, obtain online English learning resource data and classify online learning resources. Then, by mining and analysing learner learning data, clustering algorithms are used to locate and infer the learner's learning style. Finally, based on Bayesian inference, a Naive Bayesian classifier is developed, and a network English online learning resource push model is developed to achieve effective network English online learning resource distribution. Through relevant experiments, it has been confirmed that the customer satisfaction of this method varies from 96.0% to 99.8%, the push reliability varies from 90.5% to 99.8% and the resource push recall rate is 99.9%, which has the characteristic of good push effect.
作者:
Modak, SoumitaFaculty
Department of Statistics University of Calcutta Basanti Devi College Kolkata India
In this paper, a novel nonparametric norm-based clustering algorithm is proposed to classify real-valued continuous data sets given in arbitrary dimensional space. For data univariate, multivariate or high-dimensional...
详细信息
As information technology and data mining techniques evolve at a breakneck pace, they bring transformative potential to the educational landscape. The burgeoning growth of online educational resources not only enriche...
详细信息
As information technology and data mining techniques evolve at a breakneck pace, they bring transformative potential to the educational landscape. The burgeoning growth of online educational resources not only enriches learning but also introduces complexities in resource management and personalization. This study pioneers the construction of a Fundamental Training Educational Resource Repository (FTERR) by harnessing sophisticated data mining approaches. We initiate by intricately mapping students' learning needs through knowledge graph and semantic analysis, ensuring a deep alignment with individual educational journeys. Subsequently, we address the hurdles in resource management by introducing an innovative data storage model. This model, in synergy with advanced clustering algorithms, streamlines the retrieval process, rendering it both swift and intuitive. Our research transcends traditional data mining applications in education, steering towards a more informed and responsive educational ecosystem. This novel approach not only elevates the precision and efficiency of educational resource allocation but also significantly enriches the student learning experience, marking a leap forward in educational informatization. The main purpose of this study is to address the issue of insufficient personalized learning needs in online educational resources by introducing advanced data mining technologies, including knowledge graphs and semantic analysis. The study develops new data storage models and utilizes clustering algorithms to achieve rapid and visual retrieval of educational resources, thereby enhancing the utilization efficiency of educational resources and improving students' learning experiences. The methods and findings of this study can provide references for data mining applications in other fields and promote the application and development of data science in a broader range of fields.
An integration algorithm for clustering is presented, in which a maximization & minimum algorithm to determine the initial centers and BWP (Between-Within Proportion) index for input of optimal k. In theory, the b...
详细信息
ISBN:
(纸本)9783038350064
An integration algorithm for clustering is presented, in which a maximization & minimum algorithm to determine the initial centers and BWP (Between-Within Proportion) index for input of optimal k. In theory, the bigger the BWP index, the better the clustering effectiveness. Then a numerical example of air transport market segment is presented to show the effectiveness and efficiency of the method presented in the document.
The basis for grouping structural elements is a tradeoff between optimization in design and construction. The grouped elements should have common optimal design specifications that are structurally safe and convenient...
详细信息
An important feature of structural data especially those from structural determination and protein-ligand docking programs is that their distribution could be both uniform and non-uniform. Traditional clustering algor...
详细信息
ISBN:
(纸本)9783319052694;9783319052687
An important feature of structural data especially those from structural determination and protein-ligand docking programs is that their distribution could be both uniform and non-uniform. Traditional clustering algorithms developed specifically for non-uniformly distributed data may not be adequate for their classification. Here we present a geometric partitional algorithm that could be applied to both uniformly and non-uniformly distributed data. The algorithm is a top-down approach that recursively selects the outliers as the seeds to form new clusters until all the structures within a cluster satisfy certain requirements. The applications of the algorithm to a diverse set of data from NMR structure determination, protein-ligand docking and simulation show that it is superior to the previous clustering algorithms for the identification of the correct but minor clusters. The algorithm should be useful for the identification of correct docking poses and for speeding up an iterative process widely used in NMR structure determination.
This paper proposed an efficient PSO clustering algorithm with point symmetry distance based on cooperative evolution strategy. It not only determined the number of clusters, but also detected the proper partitions in...
详细信息
ISBN:
(纸本)9781845648299;9781845648282
This paper proposed an efficient PSO clustering algorithm with point symmetry distance based on cooperative evolution strategy. It not only determined the number of clusters, but also detected the proper partitions in data sets when the data sets possess the property of symmetry. In the algorithm, a new point symmetry distance is used to compute the similarity instead of the Euclid distance. Cooperative evolution strategy with multi-populations is introduced to prevent the PSO algorithm from trapping into the local optimal solution. The performance of the proposed algorithm is tested in two artificial data sets. The simulation results show that the performance of the algorithm is better than other algorithms mentioned in this paper.
WSNs consists several nodes spread over experimental fields for specific application temporarily. The spatially distributed sensor nodes sense and gather the information for intended parameters like temperature, sound...
详细信息
ISBN:
(纸本)9781479946235
WSNs consists several nodes spread over experimental fields for specific application temporarily. The spatially distributed sensor nodes sense and gather the information for intended parameters like temperature, sound, vibrations, etc for the particular application. In this paper, we evaluate the impact of different algorithms i.e. clustering for densely populated field application, energy backup by adding energy harvesting node in field, positioning energy harvesting node smartly in the field and also positioning the base station in sensor field to optimize the communication between cluster head and base station. The analysis and simulation results justifies that availability of power backup for cluster nodes using energy harvesting and positioning the energy harvesting node and also base station enhance the lifetime of sensor network fields. WSN with power backup density based clustering algorithm can be applied for many sensitive applications like military for hostile and remote areas or environmental monitoring where human intervention is not possible.
With the progress of technology and the enhancement of social demand for privacy protection, optical monitoring equipment has gradually caused public concern. In contrast, millimeter-wave(mmWave) radar monitoring has ...
详细信息
With the progress of technology and the enhancement of social demand for privacy protection, optical monitoring equipment has gradually caused public concern. In contrast, millimeter-wave(mmWave) radar monitoring has been rapidly developed because of its superiority in privacy. However, the indoor environment is relatively complex, and traditional density-based clustering algorithms perform poorly in accurate tracking. The point cloud data generated from indoor scenario echoes collected by mmWave radar is relatively sparse and accompanied by noise points, which significantly affects tracking performance. In this paper, we propose an improved DBSCAN clustering algorithm that uses a multi-frame aggregation method to suppress multipath effects and eliminate 'false targets'. It is combined with the extended Kalman filter(EKF) algorithm to form a complete system. In our system, the raw data collected by mmWave radar is processed by fast fourier transform(FFT), static clutter removal and constant false alarm rate(CFAR) to obtain point cloud data. Since the density of point cloud data greatly affects the performance of clustering algorithms, we use multi-frame aggregation method to process the point cloud data to increase its density. Accurate indoor personnel tracking is then achieved through clustering and extended Kalman filtering, and the tracking error is within 0.1 m.
暂无评论