检索结果-内蒙古大学图书馆

Predicting coronary artery disease: a comparison between two data mining algorithms

BMC PUBLIC HEALTH 2019年第1期19卷 448-448页

作者： Ayatollahi, Haleh Gholamhosseini, Leila Salehi, Masoud Iran Univ Med Sci Hlth Management & Econ Res Ctr Tehran Iran Iran Univ Med Sci Sch Hlth Management & Informat Sci Dept Hlth Informat Management Tehran Iran AJA Univ Med Sci Sch Paramed Sci Tehran Iran Iran Univ Med Sci Sch Publ Hlth Dept Biostat Tehran Iran

BackgroundCardiovascular diseases (CADs) are the first leading cause of death across the world. World Health Organization has estimated that morality rate caused by heart diseases will mount to 23 million cases by 2030. Hence, the use of data mining algorithms could be useful in predicting coronary artery diseases. Therefore, the present study aimed to compare the positive predictive value (PPV) of CAD using artificial neural network (ANN) and SVM algorithms and their distinction in terms of predicting CAD in the selected *** present study was conducted by using data mining techniques. The research sample was the medical records of the patients with coronary artery disease who were hospitalized in three hospitals affiliated to AJA University of Medical Sciences between March 2016 and March 2017 (n=1324). The dataset and the predicting variables used in this study was the same for both data mining techniques. Totally, 25 variables affecting CAD were selected and related data were extracted. After normalizing and cleaning the data, they were entered into SPSS (V23.0) and Excel 2013. Then, R 3.3.2 was used for statistical *** SVM model had lower MAPE (112.03), higher Hosmer-Lemeshow test's result (16.71), and higher sensitivity (92.23). Moreover, variables affecting CAD (74.42) yielded better goodness of fit in SVM model and provided more accurate result than the ANN model. On the other hand, since the area under the receiver operating characteristic (ROC) curve in the SVM algorithm was more than this area in ANN model, it could be concluded that SVM model had higher accuracy than the ANN *** to the results, the SVM algorithm presented higher accuracy and better performance than the ANN model and was characterized with higher power and sensitivity. Overall, it provided a better classification for the prediction of CAD. The use of other data mining algorithms are suggested to improve the positive predictive value o

关键词： Coronary artery disease (CAD) data mining algorithms Artificial neural network (ANN) Support vector machine (SVM)

来源：评论

学校读者我要写书评

暂无评论

A Performance Comparison of data mining algorithms Based Intrusion Detection System for Smart Grid

A Performance Comparison of Data Mining Algorithms Based Int...

引用

IEEE International Conference on Electro Information Technology (EIT)

作者： El Mrabet, Zakaria El Ghazi, Hassan Kaabouch, Naima Univ North Dakota Sch Elect Engn & Comp Sci Grand Forks ND 58201 USA Natl Inst Posts & Telecommun Rabat Rabat Morocco

ISBN: (纸本)9781728109275

Smart grid is an emerging and promising technology. It uses the power of information technologies to deliver intelligently the electrical power to customers, and it allows the integration of green technology to meet the environmental requirements. Unfortunately, information technologies have inherent vulnerabilities and weaknesses that expose the smart grid to a wide variety of security risks. The Intrusion detection system (IDS) plays an important role in securing smart grid networks and detecting malicious activity, yet it suffers from several limitations. Many research papers have been published to address these issues using several algorithms and techniques. Therefore, a detailed comparison between these algorithms is needed. This paper presents an overview of four data mining algorithms used by IDS in Smart Grid. A performance evaluation of these algorithms is conducted based on several metrics including the probability of detection, probability of false alarm, probability of miss detection, efficiency, and processing time. Results show that Random Forest outperforms the other three algorithms in detecting attacks with a higher probability of detection, lower probability of false alarm, lower probability of miss detection, and higher accuracy.

关键词： Smart Grid IDS data mining algorithms probability of detection probability of false alarm Random Forest Naive Bayes

来源：评论

学校读者我要写书评

暂无评论

Application of data mining algorithms for improving stress prediction of automobile drivers: A case study in Jordan

引用

COMPUTERS IN BIOLOGY AND MEDICINE 2019年 114卷 103474-103474页

作者： Hadi, Wa'el El-Khalili, Nuha AlNashashibi, May Issa, Ghassan AlBanna, Abed Alkarim Univ Petra Comp Informat Syst Amman Jordan Univ Petra Software Engn Amman Jordan Univ Petra Comp Sci Amman Jordan

Driving daily through traffic congestion has been recognised as a major cause of stress. High levels of stress while driving negatively impact the driver's decisions which could potentially lead to accidents and other long-term health hazards. Accordingly, there is a great need to determine stress levels for drivers based on measuring and predicting the major causes (features or classes) that increase stress levels. In this paper, the problem of predicting automobile drivers' stress levels, as experienced during actual driving, is investigated through the application of five different data mining algorithms, namely K-Nearest Neighbour (KNN), Decision Tree (J48), Random Forest (RF), Support Vector Machine (SVM), and Artificial Neural Networks (ANN). An experiment was conducted on 14 drivers taking various routes in Amman Jordan, with a wearable biomedical device attached to the driver to instantly collect physiological data. The collected data (dataset) is grouped into two different categories, namely 'Yes' to signify the presence of stress and 'No' to signify the absence of stress. In order to efficiently apply data mining algorithms to the data set, oversampling was used to avoid the negative effect of driver samples with a lesser class on the prediction of stress. The findings are evaluated in relation to stress prediction and accordingly contrasted alongside standard reference approaches that do not consider oversampling and/or feature selection using the Friedman rank test. The proposed approach, in combination with RF, was seen to surpass any others in terms of accuracy, AUC, specificity, and sensitivity. The accuracy, AUC, specificity, and sensitivity rates produced by RF utilising our proposed approach were 98.92%, 99.91%, 98.46%, and 99.36%, respectively.

关键词： data mining algorithms Stress prediction Feature selection Oversampling

来源：评论

学校读者我要写书评

暂无评论

Analysis of the Samples with an Unknown Matrix Using data mining algorithms

引用

INORGANIC MATERIALS 2017年第14期53卷 1454-1457页

作者： Molchanova, E. I. Korzhova, E. N. Stepanova, T. V. Kuz'min, V. V. Irkutsk State Transport Univ Irkutsk Russia Irkutsk State Univ Irkutsk Russia

In determining a limited number of analytes in samples having a complex chemical composition with an unknown matrix, the combination of data mining algorithms (problems of clustering and regression) is proposed. This makes it possible to compensate for the influence of the components of the host medium on the intensity of the analytical line of an element being determined. The technology developed is tested in the X-ray fluorescence determination of S, Fe, Cu, Zn, and As in float concentrate samples during processing of polymetallic ores and V and Fe in synthetic film samples that are adequate in physicochemical properties to samples of welding fumes deposited on a filter. The error of the results of analysis has decreased by a factor 1.5-5 compared to the use of the Lucas-Tooth classical regression equation. The developed technology considerably increases the rapidity of analysis when it is used with X-ray spectrometers of consecutive action.

关键词： data mining algorithms cluster heterogeneous materials X-ray fluorescence analysis models of calibration functions regression equations least-squares method calibration samples adequacy error

来源：评论

学校读者我要写书评

暂无评论

Parallelization of data mining algorithms for Multicore Processors 4

Parallelization of Data Mining Algorithms for Multicore Proc...

引用

4th Mediterranean Conference on Embedded Computing (MECO

作者： Kholod, Ivan Kuprianov, Mikhail Shorov, Andrey St Petersburg Electrotech Univ LETI Fac Comp Sci & Technol St Petersburg Russia

ISBN: (纸本)9781479989997

The article describes a approach of parallel data mining algorithms to be executed on multicore processors of various architecture. The suggested method presents an algorithm as a consequence of pure functions with unified interfaces. For parallel execution additional functions are introduced to share data and models between the parallel threads. Besides such functions allow to obtain various parallel algorithm structures and implement various strategies of execution for different environment conditions. Application of the described method is illustrated through algorithm Naive Bayes.

关键词： data mining parallel data mining data mining algorithms distributed data mining parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

The Formal Model of data mining algorithms for Parallelize algorithms 19

The Formal Model of Data Mining Algorithms for Parallelize A...

引用

19th International Multiconference on Advanced Computer Systems

作者： Kholod, Ivan Karshiyev, Zaynidin Shorov, Andrey St Petersburg Electrotech Univ LETI Ul Prof Popova 5 St Petersburg Russia

ISBN: (纸本)9783319151472;9783319151465

The present paper describes the formal model of data mining algorithms. These models consider each data mining algorithm as a sequence of operations. This allows us to determine ways for parallel execution of data mining algorithms. The software implementation of the formal model is executed on the Java language. A few data mining algorithms were developed on the basis of the suggested formal modal. The algorithm k-means is described in the paper as the example.

关键词： data mining Parallel data mining data mining algorithms Parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Creation of data mining algorithms as Functional Expression for Parallel and Distributed Execution 13th

引用

13th International Conference on Parallel Computing Technologies (PaCT)

作者： Kholod, Ivan Petukhov, Ilya St Petersburg Electrotech Univ LETI St Petersburg Russia

ISBN: (纸本)9783319219097;9783319219080

The article describes extension of lambda-calculation for creation of parallel data mining algorithms. The proposed approach uses presentation of the algorithm as a consequence of pure functions with unified interfaces. For parallel execution we use special function that allows to change a structure of the algorithm and to implement various strategies for processing of data set and model.

关键词： Parallel algorithms data mining Parallel data mining Distributed data mining data mining algorithms

来源：评论

学校读者我要写书评

暂无评论

Framework for Multi Threads Execution of data mining algorithms

Framework for Multi Threads Execution of Data Mining Algorit...

引用

IEEE North West Russia Section Young Researchers in Electrical and Electronic Engineering Conference (2015 ElConRusNW)

作者： Kholod, Ivan St Petersburg Electrotech Univ LETI Fac Comp Sci & Technol St Petersburg Russia

ISBN: (纸本)9781479973064

the present paper describes the framework for creating data mining algorithms from thread-safe functional blocks. This framework requirements decomposition of algorithms into independently functioning blocks. These blocks must have unified interfaces and implement pure functions. The framework allows create new data mining algorithms from existing blocks and improves the existing algorithms by optimizing single blocks or the whole structure of the algorithms. This becomes possible due to a number of important properties such as thread-safety inherent in pure functions and hence functional blocks.

关键词： data mining parallel data mining data mining algorithms distributed data mining parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Prediction of fresh herbage yield using data mining techniques with limited plant quality parameters

引用

SCIENTIFIC REPORTS 2024年第1期14卷 1-15页

作者： Celik, Senol Tutar, Halit Gonulal, Erdal Er, Hasan Bingol Univ Fac Agr Dept Anim Sci Biometry & Genet Unit TR-12000 Bingol Turkiye Bingol Univ Fac Agr Dept Field Crops TR-12000 Bingol Turkiye Bahri Dagdas Int Agr Res Inst TR-42000 Konya Turkiye Bingol Univ Fac Agr Dept Biosyst Engn TR-12000 Bingol Turkiye

The purpose of this study was to ascertain the fresh herbage yield, fertilizer dosage, and plant characteristics of the Sorghum-Sudangrass hybrid grown in arid and semi-arid regions, as well as their interrelationships. For this reason, data from the Sorghum-Sudangrass hybrid were used to assess the predictive performance of several data mining techniques, including CHAID, CART, MARS, and Bagging MARS. Plant traits were measured in Konya and Sanliurfa during 2021 and 2022. The descriptive statistical values were calculated as follows: plant height 306.27 cm, stem diameter 9.47 mm, fresh herbage yield 10852.51 kg/da, crude protein ratio 9.66%, acid detergent fiber 33.39%, neutral detergent fiber 51.85%, acid detergent lignin 9.76%, dry matter digestibility 62.88%, dry matter intake 2.34%, and relative feed value 114.68 (average values). The predictive capacities of the fitted models were assessed using model fit statistics such as the coefficient of determination (R-2), adjusted R-2, root mean square error (RMSE), mean absolute percentage error (MAPE), standard deviation ratio (SD ratio), and Akaike Information Criterion (AIC). With the lowest values for RMSE, MAPE, SD ratio, and AIC (246, 1.926, 0.085, and 845, respectively), and the highest R-2 value (0.993) and adjusted R-2 value (0.989), the MARS algorithm was determined to be the best model for characterizing fresh herbage yield. As a solid alternative to other data mining techniques, the MARS algorithm was shown to be the most appropriate model for forecasting fresh herbage production.

关键词： data mining algorithms Fertilizer dose Fresh herbage yield Sorghum-sudangrass hybrid

来源：评论

学校读者我要写书评

暂无评论

Using data mining techniques to generate test cases from graph transformation systems specifications

引用

AUTOMATED SOFTWARE ENGINEERING 2024年第1期31卷 17-17页

作者： Araghi, Maryam Asgari Rafe, Vahid Khendek, Ferhat Concordia Univ Dept Elect & Comp Engn Montreal PQ Canada Goldsmiths Univ London Dept Comp London England Arak Univ Fac Engn Dept Comp Engn Arak *** Iran

Software testing plays a crucial role in enhancing software quality. A significant portion of the time and cost in software development is dedicated to testing. Automation, particularly in generating test cases, can greatly reduce the cost. Model-based testing aims at generating automatically test cases from models. Several model based approaches use model checking tools to automate test case generation. However, this technique faces challenges such as state space explosion and duplication of test cases. This paper introduces a novel solution based on data mining algorithms for systems specified using graph transformation systems. To overcome the aforementioned challenges, the proposed method wisely explores only a portion of the state space based on test objectives. The proposed method is implemented using the GROOVE tool set for model-checking graph transformation systems specifications. Empirical results on widely used case studies in service-oriented architecture as well as a comparison with related state-of-the-art techniques demonstrate the efficiency and superiority of the proposed approach in terms of coverage and test suite size.

关键词： Software testing Model-based testing Test case generation Model checking data mining algorithms Graph transformation systems

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：