"Tree-based ensemble algorithms" (TEAs) are extensively employed for classification and regression problems. However, existing TEAs lag behind the trade-off between TEA interpretability and achieving cutting...
详细信息
Adolescents' positive mental health is deeply associated with their growth. Identifying the factors that contribute to the positive mental health of junior and senior high school students is crucial for supporting...
Adolescents' positive mental health is deeply associated with their growth. Identifying the factors that contribute to the positive mental health of junior and senior high school students is crucial for supporting youth development in the long run. Leveraging questionnaire data, this paper utilizes machine learning algorithms and fusion methods for feature selection to examine the factors that influence the onset of adolescent positive mental health. The influential features are ranked by importance. Then, machine learning algorithms in a heuristic form are employed to assess the predictive power of the fused features. The best f1-score of the classifiers reaches 0.844. It illustrates the effectiveness of feature filtering and provides valuable references for educational administrators to potential interventions.
Modern healthcare is going through a big change because of the way that new tools are proposed. This paper works at how the Internet of Things (IoT), heterogeneous data transfer and processing, and Artificial Intellig...
Modern healthcare is going through a big change because of the way that new tools are proposed. This paper works at how the Internet of Things (IoT), heterogeneous data transfer and processing, and Artificial Intelligence (AI) robotics could completely change how hospitals work. IoT technology has become a significant tool because it makes it possible to watch and collect data in realtime from hospital properties like medical equipment, wearables for patients, and smart sensors. This steady flow of different kinds of data brings both possibilities and problems. In our study, we talk about how significant it is to have strong data transmission and processing systems that can handle different data sources well. Intelligent automation that is led by AI is at the heart of this change. AI systems can work with the huge amount of data created by IoT devices and use machine learning, predictive analytics, and cognitive computing to improve different parts of hospital management. This paper gives case studies and real-world examples of how IoT, diverse dataprocessing, and AI automation have been used successfully in hospitals to show how these ideas can be used in the real world. IoT technology, diverse dataprocessing, and AI automation are assembled to change the way hospitals work by making them more effective and easier to maintain. This study offers the significance of using these tools to keep up with the varying needs of current healthcare. As hospitals continue to change, this relationship will play an even bigger role in making sure that patients get superior care while making the best use of their resources.
Automatic recognition of facial expressions is a common problem in human-computer interaction. While humans can recognize facial expressions very easily, machines cannot do it as easily as humans. Analyzing facial cha...
详细信息
With the rapid development of smart grid, the substation secondary cable condition monitoring data is growing exponentially and gradually constitutes the secondary circuit condition monitoring big data. The traditiona...
With the rapid development of smart grid, the substation secondary cable condition monitoring data is growing exponentially and gradually constitutes the secondary circuit condition monitoring big data. The traditional computing architecture can no longer meet the computing performance demand. Combining Spark big dataprocessing technology and AliCloud E-MapReduce cloud computing platform, we propose a parallel pattern recognition method for substation secondary cable condition monitoring big data, aiming to improve the ability of the secondary cable online monitoring system to quickly batch analyze the alarm monitoring data that suddenly increase in a short period of time. Spark-KNN, a parallelized K-nearest neighbor classification algorithm based on Spark, is designed to realize parallel pattern recognition of massive secondary cable monitoring data. The experimental results show that the average performance of Spark-KNN is 3.17 times higher than that of Hadoop MapReduce implementation, which is more suitable for performing real-time processing tasks of secondary cable monitoring big data.
Heart disease has been the leading cause of death in recent decades. Every five minutes, one person dies unexpectedly from heart disease. Researchers from all over the world are assisting doctors in diagnosing heart p...
详细信息
Heart disease has been the leading cause of death in recent decades. Every five minutes, one person dies unexpectedly from heart disease. Researchers from all over the world are assisting doctors in diagnosing heart problems. However, machine learning techniques can substantially minimize the number of tests required. The main objective of this paper is to predict the heart disease of a person well in advance to minimise the need for expensive treatment and medicines. The present papers have used some of the finest approaches of machine learning like decision tree and KNN classifier for prediction of heart diseases. These algorithms are quite useful in diagnosing cardiac illness without the use of machinery or labs, by figuring out the problem at a far lower cost. These algorithms are very widely utilized and helpful in other domains. The dataset was obtained from the Kaggle website and contain data of 303 patients and 76 attributes. The accuracy of the KNN along with Decision tree algorithms have been compared with existing work of researchers and found to be more in the detection of the heart disease. These algorithms certainly help in reducing the number of deaths which are happening all around the world.
Banks make the majority of their income from loans. A lot of individuals apply for loans, and it is difficult to choose the real candidate who will repay the loan. A lot of misunderstandings may occur when selecting t...
详细信息
Banks make the majority of their income from loans. A lot of individuals apply for loans, and it is difficult to choose the real candidate who will repay the loan. A lot of misunderstandings may occur when selecting the real applicant when the process is done manually. As a result, a loan prediction system based on machine learning is developed, in which the system will automatically identify the qualified candidates. This is beneficial to both the bank personnel and the applicant. The loan approval process will be greatly shortened. The loan data is predicted by using the hybrid model of Naive Bayes (NB) and Decision Tree (DT) algorithms. First, the dataset is given to the three classification algorithms– Support Vector Machine (SVM), NB and DT algorithms and the prediction is done with these three algorithms. The accuracy of each of these three is used to assess performance. The creation of the hybrid model increases accuracy. The dataset is given to NB for training and the prediction of NB is given to DT Algorithm for training. Test data are sent to the model for prediction after training. The model is evaluated, and the performance is measured in terms of different metrics form sklearn metrics. This prediction of loan range is useful for bank staff to give the loan amount accordingly. The NB algorithm checks for equality and independence of all the features in the dataset. In DT algorithm, the tree is constructed based on the information gain value. The attribute with high information gain value is placed as the root node and also the other nodes are constructed based on information gain value. The proposed hybrid model predicts - yes or no, and based on the prediction, whether the loan is to be sanctioned or denied for the applicant is specified.
Telemetry and monitoring are important means to ensure the safety of flight test. In order to solve the problem of multi-source data stream time synchronization and integrated monitoring under the multi-airplane coope...
Telemetry and monitoring are important means to ensure the safety of flight test. In order to solve the problem of multi-source data stream time synchronization and integrated monitoring under the multi-airplane cooperative flight test scenario, a multi-data stream time synchronization processing algorithm is designed, and a data and video correlation storage and synchronous playback scheme is proposed. Multidimensional integrated monitoring and online playback have been achieved by adopting the concept of componentization design. The system has been applied to the flight test mission of multi-aircraft cooperation, and the actual application results show that the multi-source data time synchronization algorithm is effective, which greatly improves monitoring experience, and effectively ensures the development of flight test program of the multi-aircraft cooperation.
Digitalization has resulted in accumulation of gigantic amount of data and that too at an alarming rate. processing such a large amount of data is the biggest challenge of the recent world. Extracting meaningful infor...
Digitalization has resulted in accumulation of gigantic amount of data and that too at an alarming rate. processing such a large amount of data is the biggest challenge of the recent world. Extracting meaningful information and ensuring correctness and preciseness of data is the core of current requirements of digital world. The process involving preprocessing of raw data, extracting meaningful information and deducing conclusive evidence for decision making lead to the field named as data mining. data mining is a framework of patterns and rules aiming at extracting the relationship or hidden information from the enormous set of databases. The paper targets the healthcare sector and the role of data mining in it. It also presents the challenges pertaining to the healthcare industry. The current trends and the future scope have also been illustrated.
By evaluating received data packets, network traffic categorization (NTC) identifies distinct categories of applications or traffic data. It is a significant technique in today's communication networks. data input...
详细信息
ISBN:
(数字)9798350371406
ISBN:
(纸本)9798350371413
By evaluating received data packets, network traffic categorization (NTC) identifies distinct categories of applications or traffic data. It is a significant technique in today's communication networks. data input, preprocessing, feature extraction, classification, and performance analysis are all processes in the network traffic classification process. The utility of learning approaches to categorize traffic over a network has been pushed by rapid improvements in machine learning. In the event of consistent datasets, inherent properties of internet networks cause uneven class distributions. This phenomenon is known as class imbalance, and it is gaining popularity in a variety of fields of study. Various network traffic classification and data balancing algorithms are discussed in this study.
暂无评论