Gastric cancer is predominantly caused by demographic-diet factors as compared to other cancer types. the aim of the study is to predict Early Gastric Cancer (EGC) factors from diet and lifestyle characteristics of Mi...
ISBN:
(数字)9781728148762
ISBN:
(纸本)9781728148779
Gastric cancer is predominantly caused by demographic-diet factors as compared to other cancer types. the aim of the study is to predict Early Gastric Cancer (EGC) factors from diet and lifestyle characteristics of Mizo-ethnicity using supervised machine learning algorithms. For this study, 80 cases and 160 controls are selected and a dataset containing 11 features that are core risk factors for the gastric cancer have been chosen for data mining. the learning curves show Naive Bayes, Logistic Regression and Multilayer perceptron are the best fit classification algorithms for our dataset. data models are constructed and evaluated using: brier score, accuracy, precision_recall curves for cases (patients) and controls (healthy individuals), and false positives. the data interpretation shows Naive Bayes has the highest classification results having an accuracy of 90%, withthe lowest Brier score of 0.1, and a false positive rate of 3% as compared to other models. Logistic regression classifier shows equally good performances with setback in brier_score and false positives. this study found that extra salt, tuibur, smoking and alcohol are the non_invasive etiological factors for gastric cancer in Mizoram population as predicted by the Naive Bayes algorithm. this knowledge will be helpful for initiating early screening and to educate the public about the risk of dietary and lifestyle factors in high risk population with unique habits.
In order to effectively classify the increasingly massive load data, the method of applying support vector machine to load classification is studied. By comparing and selecting the C-SVC classification and the RBF ker...
In order to effectively classify the increasingly massive load data, the method of applying support vector machine to load classification is studied. By comparing and selecting the C-SVC classification and the RBF kernel function, the optimization method of the grid search is used to find the optimal parameter combination of the LIBSVM classifier and establish the model. the visualization accuracy and index function are used to observe the classification accuracy of each class. Operations such as weighting the unbalanced samples are conducted to adjust the accuracy gap between large and small samples. Experimental results show that the method is feasible in load classification.
ENIGMA is an efficient implementation of learning-based guidance for given clause selection in saturation-based automatedtheorem provers. In this work, we describe several additions to this method. this includes bett...
详细信息
ISBN:
(数字)9783319968124
ISBN:
(纸本)9783319968124;9783319968117
ENIGMA is an efficient implementation of learning-based guidance for given clause selection in saturation-based automatedtheorem provers. In this work, we describe several additions to this method. this includes better clause features, adding conjecture features as the proof state characterization, better data pre-processing, and repeated model learning. the enhanced ENIGMA is evaluated on the MPTP2078 dataset, showing significant improvements.
the Objective of the study is to Analyze and mining rice breeding data withdata explore and machine learning algorithms to discover how rice biological characters influence the economic characters, explore effective ...
详细信息
ISBN:
(数字)9783030061371
ISBN:
(纸本)9783030061371;9783030061364
the Objective of the study is to Analyze and mining rice breeding data withdata explore and machine learning algorithms to discover how rice biological characters influence the economic characters, explore effective methods and technologies for breeders and help them find appropriate breeding parents, and provide tools for parental selection in rice breeding. the author developed a B/S application with Python and Django, which implement real-time data mining of rice breeding data. data analysis and processing result generated from decision tree algorithm can find effective breeding knowledge and patterns, and spectral biclustering algorithm can find required varieties withtheir local features follow certain patterns. the system can help breeders find useful knowledge and patterns more quickly, and improves the accuracy and efficiency of crop breeding.
the proceedings contain 64 papers. the special focus in this conference is on intelligentdataengineering and automatedlearning. the topics include: data streams fusion by frequent correlations mining;web genre clas...
the proceedings contain 64 papers. the special focus in this conference is on intelligentdataengineering and automatedlearning. the topics include: data streams fusion by frequent correlations mining;web genre classification via hierarchical multi-label classification;multi-agent reinforcement learning for control systems: challenges and proposals;optimal filtering for time series classification;managing monotonicity in classification by a pruned random forest;ensemble selection based on discriminant functions in binary classification task;an extension of multi-label binary relevance models based on randomized reference classifier and local fuzzy confusion matrix;fusion of self-organizing maps with different sizes;a particle swarm clustering algorithm with fuzzy weighted step sizes;a bacterial colony algorithm for association rule mining;information retrieval and data forecasting via probabilistic nodes combination;knowledge discovery in enterprise databases for forecasting new product success;deterministic extraction of compact sets of rules for subgroup discovery;variable transformation for granularity change in hierarchical databases in actual data mining solutions;an empirical evaluation of robust Gaussian process models for system identification;throughput analysis of automatic production lines based on simulation methods;study of collective robotic tasks based on the behavioral model of the agent;minimalist artificial eye for autonomous robots and path planning;intelligentautomated design of machine components using antipatterns;application of fuzzy logic controller for machine load balancing in discrete manufacturing system;local search based on a local utopia point for the multiobjective travelling salesman problem;a hybrid programming framework for resource-constrained scheduling problems and building an efficient evolutionary algorithm for forex market predictions.
Most of the traditional fault diagnosis methods rely on the expert knowledge of artificial extraction features and related fields, and these algorithms are not accurate, and the robustness and generalization ability a...
Most of the traditional fault diagnosis methods rely on the expert knowledge of artificial extraction features and related fields, and these algorithms are not accurate, and the robustness and generalization ability are poor. Convolutional neural network is one of the most widely used deep learning models. Based on its unique convolution-pooling network structure, convolutional neural network has powerful feature extraction and expression capabilities. In this paper, based on the characteristics of one-dimensional vibration signals, a fault diagnosis algorithm model based on one-dimensional convolutional neural network is proposed. through the experiment of the bearing fault public data set, the proposed algorithm has more than 99% fault recognition rate.
Suburban railway lines are emerging in China in the last few years. Not as definite as in high-speed railway or urban rail transit, the applications of signaling and traffic control system in suburban railway have not...
Suburban railway lines are emerging in China in the last few years. Not as definite as in high-speed railway or urban rail transit, the applications of signaling and traffic control system in suburban railway have not been strictly defined. Based on the investigation of the traffic scale and its characteristics in suburban railway, and traffic control systems implemented respectively in national railway and urban rail transit, an agent-based system, named D-Agent, is presented to support suburban railway traffic control and management. It consists of six basic modules: local data, knowledge base, skills, reasoning mechanism, data processing, and communication. Infrastructure characteristics, train performances, safety and operational restrictions are stored in local data and knowledge base. Functions of traffic control and adaptive support become the skills of D-Agent, and all possible interfaces of traffic control system are learned as the languages of D-Agent for communication with other systems. through continuous learning, D-Agent can be trained to support the traffic control system or dispatcher in handling traffic problems, as well in adapting to different signaling systems that may be applied in suburban railway.
Globally, cardiovascular (heart) diseases are the major cause of death. About 80% of deaths are reported in developing countries. Looking at the trend and lifestyle, one can predict that by 2030 around 23.6 million pe...
详细信息
the article attempts to demonstrate the possible application opportunities of multidimensional stochastic systems entropy modeling in medicine on a specific example. An assessment of rural male population health state...
详细信息
Classification tasks using imbalanced data are not challenging on their own. When the classes are linearly separable, a regular classification algorithm usually induces predictive models able to distinguish the classe...
详细信息
暂无评论