The main pillar of the Indian Economy is Agriculture as it ensures food security and financial stability for all people in ancient days. However, due to the unnatural weather conditions, the production of yield crops ...
详细信息
The main pillar of the Indian Economy is Agriculture as it ensures food security and financial stability for all people in ancient days. However, due to the unnatural weather conditions, the production of yield crops has severely affected the income of farmers. This work helps early-stage farmers by guiding them through the process of sowing the right crops based on soil using the help of machine learning algorithms. The seeds are collected based on the soil parameters such as Nitrogen, phosphorous, potassium, temperature, and humidity. The system is trained using the Random Forest algorithm to suggest and help the farmers to achieve a successful harvest. A web-based application is also being developed which will allow users to get data about nutrients required for the crop. Experiments are conducted on the collected data samples from the web application and predict the crop yield with good accuracy of 91.2 % and in less response time.
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR), as an adaptive immunity mechanism in prokaryotes, is a programmable system that can be used for editing genes of various species. Possibly it can be ...
详细信息
ISBN:
(数字)9781665468190
ISBN:
(纸本)9781665468206
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR), as an adaptive immunity mechanism in prokaryotes, is a programmable system that can be used for editing genes of various species. Possibly it can be used to edit genes causing some rare diseases of humans, such as Ichthyosis. CRISPR is associated with the Cas9 protein that cuts a DNA sequence. Cas9 binds to a gRNA that guides Cas9 to an editable site. Although versatile and easy to design, a misguiding gRNA could result wrong edition of genes. Hence, the CRISPR/Cas9 process needs to find ideal gRNAs that can guide Cas9 to on-targets, and avoid off-targets. Various machine learning algorithms play important roles in finding effective gRNAs for the CRISPR/Cas9 by predicting the cleavage efficiency of gRNA. Here, we aim to provide an overview and comprehensive analysis of various machine and deep learningalgorithms that are effective in predicting CRISPR gRNA sequences’ on-target activities. Comparison results show that the hybrid approaches combining deep learning and other machine learning algorithms present excellent outcomes. We applied the models to predict the cleavage efficiency of gRNAs from the FLG gene causing Ichthyosis, and the results suggest to use multiple models to obtain consistent and trustworthy results. Although our work does not provide final solutions, it will open opportunities for computational tools to diagnose and treat rare diseases. The datasets and codes are available at https://***/ksanyour/MLforGeneEditing.
machinelearning plays very important role in processing of large amounts of structured and unstructured data. A set of algorithms can be used to get meaningful insights into the data that are helpful in making effect...
详细信息
ISBN:
(纸本)9781509012824
machinelearning plays very important role in processing of large amounts of structured and unstructured data. A set of algorithms can be used to get meaningful insights into the data that are helpful in making effective business decisions. Document clustering is one of the popular machinelearning technique used to group unstructured data (text documents) based on its content and further analyze the data to understand the patterns in it. The unstructured data gets transformed into semi-structured data and structured data in stages by using text mining and clustering (k-means) techniques. Classification is another machinelearning technique that can be implemented for use cases like "fraud detection and cross-sell & up-sell opportunity identification" in banking, financial services and insurance industry. This paper focuses on the implementation of both document clustering algorithm and a set of classification algorithms (Decision Tree, Random Forest and Naïve Bayes), along with appropriate industry use cases. Also, the performance of three classification algorithms will be compared by calculation of "Confusion Matrix" which in turn helps us to calculate performance measures such as, "accuracy", "precision", and "recall".
In today’s world, stress is a big problem that affects people’s health and happiness. More and more people are feeling stressed out, which can lead to lots of health issues like breathing problems, feeling overwhelm...
详细信息
ISBN:
(数字)9798350394474
ISBN:
(纸本)9798350394481
In today’s world, stress is a big problem that affects people’s health and happiness. More and more people are feeling stressed out, which can lead to lots of health issues like breathing problems, feeling overwhelmed, heart attack, diabetes, etc. This work endeavors to forecast stress and non-stress occurrences among college students by applying various machine learning algorithms: Decision Trees, Random Forest, Support Vector machines, AdaBoost, Naive Bayes, Logistic Regression, and K-nearest Neighbors. The primary objective of this work is to leverage a research study to predict and mitigate stress and non-stress based on the collected questionnaire dataset. We conducted a workshop with the primary goal of studying the stress levels found among the students. This workshop was attended by Approximately 843 students aged between 18 to 21 years old. A questionnaire was given to the students validated under the guidance of the experts from the All India Institute of Medical Sciences (AIIMS) Raipur, Chhattisgarh, India, on which our dataset is based. The survey consists of 28 questions, aiming to comprehensively understand the multidimensional aspects of stress, including emotional well-being, physical health, academic performance, relationships, and leisure. This work finds that Support Vector machines have a maximum accuracy for Stress, reaching 95%. The study contributes to a deeper understanding of stress determinants. It aims to improve college student’s overall quality of life and academic success, addressing the multifaceted nature of stress.
Despite the quick growth of the digital world, phishing attempts still constitute a serious threat to the security of online banking transactions. The goal of this research project is to evaluate the usage of machine ...
详细信息
ISBN:
(数字)9798331543624
ISBN:
(纸本)9798331543631
Despite the quick growth of the digital world, phishing attempts still constitute a serious threat to the security of online banking transactions. The goal of this research project is to evaluate the usage of machine learning algorithms to detect websites that are used to spread phishing messages. This work specifically considers complex feature engineering and algorithm selection approaches. The detection method is enhanced by the application of optimal model, which is utilized to extract and evaluate text-based elements from phishing websites. It is done with optimal model. DNSPython and Python-Whois are the technologies utilized to finish gathering domain-related data. However, Scikit-learn makes the process of putting machinelearning models into practice easier. The objectives that AutoML aids in achieving include efficient model selection and optimization. This has the important advantage of making it possible to automatically identify the algorithms with the highest performance levels. The objective of this research endeavour is to enhance the accuracy and robustness of phishing detection systems through the utilization of diverse methodologies and technologies. Additionally, the study makes an effort to clarify how machinelearning would be able to successfully counteract the increasingly complex phishing methods. Through improving the automatic identification of phishing-related websites, our program seeks to contribute to the creation of safer online environments.
Injection molding is a very complicated process to monitor and control. With its high complexity and many process parameters, the optimization of these systems is a very challenging problem. To meet the requirements a...
详细信息
Injection molding is a very complicated process to monitor and control. With its high complexity and many process parameters, the optimization of these systems is a very challenging problem. To meet the requirements and costs demanded by the market, there has been an intense development and research with the aim to maintain the process under control. This paper outlines the latest advances in algorithms for plastic injection process and monitoring, and presents a real case of application that verifies their performance.
Internet of Things describes variety of embedded devices that are connected to the internet. This network of devices are used to build smart environment for smart city applications and used by many organizations. Whil...
详细信息
ISBN:
(数字)9798350372120
ISBN:
(纸本)9798350372137
Internet of Things describes variety of embedded devices that are connected to the internet. This network of devices are used to build smart environment for smart city applications and used by many organizations. While data is constantly exchanged among various IoT devices, the detection of malicious attacks is essential for effective quality of service implementation in IoT networks. In this research, the University of New South Wales TON_IOT Train-Test dataset is used to classify network traffic. State of art machinelearning methods such as Random Forest (RF), Decision Tree (DT), and Gradient Boosting (GB) are used to detect malicious IoT traffic. There are two classification types are considered in this study-binary and multi-label. When compared to the other two models, the Random Forest model gets the best accuracy of 99% in both binary and multi class classification.
In this paper we applied machinelearning techniques for Gastrointestinal Polyps classification from colonoscopy video clips and compared our results to other methods and with the results of clinicians with different ...
详细信息
ISBN:
(数字)9781728167992
ISBN:
(纸本)9781728168005
In this paper we applied machinelearning techniques for Gastrointestinal Polyps classification from colonoscopy video clips and compared our results to other methods and with the results of clinicians with different levels of experience. machinelearning technology allows us to classify tissues that can reduce the waiting time for patients' results. We tested four machine learning algorithms (Support Vector machine, Random Forest, Random Subspace and Extra-Trees) for classification of the polyps in hyperplastic, serrated and adenoma lesions. We used a dataset in which there are 152 instances with the three types of lesions, for 76 polyps. The best results were obtained by Random Forest algorithm with the accuracy of 87%, and the worst results were obtained by Support Vector machine with the accuracy of between 63% and 73%.
The article presents a study which designed, developed and implemented novel machine learning algorithms (MLAs) as a support tool for lymphocyte-related diagnosis using cell population data (CPD), along with absolute ...
详细信息
The article presents a study which designed, developed and implemented novel machine learning algorithms (MLAs) as a support tool for lymphocyte-related diagnosis using cell population data (CPD), along with absolute lymphoid count, age and gender. Topics include stages to build the model; classification algorithms that were tested including decision trees, random forests, and k-nearest neighbour; and finding on the neural networks (NN) algorithm, based on CPD and absolute lymphoid counts.
Air pollution represents an issue that raises many concerns nowadays, as it has various negative effects on the environment and the economy worldwide. Because of the rapid urbanization, cities are suffering from pollu...
详细信息
ISBN:
(数字)9781728175065
ISBN:
(纸本)9781728175072
Air pollution represents an issue that raises many concerns nowadays, as it has various negative effects on the environment and the economy worldwide. Because of the rapid urbanization, cities are suffering from polluted air, so it is important to predict future air quality. For this purpose, new applications of artificial intelligence should be employed. In this paper, we will present several machine learning algorithms, the possible software that can be used for them and the applications used in the field of air quality. Based on the research in the field, we propose SVR, ARIMA and LSTM, 3 machinelearning models, which can be used to predict air pollution. These algorithms have been tested using time-series for PM 10 and PM 2.5 particles. The results showed that SVR and ARIMA algorithms are the most suitable in forecasting air pollutant concentrations.
暂无评论