Hindi Film Industry also referred to as Bollywood has now become a multibillion dollar industry and has also surpassed Hollywood in terms of amount of ticket annually sold. With so much money now riding on Bollywood m...
详细信息
ISBN:
(纸本)9781538684887
Hindi Film Industry also referred to as Bollywood has now become a multibillion dollar industry and has also surpassed Hollywood in terms of amount of ticket annually sold. With so much money now riding on Bollywood movies it has become imperative to make accurate predictions about success of Bollywood films. Today even if a Bollywood movie does not become a hit at box-office the producer of movie will still make profits through sale of satellite rights and music rights but it is Distributors who suffer losses. Hence it has now become imperative for distributors to purchase distribution rights of movies at reasonable prices such that they can obtain profits from it rather than just break even. This problem is a supervised learning problem and will review regression techniques discussed in the literature for predicting the lifetime net India collections of Bollywood films as well as use classification methods on our Bollywood movie dataset for multiclass classification. An evaluation of all the approaches is proposed in which the accuracy score will be reported.
Data analysis has become important with the ever-increasing and diversified data from the past to the present. Supervised machine learning algorithms have been one of the preferred methods with time because of the abi...
详细信息
ISBN:
(纸本)9781538609309
Data analysis has become important with the ever-increasing and diversified data from the past to the present. Supervised machine learning algorithms have been one of the preferred methods with time because of the ability to produce fast and effective solutions in data analysis. One of the most important problems of supervised machine learning algorithms is the inability to achieve the ideal predicted values which are obtained by adversely affecting the test values of little or over training. Therefore, determining the ideal number of training iterations for each algorithm is important for algorithm estimation success. In this study, it has been tried to obtain the best results in the estimation performance with the help of an application being developed in determining the ideal training iteration numbers of the supervised machine learning algorithms on a sample dataset and the prediction performances of the algorithms are compared with each others.
Software Defined Networking (SDN) is a new promising networking concept which has a centralized control over the network and separates the data and control planes. This new approach provides abstraction of lower-level...
详细信息
ISBN:
(纸本)9781509063673
Software Defined Networking (SDN) is a new promising networking concept which has a centralized control over the network and separates the data and control planes. This new approach provides abstraction of lower-level functionality and allows the network administrators to initialize, control, change, and manage network behavior programmatically. The centralized control, being the major advantage of SDN can sometimes also be a major security threat. If the intruder succeeds in attacking the central controller, he would get access to the entire system. The controller is highly vulnerable to Distributed Denial of Service (DDoS) attacks which lead to exhaustion of the system resources which causes non-availability of the services given by the controller. It is critical to detect the attacks in the controller at earlier stage. Many algorithms and techniques have been discovered for this purpose. But less work has been done in the field of SDN networks. Using machine learning algorithms for classifying the connections into legitimate and illegitimate is one such solution. We use two machine learning algorithms namely, the Support Vector machine (SVM) classifier and the Neural Network (NN) classifier to detect the suspicious and harmful connections.
Big data analytics is a booming research area in computer science and many other industries all over the world It has gained great success in vast and varied application sectors. This includes social media, economy, f...
详细信息
ISBN:
(纸本)9781509032945
Big data analytics is a booming research area in computer science and many other industries all over the world It has gained great success in vast and varied application sectors. This includes social media, economy, finance, healthcare, agriculture, etc. Several intelligent machinelearning techniques were designed and used to provide big data predictive analytics solutions. A literature survey of different machinelearning techniques is provided in this paper. Also a study on commonly used machine learning algorithms for big data analytics is done and presented in this paper.
Intrusion detection system (IDS) is one of the implemented solutions against harmful attacks. Furthermore, attackers always keep changing their tools and techniques. However, implementing an accepted IDS system is als...
详细信息
ISBN:
(纸本)9781538638552
Intrusion detection system (IDS) is one of the implemented solutions against harmful attacks. Furthermore, attackers always keep changing their tools and techniques. However, implementing an accepted IDS system is also a challenging task. In this paper, several experiments have been performed and evaluated to assess various machinelearning classifiers based on KDD intrusion dataset. It succeeded to compute several performance metrics in order to evaluate the selected classifiers. The focus was on false negative and false positive performance metrics in order to enhance the detection rate of the intrusion detection system. The implemented experiments demonstrated that the decision table classifier achieved the lowest value of false negative while the random forest classifier has achieved the highest average accuracy rate.
Big Data has been a catalyst force for the machinelearning (ML) area, forcing us to rethink existing strategies in order to create innovative solutions that will push forward the field. This paper presents an overvie...
详细信息
ISBN:
(纸本)9781538614174
Big Data has been a catalyst force for the machinelearning (ML) area, forcing us to rethink existing strategies in order to create innovative solutions that will push forward the field. This paper presents an overview of the strategies for using machinelearning in Big Data with emphasis on the high-performance parallel implementations on many-core hardware. The rationale is to increase the practical applicability of ML implementations to large-scale data problems. The common underlying thread has been the recent progress in usability, cost effectiveness and diversity of parallel computing platforms, specifically, the Graphics Processing Units (GPUs), tailored for a broad set of data analysis and machinelearning tasks. In this context, we provide the main outcomes of a GPU machinelearning Library (GPUMLib) framework, which empowers researchers with the capacity to tackle larger and more complex problems, by using high-performance implementations of well-known ML algorithms. Moreover, we attempt to give insights on the future trends of Big Data Analytics and the challenges lying ahead.
New Technologies such as Big Data and Cloud is playing a vital role in providing solutions to Healthcare problems. Now-a-days healthcare data is growing very drastically day-by-day and it requires an efficient, effect...
详细信息
ISBN:
(纸本)9781538605141
New Technologies such as Big Data and Cloud is playing a vital role in providing solutions to Healthcare problems. Now-a-days healthcare data is growing very drastically day-by-day and it requires an efficient, effective and timely solution to reduce the mortality rate. One of the most critical chronic healthcare problems is diabetes. In Long run, this problem may leads to damage eyes, heart, kidneys and nerves of diabetes patient if improper medication is done which also leads to death. The aim of this paper is to analyze and compare different machine learning algorithms to identify a best predicting algorithm based on various metrics such as accuracy, kappa, precision, recall, sensitivity and specificity. A comprehensive study is done on diabetes dataset with Random Forest (RF), SVM, k-NN, CART and LDA algorithms. The achieved results shows that RF is giving more accurate predictions with compared to other algorithms.
Control and monitoring of asthma progress is highly important for patient's quality of life and healthcare management. Emerging tools for self-management of various chronic diseases have the potential to support p...
详细信息
ISBN:
(纸本)9781538603581
Control and monitoring of asthma progress is highly important for patient's quality of life and healthcare management. Emerging tools for self-management of various chronic diseases have the potential to support personalized patient guidance. This work presents the design aspects of the myAirCoach decision support system, with focus on the assessment of three machinelearning approaches as support tools the first prototype implementation.
Software defect prediction is a well renowned field of software engineering. Determination of defective classes early in the lifecycle of a software product helps software practitioners in effective allocation of reso...
详细信息
ISBN:
(纸本)9781509030125
Software defect prediction is a well renowned field of software engineering. Determination of defective classes early in the lifecycle of a software product helps software practitioners in effective allocation of resources. More resources are allocated to probable defective classes so that defects can be removed in the initial phases of the software product. Such a practice would lead to a good quality software product. Although, hundreds of defect prediction models have been developed and validated by researchers, there is still a need to develop and evaluate more models to draw generalized conclusions. Literature studies have found machinelearning (ML) algorithms to be effective classifiers in this domain. Thus, this study evaluates four ML algorithms on data collected from seven open source software projects for developing software defect prediction models. The results indicate superior performance of the Multilayer Perceptron algorithm over all the other investigated algorithms. The results of the study are also statistically evaluated to establish their effectiveness.
Cloud computing became very popular in past few years, and most of the business and home users rely on its services. Because of its wide usage, cloud computing services became a common target of different cyber-attack...
详细信息
ISBN:
(纸本)9789532330922
Cloud computing became very popular in past few years, and most of the business and home users rely on its services. Because of its wide usage, cloud computing services became a common target of different cyber-attacks executed by insiders and outsiders. Therefore, cloud computing vendors and providers need to implement strong information security protection mechanisms on their cloud infrastructures. One approach that has been taken for successful threat detection that will lead to the successful attack prevention in cloud computing infrastructures is the application of machine learning algorithms. To understand how machine learning algorithms can be applied for cloud computing threat detection, we propose the cloud computing threat classification model based on the feasibility of machine learning algorithms to detect them. In this paper, we addressed three different criteria types, where we considered three types of classification: a) type of learning algorithm, b) input features and c) cloud computing level. Results proposed in this paper can contribute to further studies in the field of cloud threat detection with machine learning algorithms. More specifically, it will help in selecting appropriate input features, or machine learning algorithms, to obtain higher classification accuracy.
暂无评论