The construction industry is experiencing explosive growth in its capability to, generate and collect data. Advances in data storage technology have allowed the transformation of an enormous amount of data into comput...
详细信息
ISBN:
(纸本)1853129259
The construction industry is experiencing explosive growth in its capability to, generate and collect data. Advances in data storage technology have allowed the transformation of an enormous amount of data into computerized database systems. Nowadays, there are many efforts to convert the large amounts of data into useful patterns or trends. Knowledge Discovery in database (KDD) is a process that combines datamining (DM) techniques from machinelearning, pattern recognition, statistics, databases, and visualization to automatically extract concepts, interrelationships, and patterns of interest from a large database. By applying KDD and DM to the analysis of construction project data, this paper presents the results of a research that discovers the knowledge through KDD process to better identify recurring construction problems.
With the rapid development of Internet technology, all kinds of data are growing exponentially. How to effectively manage and utilize these data has become the focus of research in the era of big data. Under the requi...
详细信息
ISBN:
(纸本)9781728155050
With the rapid development of Internet technology, all kinds of data are growing exponentially. How to effectively manage and utilize these data has become the focus of research in the era of big data. Under the requirement of massive data processing, aiming at the time requirement of massive data processing which cannot be met by traditional single-machine serial, this paper proposes a Spark computing framework, studies Bayesian algorithm in datamining, realizes the establishment method of parallel Bayesian algorithm and optimizes it. By using Spark memory computing framework, the efficiency of iteration is high. The computational performance of the parallel computing program is investigated. By comparing Spark parallel computing with traditional single machine serial experiments, it is found that the algorithm can effectively improve the speed of text classification. With the expansion of cluster size, the performance of classification accuracy, time performance and acceleration ratio is better. Parallel Bayesian algorithm based on Spark platform is feasible, which solves the problem that traditional single computer cannot handle large-scale data, and can effectively deal with all kinds of classification problems.
Intraday stock trading has become a popular trend in US, Europe, and Indian markets and forecasting these rapid market movements have become an important topic in finance. With the emergence of technology and computin...
详细信息
The film industry has always been a very important sector in the global market. Therefore, it is very important to maximize the profit by predicting the movie success before its release. Although several studies have ...
详细信息
ISBN:
(纸本)9781538678930
The film industry has always been a very important sector in the global market. Therefore, it is very important to maximize the profit by predicting the movie success before its release. Although several studies have been done in this field, it is still needed to improve the prediction performance and collect more data. This study aims to explore the use of Factorization machines approach in order to predict movie success by predicting IMDb ratings for newly released movies using social media data and compare it to current studies. Also, a framework has been developed in order to gather the movie data from different sources including social media. Comparison of the Factorization machines to the current models shows that there are promising results.
Unpredictable and uncertain volume of the rainfall is the serious nature disaster. In current, available rainfall forecasting model predict rainfall volume hourly, weekly or monthly. This work proposed a supervised le...
详细信息
ISBN:
(纸本)9783319746906;9783319746890
Unpredictable and uncertain volume of the rainfall is the serious nature disaster. In current, available rainfall forecasting model predict rainfall volume hourly, weekly or monthly. This work proposed a supervised learning model which is based on machine leaning algorithms of datamining. This approach classify the low, mid and high volume of rainfall. Proposed approach is practically implemented on different uncertain heavy rainfall regions and compare the accuracy and measured the accuracy by ROC area of classifiers such as Random Forest, SMO, Naive Bayes and Multilayer Perceptron (MLP).
In parallel with technological development the problem of fraud detection is becoming more and more important. Increasing number of electronic transactions in various business environments, on the one hand, and softwa...
详细信息
Using the capabilities of machinelearning and datamining techniques, we formulated a Decision Tree model to aid hotel management in optimizing resource allocation for profit maximization. Additionally, we constructe...
详细信息
Recent years have witnessed a surge of research interest in graph machinelearning. However, the benchmark datasets available to the field are rather limited in both quantity and diversity, an issue particularly notab...
详细信息
ISBN:
(纸本)9798400701030
Recent years have witnessed a surge of research interest in graph machinelearning. However, the benchmark datasets available to the field are rather limited in both quantity and diversity, an issue particularly notable given the immense potential applications of graph learning. The lack of diverse benchmark datasets may have biased the development of graph machinelearning techniques towards narrow directions. By crowdsourcing novel tasks and datasets, this workshop aims to increase the diversity of graph learning benchmarks, identify new demands of graph machinelearning in general, and gain a better synergy of how concrete techniques perform on these benchmarks. Moreover, this workshop offers a platform for discussions of best practices in curating graph learning benchmarks and data-centric approaches for graph learning.
This research explores the feasibility of AI-powered cryptocurrency mining on traditional workstations. Traditional workstations, while widely available, face challenges in terms of computational efficiency and energy...
详细信息
Financial frauds are on the rise globally, causing significant financial losses. This issue has far-reaching consequences, impacting the investment industry, government, and corporate sectors alike. Manual verificatio...
详细信息
暂无评论