Understanding patterns and energy-related data in photovoltaic systems is one of the key tasks in energy generation and distribution. In fact, the use of data-driven tools and predictive learning models can support th...
详细信息
Understanding patterns and energy-related data in photovoltaic systems is one of the key tasks in energy generation and distribution. In fact, the use of data-driven tools and predictive learning models can support the government, power regulatory agencies, and the energy industry in improving their decision-making and operational activities. Bering this in mind, this paper presents a case study of data-driven analysis and machinelearning to forecast the energy charge in the distributed photovoltaic power grid of Queensland, in Australia. Our analysis relies on a freely, open energy tracking platform and the design of three machinelearning approaches built on the basis of Random Forest, Support Vector machines, and Gradient Boosting methods. Experimental results with real data showed that the trained models allow for very consistent predictions while reaching a high forecasting accuracy (around 95%-93% in Generated-Exported prediction, respectively). Moreover, it was found that the Gradient Boosting-based model ensures robust behavior and low prediction errors, as endorsed by quality validation metrics. Another technical aspect observed is that the variables artificially created to boost the models substantially improve the post-analysis and overall accuracy of the results. (C) 2021 The Author(s). Published by Elsevier Ltd.
Depression is a global public health challenge that affects approximately 300 million people. Artificial Intelligence and machinelearning have revolutionized the healthcare sector, allowing the development of models ...
详细信息
In the current era stock price prediction plays a key role for prediction of future data with respect to training the past data by using machinelearning or deep learning technologies. Building a model and then passin...
详细信息
machinelearning (ML) is an important tool in decision-making and many ML algorithms are used in crop yield prediction. ML algorithm predictions are based on various dependent factors. In this study, climate variabili...
详细信息
ISBN:
(纸本)9789819760084;9789819760091
machinelearning (ML) is an important tool in decision-making and many ML algorithms are used in crop yield prediction. ML algorithm predictions are based on various dependent factors. In this study, climate variabilities such as precipitation and crop yield are considered as the dependent factors. This paper shows a micro-level study which concentrates on the Paravanar River Basin which is situated on Cuddalore District, Tamil Nadu, India. There are more than 40 crops grown in this region and the main source of irrigation is the north-east monsoon rainfall. Since, the hypothesis is tested between the annual rainfall and the crop yield, the crops which are cultivated throughout the year are chosen for analysis. They are cashew nut, coriander, sugarcane, sweet potato and turmeric. The PERSIANN-CDR data from National Centres for Environmental Information-National Oceanic and Atmospheric Administration was used as the precipitation data and the agriculture crop production data was downloaded from Ministry of Agriculture and Farmers Welfare of India. The datasets were tested using the following ML algorithms logistic regression, decision tree classifier, random forest classifier and XGBoost classifier. Implementation of ML-derived results can only be done through participatory approaches. Therefore, participatory rural appraisal was done with the farmers and the villagers to assess the willingness of implementation and it shows positive results. Hence the main aim of the study tests the hypotheses on dependency of crop yield on precipitation on a regional scale.
Model fairness is an essential element for Trustworthy AI. While many techniques for model fairness have been proposed, most of them assume that the training and deployment data distributions are identical, which is o...
详细信息
Model fairness is an essential element for Trustworthy AI. While many techniques for model fairness have been proposed, most of them assume that the training and deployment data distributions are identical, which is often not true in practice. In particular, when the bias between labels and sensitive groups changes, the fairness of the trained model is directly influenced and can worsen. We make two contributions for solving this problem. First, we analytically show that existing in-processing fair algorithms have fundamental limits in accuracy and group fairness. We utilize the notion of correlation shifts between labels and groups, which can explicitly capture the change of the above bias. Second, we propose a novel pre-processing step that samples the input data to reduce correlation shifts and thus enables the in-processing approaches to overcome their limitations. We formulate an optimization problem for adjusting the data ratio among labels and sensitive groups to reflect the shifted correlation. A key benefit of our approach lies in decoupling the roles of pre- and in-processing approaches: correlation adjustment via pre-processing and unfairness mitigation on the processed data via inprocessing. Experiments show that our framework effectively improves existing in-processing fair algorithms w.r.t. accuracy and fairness, both on synthetic and real datasets.
machinelearning (ML) applications require high-quality datasets. Automated data augmentation techniques can help increase the richness of training data, thus increasing the ML model accuracy. Existing solutions focus...
详细信息
ISBN:
(纸本)9781665481045
machinelearning (ML) applications require high-quality datasets. Automated data augmentation techniques can help increase the richness of training data, thus increasing the ML model accuracy. Existing solutions focus on efficiency and ML model accuracy but do not exploit the richness of dataset relationships. With relational data, the challenge lies in identifying join paths that best augment a feature table to increase the performance of a model. In this paper we propose a two-step, automated data augmentation approach for relational data that involves: (i) enumerating join paths of various lengths given a base table and (ii) ranking the join paths using filter methods for feature selection. We show that our approach can improve prediction accuracy and reduce runtime compared to the baseline approach.
The 4th international DDDAS 2022conference, convened on October 6–10, featured presentations on Dynamic data Driven Applications Systems (DDDAS)-based approaches and capabilities, in a wide set of areas, with an ove...
详细信息
This paper presents an analysis method of ultimate bearing capacity of foundations based on machinelearning. Firstly, based on the clustering algorithm of machinelearning, the numerical results are processed to obta...
详细信息
The prevailing strategy for ensuring web application accessibility relies heavily on adherence to established guidelines, particularly the Web Content Accessibility Guidelines (WCAG). However, the process of interpret...
详细信息
Nowadays, the problem of short- and long-term forecasts in the Arctic region becomes more important for global logistics and engineering. Here, we present our first findings on the application of machinelearning (ML)...
详细信息
Nowadays, the problem of short- and long-term forecasts in the Arctic region becomes more important for global logistics and engineering. Here, we present our first findings on the application of machinelearning (ML) methods to oceanographic data. The Barents Sea is the region of our interest, and we focus on the analysis of wind waves and surface currents. We applied several ML models to simulate observed time series and obtained short- and long-term forecasts. The Long Short Time Memory model and XGBoost algorithms showed better results in fitting the observed curve.
暂无评论