The generalizedadditive model is a well established and strong tool that allows modelling smooth effects of predictors on the response. However, if the link function, which is typically chosen as the canonical link, ...
详细信息
The generalizedadditive model is a well established and strong tool that allows modelling smooth effects of predictors on the response. However, if the link function, which is typically chosen as the canonical link, is misspecified, estimates can be biased. A procedure is proposed that simultaneously estimates the form of the link function and the unknown form of the predictor functions including selection of predictors. The procedure is based on boosting methodology, which obtains estimates by using a sequence of weak learners. It strongly dominates fitting procedures that are unable to modify a given link function if the true link function deviates from the fixed function. The performance of the procedure is shown in simulation studies and illustrated by real world examples.
Digital soil mapping (DSM) is gaining momentum as a technique to help smallholder farmers secure soil security and food security in developing regions. However, communications of the digital soil mapping information b...
详细信息
Digital soil mapping (DSM) is gaining momentum as a technique to help smallholder farmers secure soil security and food security in developing regions. However, communications of the digital soil mapping information between diverse audiences become problematic due to the inconsistent scale of DSM information. Spatial downscaling can make use of accessible soil information at relatively coarse spatial resolution to provide valuable soil information at relatively fine spatial resolution. The objective of this research was to disaggregate the coarse spatial resolution soil exchangeable potassium (K-ex) and soil total nitrogen (TN) base map into fine spatial resolution soil downscaled map using weighted generalized additive models (GAMs) in two smallholder villages in South India. By incorporating fine spatial resolution spectral indices in the downscaling process, the soil downscaled maps not only conserve the spatial information of coarse spatial resolution soil maps but also depict the spatial details of soil properties at fine spatial resolution. The results of this study demonstrated difference between the fine spatial resolution downscaled maps and fine spatial resolution base maps is smaller than the difference between coarse spatial resolution base maps and fine spatial resolution base maps. The appropriate and economical strategy to promote the DSM technique in smallholder farms is to develop the relatively coarse spatial resolution soil prediction maps or utilize available coarse spatial resolution soil maps at the regional scale and to disaggregate these maps to the fine spatial resolution downscaled soil maps at farm scale.
Elevated bulk tank milk somatic cell count (BMSCC) has a negative impact on milk production, milk quality, and animal health. Seasonal increases in herd level somatic cell count (SCC) are commonly associated with elev...
详细信息
Elevated bulk tank milk somatic cell count (BMSCC) has a negative impact on milk production, milk quality, and animal health. Seasonal increases in herd level somatic cell count (SCC) are commonly associated with elevated environmental temperature and humidity. The Temperature Humidity Index (THI) has been developed to measure general environmental stress in dairy cattle;however, additional work is needed to determine a specific effect of the heat stress index on herd-level SCC. generalizedadditive Model methods were used for a flexible exploration of the relationships between daily temperature, relative humidity, and bulk milk somatic cell count. The data consist of BMSCC and meteorological recordings collected between March 2009 and October 2011 of 10 dairy farms. The results indicate that, an average increase of 0.16% of BMSCC is expected for an increase of 1 degrees C degree of temperature. A complex relationship was found for relative humidity. For example, increase of 0.099%, 0.037% and 0.020% are expected in correspondence to an increase of relative humidity from 50% to 51%, 80% to 81%;and 90% to 91%, respectively. Using this model, it will be possible to provide evidence-based advice to dairy farmers for the use of THI control charts created on the basis of our statistical model.
This thesis focuses on the improvement of generalized additive models (GAMs) using rank estimators. We introduce estimation of the smoothing functions in GAMs via backfitting in a local scoring algorithm using maximiz...
详细信息
This thesis focuses on the improvement of generalized additive models (GAMs) using rank estimators. We introduce estimation of the smoothing functions in GAMs via backfitting in a local scoring algorithm using maximization of the expected log likelihood function with weights. Improvements of GAM estimation have focused on the smoothers used in the local scoring algorithm, but poor prediction for non-Gaussian data motivates the need for robust estimation of GAMs. Rank-based estimation as a robust and efficient alternative to the likelihood-based estimation of GAMs is proposed, and it is shown that rank GAM estimators can be restructured as iteratively reweighted GAM estimators. Simulations further support the use of rank-based GAM estimation for heavy-tailed or contaminated sources of data common in climate studies. Successful application of rank GAM estimation is employed for fisheries data, a field which commonly uses GAMs for their high degree of flexibility in modeling complex systems and could benefit from improved model prediction performance for non-Gaussian data. Cross-validation shows improved prediction performance for rank GAMs over GAMs, and improved adjusted R-squared values highlight the better fit of rank GAMs for the given data.
In the recent years, the frequency of motorcycle collision in Indonesia, especially in Surabaya, is constantly increasing. This article will explain the use of the generalized additive models (GAMs) to estimate motorc...
详细信息
In the recent years, the frequency of motorcycle collision in Indonesia, especially in Surabaya, is constantly increasing. This article will explain the use of the generalized additive models (GAMs) to estimate motorcycle collision on collector roads in Surabaya. This study uses GAMs with Gaussian distribution and logarithmic link function, as well as application of software R in data processing. The case study takes place on urban roads in Surabaya, Indonesia. In this study, 69 roads of 120 collector roads in Surabaya are selected. The final model of this study indicates the relationship between the frequency of motorcycle collision on collector roads with explanatory variables which consists of traffic volume, road length, accessibility, road width, number of lanes, and traffic speed. Increasing values of explanatory variables in the prediction model lead to increased risk of accidents. These findings are expected to be considered in programs planned to reduce motorcycle collision on collector roads in Surabaya and other cities. (C) 2015 The Authors. Published by Elsevier Ltd.
We study generalized additive models, with shape restrictions (e.g. monotonicity, convexity and concavity) imposed on each component of the additive prediction function. We show that this framework facilitates a non-p...
详细信息
We study generalized additive models, with shape restrictions (e.g. monotonicity, convexity and concavity) imposed on each component of the additive prediction function. We show that this framework facilitates a non-parametric estimator of each additive component, obtained by maximizing the likelihood. The procedure is free of tuning parameters and under mild conditions is proved to be uniformly consistent on compact intervals. More generally, our methodology can be applied to generalizedadditive index models. Here again, the procedure can be justified on theoretical grounds and, like the original algorithm, has highly competitive finite sample performance. Practical utility is illustrated through the use of these methods in the analysis of two real data sets. Our algorithms are publicly available in the R package scar, short for shape-constrained additive regression.
In the recent years, the frequency of motorcycle collision in Indonesia, especially in Surabaya, is constantly increasing. This article will explain the use of the generalized additive models (GAMs) to estimate motorc...
详细信息
In the recent years, the frequency of motorcycle collision in Indonesia, especially in Surabaya, is constantly increasing. This article will explain the use of the generalized additive models (GAMs) to estimate motorcycle collision on collector roads in Surabaya. This study uses GAMs with Gaussian distribution and logarithmic link function, as well as application of software R in data processing. The case study takes place on urban roads in Surabaya, Indonesia. In this study, 69 roads of 120 collector roads in Surabaya are selected. The final model of this study indicates the relationship between the frequency of motorcycle collision on collector roads with explanatory variables which consists of traffic volume, road length, accessibility, road width, number of lanes, and traffic speed. Increasing values of explanatory variables in the prediction model lead to increased risk of accidents. These findings are expected to be considered in programs planned to reduce motorcycle collision on collector roads in Surabaya and other cities.
Clinicians are very interested in researching what are important determinants of hospitalization for respiratory disease. In this paper, a general model to explain the relationship between the risk of respiratory dise...
详细信息
ISBN:
(纸本)9781479947195
Clinicians are very interested in researching what are important determinants of hospitalization for respiratory disease. In this paper, a general model to explain the relationship between the risk of respiratory disease and several meteorological variables will be presented by the framework of generalized additive models (GAMs) and its predictive effects will be evaluated. By using 9655 medical records with respiratory disease in a county in central China and daily meteorological data, a reasonably good fit was obtained. The result shows that the general method which was presented by this paper to discover the relationship between the meteorological factors and the hospitalization rate for respiratory disease is can explain most of the variation in the daily counts of hospital admissions.
This study investigates how the inclusion of El Nino episodes affects the modeling of recruitment in North Pacific albacore. The relationship between spawning stock size and recruitment for the 1976 - 2004 period, inc...
详细信息
This study investigates how the inclusion of El Nino episodes affects the modeling of recruitment in North Pacific albacore. The relationship between spawning stock size and recruitment for the 1976 - 2004 period, including environmental variables (Southern Oscillation Index and sea surface temperature anomalies), was conducted by general linear and generalized additive models (GAM). The results indicate that the Southern Oscillation Index and the interaction between spawning stock size and sea surface temperature anomalies have significant effects on the recruitment of North Pacific albacore. GAM fit the original data better than general linear models, according to the Akaike information criterion. The recruitment declined when El Nino episodes occurred, but increased when La Nino episodes occurred. The recruitment also increased when the spawning stock size density was above 11.21 million tons with a -0.3 degrees C sea surface temperature anomaly. El Nino was observed to have both long-period (above 10 - 15 years) and short-period (1 - 2 years) effects on North Pacific recruitment.
We propose Interpretable generalizedadditive Neural Networks (IGANN), a novel machine learning model that uses gradient boosting and tailored neural networks to obtain high predictive performance while being interpre...
详细信息
We propose Interpretable generalizedadditive Neural Networks (IGANN), a novel machine learning model that uses gradient boosting and tailored neural networks to obtain high predictive performance while being interpretable to humans. We derive an efficient training algorithm based on the theory of extreme learning machines, that allows reducing the training process to solving a sequence of regularized linear regressions. We analyze the algorithm theoretically, provide insights into the rate of change of so-called shape functions, and show that the computational complexity of the training process scales linearly with the number of samples in the training dataset. We implement IGANN in PyTorch, which allows the model to be trained on graphics processing units (GPUs) to speed up training. We demonstrate favorable results in a variety of numerical experiments and showcase IGANN's value in three real-world case studies for productivity prediction, credit scoring, and criminal recidivism prediction. (c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license ( http://***/licenses/by/4.0/ )
暂无评论