The continual reassessment method (CRM) is a commonly used dose-finding design for phase I clinical trials. Practical applications of this method have been restricted by two limitations: (1) the requirement that the t...
详细信息
The continual reassessment method (CRM) is a commonly used dose-finding design for phase I clinical trials. Practical applications of this method have been restricted by two limitations: (1) the requirement that the toxicity outcome needs to be observed shortly after the initiation of the treatment;and (2) the potential sensitivity to the prespecified toxicity probability at each dose. To overcome these limitations, we naturally treat the unobserved toxicity outcomes as missing data, and use the expectation-maximization (EM) algorithm to estimate the dose toxicity probabilities based on the incomplete data to direct dose assignment. To enhance the robustness of the design, we propose prespecifying multiple sets of toxicity probabilities, each set corresponding to an individual CRM model. We carry out these multiple CRMs in parallel, across which model selection and model averaging procedures are used to make more robust inference. We evaluate the operating characteristics of the proposed robust EM-CRM designs through simulation studies and show that the proposed methods satisfactorily resolve both limitations of the CRM. Besides improving the MTD selection percentage, the new designs dramatically shorten the duration of the trial, and are robust to the prespecification of the toxicity probabilities.
Spatial autoregressive model is widely concerned in the economic field, whereas when the data is missing, variable selection and parameter estimation of the model is quite challenging. Based on this, we discuss the va...
详细信息
Spatial autoregressive model is widely concerned in the economic field, whereas when the data is missing, variable selection and parameter estimation of the model is quite challenging. Based on this, we discuss the variable selection in spatial autoregressive model with missing data. Under the condition that errors are independent and identically distributed, we have developed a penalized quasi -maximum likelihood method to achieve variable selection and parameter estimation simultaneously in the presence of missing responses. The method's theoretical properties, including consistency and asymptotical normality, are established under certain assumptions. Meanwhile, an improved expectation-maximization algorithm is provided for optimizing the penalized quasi -maximum likelihood function. Simulations are conducted to examine the proposed method and assess the finite -sample performance. Additionally, we present a practical example to illustrate the method's application.
Thresholding, an important approach for image segmentation, is the first step in the image processing for many industrial applications. The efficiency and effectiveness of the thresholding method is the key to the suc...
详细信息
Thresholding, an important approach for image segmentation, is the first step in the image processing for many industrial applications. The efficiency and effectiveness of the thresholding method is the key to the success of the consecutive process steps. This study proposed an optimization algorithm (named as AOE) combining parametric and non-parametric approaches. An ant colony system (ACS-Otsu) algorithm considers the non-parametric objective between-class variance while the expectationmaximization (EM) algorithm focuses on the parametric objective overall fitting error of probability distributions. Since the performance of the EM method is sensitive to the initial solution, the ACS-Otsu algorithm is employed as a robust and efficient initialization strategy. Experimental results of the nine test images show that the AOE algorithm is efficient and effective in the multilevel thresholding problems. Comparisons between the AOE algorithm and the PSO+EM algorithm in the literature also verify that the AOE algorithm not only provides competitive thresholding results but also outperforms PSO+EM in computational expense.
The increased penetration of solar photovoltaic (PV) energy sources into electric grids has increased the need for accurate modeling and prediction of solar irradiance and power production. Existing modeling and predi...
详细信息
ISBN:
(纸本)9781538626993
The increased penetration of solar photovoltaic (PV) energy sources into electric grids has increased the need for accurate modeling and prediction of solar irradiance and power production. Existing modeling and prediction techniques focus on long-term low-resolution prediction over minutes to years. This paper examines the stochastic modeling and short-term high-resolution prediction of solar irradiance and PV power output. We propose a stochastic state-space model to characterize the behaviors of solar irradiance and PV power output. This prediction model is suitable for the development of optimal power controllers for PV sources. A filter-based expectation-maximization and Kalman filtering mechanism is employed to estimate the parameters and states in the statespace model. The mechanism results in a finite dimensional filter which only uses the first and second order statistics. The structure of the scheme contributes to a direct prediction of the solar irradiance and PV power output without any linearization process or simplifying assumptions of the signal's model. This enables the system to accurately predict small as well as large fluctuations of the solar signals. The mechanism is recursive allowing the solar irradiance and PV power to be predicted online from measurements. The mechanism is tested using solar irradiance and PV power measurement data collected locally in our laboratory.
Consider semi-supervised learning for classification, where both labelled and unlabelled data are available for training. The goal is to exploit both datasets to achieve higher prediction accuracy than just using labe...
详细信息
Consider semi-supervised learning for classification, where both labelled and unlabelled data are available for training. The goal is to exploit both datasets to achieve higher prediction accuracy than just using labelled data alone. We develop a semi-supervised logistic learning method based on exponential tilt mixture models by extending a statistical equivalence between logistic regression and exponential tilt modelling. We study maximum nonparametric likelihood estimation and derive novel objective functions that are shown to be Fisher probability consistent. We also propose regularized estimation and construct simple and highly interpretable expectation-maximization (EM) algorithms. Finally, we present numerical results that demonstrate the advantage of the proposed methods compared with existing methods.
We recently introduced the high-resolution nonnegative matrix factorization (HR-NMF) model for representing mixtures of non-stationary signals in the time-frequency domain, and we highlighted its capability to both re...
详细信息
We recently introduced the high-resolution nonnegative matrix factorization (HR-NMF) model for representing mixtures of non-stationary signals in the time-frequency domain, and we highlighted its capability to both reach a high spectral resolution and reconstruct high quality audio signals. An expectation-maximization (EM) algorithm was also proposed for estimating its parameters. In this paper, we replace the maximization step by multiplicative update rules (MUR), in order to improve the convergence rate. We also introduce general MUR that are not limited to nonnegative parameters, and we propose a new insight into the EM algorithm, which shows that MUR and EM actually belong to the same family. We thus introduce a continuum of algorithms between them. Experiments confirm that the proposed approach permits to overcome the convergence rate of the EM algorithm.
According to the level of information provided in images, segmentation techniques can be categorized into two groups. One is region-labeling, which obeys the intensity-based classification methods. Although modeling t...
详细信息
ISBN:
(纸本)9781424478149
According to the level of information provided in images, segmentation techniques can be categorized into two groups. One is region-labeling, which obeys the intensity-based classification methods. Although modeling the tissue intensity is straightforward by applying local statistical methods and spatial dependencies, the results might suffer from noise and incomplete data. The second group of techniques applies active contour models, in which the objective is to find the optimal partition of the image domain using a closed or open curve by using prior constraints on the shape variation. However, estimating optimal curve is intractable due to the incomplete observation data. This paper extends a previously reported joint active contour model for medical image segmentation in a new expectation-maximization (EM) framework, wherein the evolution curve is constrained not only by a shape-based statistical model but also by applying a hidden variable model from the image observation. In this approach, the hidden variable model is defined by the local voxel labeling computed from its likelihood function, depended on the image functions and the prior anatomical knowledge. Comparative results on segmenting putamen and caudate shapes in MR brain images confirmed both robustness and accuracy of the proposed curve evolution algorithm.
Nowadays, nuclear imaging is increasingly used for non-invasive diagnosis. The image modalities in nuclear imaging suffer of worse statistics, in comparison with computed tomography, since they are based on emission t...
详细信息
ISBN:
(纸本)9781479914920
Nowadays, nuclear imaging is increasingly used for non-invasive diagnosis. The image modalities in nuclear imaging suffer of worse statistics, in comparison with computed tomography, since they are based on emission transition tomography. Thus, precise reconstruction methods that can deal with incomplete or missing measurements are needed in order to improve the quality of nuclear images. In this paper we present a generalization of the state of the art EMML and ISRA algorithms for emission computed tomography reconstruction. The proposed method was tested and validated in comparison with the mentioned state of the art methods on a set of synthetic data. Better results (in terms of speed of convergence) were obtained for certain parameter settings.
In a two-mode network, the nodes are divided into two types (primary nodes and secondary nodes), and connections exist only between nodes of different types. In reality, in such a two-mode network, one-mode network co...
详细信息
In a two-mode network, the nodes are divided into two types (primary nodes and secondary nodes), and connections exist only between nodes of different types. In reality, in such a two-mode network, one-mode network connections may also exist among primary nodes, and these two kinds of networks are usually not independent and coexistent. In this paper, we first propose a group Rasch mixture network model that focuses on the connections between primary nodes and secondary nodes, while incorporating the group structure and linkage information of primary nodes. We then develop a modified expectation-maximization algorithm to estimate the proposed model with a lambda-BIC method for selecting the tuning parameter. Additionally, we provide a likelihood-ratio test statistic to examine whether the two kinds of networks are independent and implement the leave-one-out method to construct a network prediction rule. Finally, we establish asymptotic results and demonstrate the numerical performance of the proposed methods using both simulations and the *** dataset.
Although weather regimes are often used as a primary step in many statistical downscaling processes, they are usually defined solely in terms of atmospheric variables and seldom to maximize their correlation to observ...
详细信息
Although weather regimes are often used as a primary step in many statistical downscaling processes, they are usually defined solely in terms of atmospheric variables and seldom to maximize their correlation to observed local meteorological phenomena. This paper compares different clustering methods to perform such a task. The correlation clustering model is introduced to define regimes that are well correlated to local-scale precipitation observed on seven French Mediterranean rain gauges. This clustering method is compared to other approaches such as the k-means and "expectation-maximization" (EM) algorithms. The two latter are applied either to the main principal components of large-scale reanalysis data(geopotential height at 500 mbar and sea level pressure) covering the Mediterranean basin or to the canonical variates associated with large scale and resulting from a canonical correlation analysis performed on reanalyses and local precipitation. The weather regimes obtained by the different approaches are compared, with a focus on the "extreme content" captured within the regimes. Then, cost functions are developed to quantify the errors due to misclassification, in terms of local precipitation. The different clustering approaches show different misclassification and costs. EM applied to canonical variates appears as a good compromise between the other approaches, with high discrimination, overall for extreme precipitation, while the precipitation costs due to bad classification are acceptable. This paper provides tools to help the users choose the clustering method to be used according to the expected goal and the use of the weather regimes.
暂无评论