In this paper,we study the large-scale inference for a linear expectile regression *** mitigate the computational challenges in the classical asymmetric least squares(ALS)estimation under massive data,we propose a com...
详细信息
In this paper,we study the large-scale inference for a linear expectile regression *** mitigate the computational challenges in the classical asymmetric least squares(ALS)estimation under massive data,we propose a communication-efficient divide and conquer algorithm to combine the information from sub-machines through confidence *** resulting pooled estimator has a closed-form expression,and its consistency and asymptotic normality are established under mild ***,we derive the Bahadur representation of the ALS estimator,which serves as an important tool to study the relationship between the number of submachines K and the sample *** studies including both synthetic and real data examples are presented to illustrate the finite-sample performance of our method and support the theoretical results.
A two-ring network coupling by a transverse link is considered in this paper. divide and conquer algorithm is used to analyze the stability and Hopf bifurcation values. Nonlinear waves (such as synchronization and ref...
详细信息
A two-ring network coupling by a transverse link is considered in this paper. divide and conquer algorithm is used to analyze the stability and Hopf bifurcation values. Nonlinear waves (such as synchronization and reflection waves) are studied using the center manifold theorem and normal form theory. It is shown that appropriate settings of transverse coupling can guarantee the backup of the complicated dynamics from one ring to the other. (C) 2017 The Authors. Published by Elsevier B.V.
In this paper, we continue the study of bit-parallel square-based Montgomery multiplier (MM) by Li et al. (Integration 2016, IEEE Access 2018). A square-based MM use an alternative divide and conquer algorithm split a...
详细信息
This study focuses on the optimization of the Big-means algorithm for clustering large-scale datasets, exploring three distinct parallelization strategies. We conducted extensive experiments to assess the computationa...
详细信息
Tato bakalářská práce je zaměřena na tématiku plánování dráhy robotu. Stěžejním problémem je pohyb bezrozměrného (tj. bodového) robotu, bez omezení p...
详细信息
Tato bakalářská práce je zaměřena na tématiku plánování dráhy robotu. Stěžejním problémem je pohyb bezrozměrného (tj. bodového) robotu, bez omezení pohybu a ve scéně s nepohyblivými překážkami (též bodovými). Součástí práce je implementace zmíněných algoritmů pro danou třídu úloh a vymezení vhodné metody.
divide and conquer algorithm is a common strategy applied in big data. Model averaging has the natural divide-and-conquer feature, but its theory has not been developed in big data scenarios. The goal of this paper is...
详细信息
divide and conquer algorithm is a common strategy applied in big data. Model averaging has the natural divide-and-conquer feature, but its theory has not been developed in big data scenarios. The goal of this paper is to fill this gap. We propose two divide-and conquer-type model averaging estimators for linear models with distributed data. Under some regularity conditions, we show that the weights from Mallows model averaging criterion converge in L-2 to the theoretically optimal weights minimizing the risk of the model averaging estimator. We also give the bounds of the in-sample and out-of-sample mean squared errors and prove the asymptotic optimality for the proposed model averaging estimators. Our conclusions hold even when the dimensions and the number of candidate models are divergent. Simulation results and a real airline data analysis illustrate that the proposed model averaging methods perform better than the commonly used model selection and model averaging methods in distributed data cases. Our approaches contribute to model averaging theory in distributed data and parallel computations, and can be applied in big data analysis to save time and reduce the computational burden.
K-means plays a vital role in data mining and is the simplest and most widely used algorithm under the Euclidean Minimum Sum-of-Squares Clustering (MSSC) model. However, its performance drastically drops when applied ...
详细信息
K-means plays a vital role in data mining and is the simplest and most widely used algorithm under the Euclidean Minimum Sum-of-Squares Clustering (MSSC) model. However, its performance drastically drops when applied to vast amounts of data. Therefore, it is crucial to improve K-means by scaling it to big data using as few of the following computational resources as possible: data, time, and algorith-mic ingredients. We propose a new parallel scheme of using K-means and K-means++ algorithms for big data clustering that satisfies the properties of a "true big data" algorithm and outperforms the classical and recent state-of-the-art MSSC approaches in terms of solution quality and runtime. The new approach naturally implements global search by decomposing the MSSC problem without using additional meta -heuristics. This work shows that data decomposition is the basic approach to solve the big data clustering problem. The empirical success of the new algorithm allowed us to challenge the common belief that more data is required to obtain a good clustering solution. Moreover, the present work questions the es-tablished trend that more sophisticated hybrid approaches and algorithms are required to obtain a better clustering solution.(c) 2022 Elsevier Ltd. All rights reserved.
A two-ring network coupling by a transverse link is considered in this paper. divide and conquer algorithm is used to analyze the stability and Hopf bifurcation values. Nonlinear waves (such as synchronization and ref...
详细信息
A two-ring network coupling by a transverse link is considered in this paper. divide and conquer algorithm is used to analyze the stability and Hopf bifurcation values. Nonlinear waves (such as synchronization and reflection waves) are studied using the center manifold theorem and normal form theory. It is shown that appropriate settings of transverse coupling can guarantee the backup of the complicated dynamics from one ring to the other.
divide and conquer algorithm is a common strategy applied in big data. Model averaging has the natural divide-and-conquer feature, but its theory has not been developed in big data scenarios. The goal of this paper is...
详细信息
divide and conquer algorithm is a common strategy applied in big data. Model averaging has the natural divide-and-conquer feature, but its theory has not been developed in big data scenarios. The goal of this paper is to fill this gap. We propose two divide-and-conquer-type model averaging estimators for linear models with distributed data. Under some regularity conditions, we show that the weights from Mallows model averaging criterion converge in L2 to the theoretically optimal weights minimizing the risk of the model averaging estimator. We also give the bounds of the in-sample and out-of-sample mean squared errors and prove the asymptotic optimality for the proposed model averaging estimators. Our conclusions hold even when the dimensions and the number of candidate models are divergent. Simulation results and a real airline data analysis illustrate that the proposed model averaging methods perform better than the commonly used model selection and model averaging methods in distributed data cases. Our approaches contribute to model averaging theory in distributed data and parallel computations, and can be applied in big data analysis to save time and reduce the computational burden.
Exponential tail bounds are derived for solutions of max-recursive equations and for max-recursive random sequences, which typically arise as functionals of recursive structures, of random trees or in recursive algori...
详细信息
Exponential tail bounds are derived for solutions of max-recursive equations and for max-recursive random sequences, which typically arise as functionals of recursive structures, of random trees or in recursive algorithms. In particular they arise in the worst case analysis of divide and conquer algorithms, in parallel search algorithms or in the height of random tree models. For the proof we determine asymptotic bounds for the moments or for the Laplace transforms and apply a characterization of exponential tail bounds due to Kasahara (1978).
暂无评论