We describe a method that enables the multiplex screening of a pool of many different donor cell lines. Our method accurately predicts each donor proportion from the pool without requiring the use of unique DNA barcod...
详细信息
We describe a method that enables the multiplex screening of a pool of many different donor cell lines. Our method accurately predicts each donor proportion from the pool without requiring the use of unique DNA barcodes as markers of donor identity. Instead, we take advantage of common single nucleotide polymorphisms, whole-genome sequencing, and an algorithm to calculate the proportions from the sequencing data. By testing using simulated and real data, we showed that our method robustly predicts the individual proportions from a mixed-pool of numerous donors, thus enabling the multiplexed testing of diverse donor cells en masse. More information is available at https://***/poolseq/
The main goal of the motif finding problem is to detect novel, over-represented unknown signals in a set of sequences (e.g. transcription factor binding sites in a genome). The most widely used algorithms for finding ...
详细信息
The main goal of the motif finding problem is to detect novel, over-represented unknown signals in a set of sequences (e.g. transcription factor binding sites in a genome). The most widely used algorithms for finding motifs obtain a generative probabilistic representation of these over-represented signals and try to discover profiles that maximize the information content score. Although these profiles form a very powerful representation of the signals, the major difficulty arises from the fact that the best motif corresponds to the global maximum of a non-convex continuous function. Popular algorithms like expectationmaximization (EM) and Gibbs sampling tend to be very sensitive to the initial guesses and are known to converge to the nearest local maximum very quickly. In order to improve the quality of the results, EM is used with multiple random starts or any other powerful stochastic global methods that might yield promising initial guesses ( like projection algorithms). Global methods do not necessarily give initial guesses in the convergence region of the best local maximum but rather suggest that a promising solution is in the neighborhood region. In this paper, we introduce a novel optimization framework that searches the neighborhood regions of the initial alignment in a systematic manner to explore the multiple local optimal solutions. This effective search is achieved by transforming the original optimization problem into its corresponding dynamical system and estimating the practical stability boundary of the local maximum. Our results show that the popularly used EM algorithm often converges to suboptimal solutions which can be significantly improved by the proposed neighborhood profile search. Based on experiments using both synthetic and real datasets, our method demonstrates significant improvements in the information content scores of the probabilistic models. The proposed method also gives the flexibility in using different local solvers and global
In the fiber production process, the stretching process plays a key role in the quality of the final fiber product. Due to the fiber stretching process with inborn nonlinearity, the performance of a single controller ...
详细信息
In the fiber production process, the stretching process plays a key role in the quality of the final fiber product. Due to the fiber stretching process with inborn nonlinearity, the performance of a single controller and an optimizer may be compromised or even unsatisfactory. Thus, we consider a multi-model identification method for the fiber stretching process. The dynamic transitions among different operating points are achieved by the change of the operating conditions in the fiber stretching process. To excite all of the nonlinearity character in the fiber stretching process, the transitions among different operating conditions is achieved. The structure of each sub-models, operating points, operating range are assumed. Based on the input output data of the process, a linear parameter varying (LPV) model is built by applying a probability identification method. To achieve the smoothly connected among the different operating conditions, an exponential function is used. Then a global LPV model is constructed by synthesizing the local models. Simulated results show that the LPV method has the effectiveness in solving the inherent nonlinearity of the fiber stretching process.
Owing to operating condition changing, physical mutation, and sudden shocks, degradation trajectories usually exhibit multi-phase features, and the abrupt jump often appears at the changing time, which makes the tradi...
详细信息
Owing to operating condition changing, physical mutation, and sudden shocks, degradation trajectories usually exhibit multi-phase features, and the abrupt jump often appears at the changing time, which makes the traditional methods of lifetime estimation unavailable. In this paper, we mainly focus on how to estimate the lifetime of the multi-phase degradation process with abrupt jumps at the change points under the concept of the first passage time (FPT). Firstly, a multi-phase degradation model with jumps based on the Wiener process is formulated to describe the multi-phase degradation pattern. Then, we attain the lifetime's closed-form expression for the two-phase model with fixed jump relying on the distribution of the degradation state at the change point. Furthermore, we continue to investigate the lifetime estimation of the degradation process with random effect caused by unit-to-unit variability and the multi-phase degradation process. We extend the results of the two-phase case with fixed parameters to these two cases. For better implementation, a model identification method with off-line and on-line parts based on expectationmaximization (EM) algorithm and Bayesian rule is proposed. Finally, a numerical case study and a practical example of gyro are provided for illustration.
In order to effectively improve the recommendation accuracy of collaborative screening of e-commerce products, ensure the results of collaborative screening of e-commerce products, and reduce the time consumption of c...
详细信息
Public security heroic model culture is a political-cultural phenomenon with Chinese characteristics formed by public security organs in the process of selecting, recognizing, publicizing, and appeasing heroic models....
详细信息
BackgroundThe Naive Bayes (NB) classifier is a powerful supervised algorithm widely used in Machine Learning (ML). However, its effectiveness relies on a strict assumption of conditional independence, which is often v...
详细信息
BackgroundThe Naive Bayes (NB) classifier is a powerful supervised algorithm widely used in Machine Learning (ML). However, its effectiveness relies on a strict assumption of conditional independence, which is often violated in real-world scenarios. To address this limitation, various studies have explored extensions of NB that tackle the issue of non-conditional independence in the data. These approaches can be broadly categorized into two main categories: feature selection and structure *** this particular study, we propose a novel approach to enhancing NB by introducing a latent variable as the parent of the attributes. We define this latent variable using a flexible technique called Bayesian Latent Class Analysis (BLCA). As a result, our final model combines the strengths of NB and BLCA, giving rise to what we refer to as NB-BLCA. By incorporating the latent variable, we aim to capture complex dependencies among the attributes and improve the overall performance of the *** expectation-maximization (EM) algorithm and the Gibbs sampling approach were offered for parameter learning. A simulation study was conducted to evaluate the classification of the model in comparison with the ordinary NB model. In addition, real-world data related to 976 Gastric Cancer (GC) and 1189 Non-ulcer dyspepsia (NUD) patients was used to show the model's performance in an actual application. The validity of models was evaluated using the 10-fold *** presented model was superior to ordinary NB in all the simulation scenarios according to higher classification sensitivity and specificity in test data. The NB-BLCA model using Gibbs sampling accuracy was 87.77 (95% CI: 84.87-90.29). This index was estimated at 77.22 (95% CI: 73.64-80.53) and 74.71 (95% CI: 71.02-78.15) for the NB-BLCA model using the EM algorithm and ordinary NB classifier, *** considering the modification of the NB classifier, incorporating a lat
The widespread adoption of nonlinear power electronic devices in residential settings has significantly increased the stochasticity and uncertainty of power systems. The original load power data, characterized by nume...
详细信息
The Dynamic State Estimation (DSE) for Inverter-Based Resources (IBRs) is an emerging topic as IBRs gradually replace synchronous generators (SGs) in power systems. Unlike SGs, the dynamic models of IBRs heavily depen...
详细信息
暂无评论