In recommendation systems, sparse data and cold-start users have always been a challenging *** a linear upper confidence bound (UCB) bandit approach as the item selection strategy based on the user's historical ratings and user-item context, we model the recommendation problem as a multi-armed bandit (MAB) problem in this *** the engine to recommend while it learns, we adopt probabilistic matrix factorization (PMF) in this strategy learning phase after observing the *** particular, we propose a new approach to get the upper bound statistics out of latent feature *** the experiment, we use two public datasets (Netflix and MovieLens) to evaluate our proposed *** model shows good results, especially on cold-start users.
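The abstract above does not give the algorithm in detail, so the following is only a minimal sketch of a generic linear UCB selection rule, not the authors' method. It assumes that each item is described by a fixed feature vector (for example, a latent vector produced by PMF) and that a rating is roughly linear in those features. The class name `LinUCB` and the parameters `alpha` and `reg` are hypothetical names chosen for illustration.

```python
import numpy as np


class LinUCB:
    """Generic linear UCB for one user (illustrative; not the paper's exact model)."""

    def __init__(self, dim, alpha=1.0, reg=1.0):
        self.alpha = alpha              # exploration weight (assumed hyperparameter)
        self.A = reg * np.eye(dim)      # regularized Gram matrix of observed item features
        self.b = np.zeros(dim)          # rating-weighted sum of observed item features

    def scores(self, item_features):
        """Return the upper confidence bound for each row of item_features."""
        A_inv = np.linalg.inv(self.A)
        theta = A_inv @ self.b                      # ridge estimate of the user vector
        mean = item_features @ theta                # predicted ratings
        # Confidence width sqrt(x^T A^{-1} x), computed for every item at once.
        width = np.sqrt(np.einsum("ij,jk,ik->i", item_features, A_inv, item_features))
        return mean + self.alpha * width

    def select(self, item_features):
        """Pick the item with the largest upper confidence bound."""
        return int(np.argmax(self.scores(item_features)))

    def update(self, x, rating):
        """Fold one observed (item feature, rating) pair into the statistics."""
        self.A += np.outer(x, x)
        self.b += rating * x
```

In this sketch a cold-start user begins with no observations, so every item has a zero predicted rating and the confidence width alone drives selection; as ratings arrive, the width shrinks for items similar to those already rated.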
Motivated by the latest research on feasible space monitoring of multiple control barrier functions (CBFs) as well as polytopic collision avoidance, this paper studies the Polytope Volume Monitoring (PVM) problem, who...
For constrained piecewise linear (PWL) systems, possible model uncertainty complicates the design of model predictive control (MPC) based on mixed integer programming (MIP). This paper combines the robust method and the hybrid method to design MPC for PWL systems with structured uncertainty. In the proposed approach, since the system model is known at the current time, a free control move is optimized as the current control input. Meanwhile, the MPC controller uses a sequence of feedback control laws as the future control actions, where each feedback control law in the sequence corresponds to one partition, and the arbitrary switching technique is adopted to handle all possible switching. Furthermore, to reduce the online computational burden of MPC, a segmented design procedure is suggested that exploits the characteristics of the proposed approach. An offline design algorithm is then proposed, and the reserved degree of freedom can be used online to optimize the control input with a lower computational burden.
This paper presents a novel cooperative value iteration (VI)-based adaptive dynamic programming method for multi-player differential game models with a convergence *** players are divided into two groups in the learning process and adapt their policies *** method removes the dependence on admissible initial policies, which is one of the main drawbacks of the PI-based ***, this algorithm enables the players to adapt their control policies without full knowledge of others' system parameters or control *** efficacy of our method is illustrated by three examples.
Authors:
ZHANG Yun, LU Runyan, CAI Yunze
Department of Automation; Key Laboratory of System Control and Information Processing of Ministry of Education; Key Laboratory of Marine Intelligent Equipment and System of Ministry of Education; Shanghai Jiao Tong University, Shanghai 200240, China
In situation assessment (SA) of missile versus target fighter, traditional SA models are generally characterized by strong subjectivity and poor dynamic *** paper considers SA as an expectation of future returns and establishes a missile-target simulation battle *** actor-critic (AC) algorithm in reinforcement learning (RL) is used to train the evaluation network, and a missile-target SA model is established in simulation battle *** and comparative experiments show that the model can effectively estimate the expected effect of a missile attack under the current situation, and it provides an effective basis for missile attack decisions.
A cyber-physical system (CPS) is a system of systems consisting of many subsystems that can each stand alone, and it can be treated as a typical complex network. CPS can be applied in the critical infr...
The development of an innovative H∞ controller for looper and tension control in hot strip finishing mills is traced based on an approximately linearized model. This solution has been considered thanks to its well-known robustness and simplicity in attenuating disturbances. The controller is designed based on an optimization problem with linear matrix inequality (LMI) constraints, and the problem is solved by the mincx function of the Matlab LMI Toolbox. Simulation results show the effectiveness of the proposed controller compared with conventional ones.
Subcellular localization of proteins can provide key hints for inferring their functions and structures in cells. With recent breakthroughs in molecular imaging techniques, the use of 2D bioimages has become increasingly popular for automatically analyzing protein subcellular location patterns. Compared with the widely used protein 1D amino acid sequence data, images of protein distribution are more intuitive and interpretable, making them a better choice in many applications for revealing the dynamic characteristics of proteins, such as detecting protein translocation and quantifying proteins. In this paper, we systematically review recent progress in the field of automated image-based protein subcellular location prediction and classify it into four categories: the growth of bioimage databases, the description of subcellular location distribution patterns, classification methods, and applications of the prediction systems. We also discuss some potential directions in this field.
Low energy consumption and limited power supply are significant factors for wireless sensor networks (WSNs); thus, distributed state estimation and data fusion with quantized innovation are explored. The universal features of practical WSNs are investigated, and a dynamic transmission strategy is introduced. Furthermore, quantized state estimation based on Bayesian theory is derived. Unlike previous algorithms, which are suited to processing scalar measurements, the proposed distributed data fusion algorithm is applicable to general vector measurements. The efficiency of the proposed dynamic transmission strategy is also analyzed. It is concluded that the proposed algorithm is more efficient than previous methods, and its estimation accuracy is comparable to that of standard Kalman filtering, which is based on analog-amplitude vector measurements.
In this paper, a distributed consensus protocol is proposed for discrete-time single-integrator multi-agent systems with measurement noises under general fixed directed *** time-varying control gains satisfying the stochastic approximation conditions are introduced to attenuate noises, so the closed-loop multi-agent system is intrinsically a linear time-varying stochastic difference *** the mean square consensus convergence analysis is developed based on the Lyapunov technique, and the construction of the Lyapunov function notably does not require the typical balanced network topology condition assumed for the existence of quadratic Lyapunov ***, the proposed consensus protocol is applicable to more general networked multi-agent systems, particularly when the bidirectional and/or balanced information exchanges between agents are not *** the proposed protocol, it is proved that the state of each agent converges in mean square to a common random variable whose mathematical expectation is the weighted average of the agents' initial state values; meanwhile, the random variable's variance is bounded.
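The abstract does not state the protocol explicitly, so the following is only a minimal simulation sketch of a common form of such a protocol, under stated assumptions. Each agent i moves toward noisy measurements of its neighbors' states with a decaying gain a(k) = 1/(k+1), which satisfies the stochastic approximation conditions (the gains sum to infinity while their squares have a finite sum). The function name `simulate_consensus`, the gain choice, and the Gaussian noise model are illustrative assumptions, not taken from the paper.

```python
import numpy as np


def simulate_consensus(weights, x0, steps=2000, noise_std=0.1, seed=0):
    """Simulate x_i(k+1) = x_i(k) + a(k) * sum_j w_ij * (x_j(k) + noise_ij(k) - x_i(k)).

    weights[i][j] > 0 means agent i receives a noisy measurement of agent j.
    The gain a(k) = 1 / (k + 1) is an assumed stochastic-approximation gain.
    """
    rng = np.random.default_rng(seed)
    W = np.asarray(weights, dtype=float)
    x = np.asarray(x0, dtype=float).copy()
    n = x.size
    for k in range(steps):
        gain = 1.0 / (k + 1)
        noise = rng.normal(0.0, noise_std, size=(n, n))   # independent noise per link
        # diff[i, j] is agent i's noisy view of x_j minus its own state.
        diff = (x[None, :] + noise) - x[:, None]
        x = x + gain * (W * diff).sum(axis=1)
    return x


# A strongly connected but unbalanced directed graph on three agents:
# agent 0 listens to agent 2, agent 1 listens to agent 0,
# and agent 2 listens to agents 0 and 1 with equal weight.
W_EXAMPLE = [[0.0, 0.0, 1.0],
             [1.0, 0.0, 0.0],
             [0.5, 0.5, 0.0]]

if __name__ == "__main__":
    print(simulate_consensus(W_EXAMPLE, [1.0, 5.0, 10.0]))
```

For this example graph the left eigenvector of the weight matrix associated with eigenvalue 1 is proportional to (0.4, 0.2, 0.4), so without noise the agents agree on the weighted average 0.4·1 + 0.2·5 + 0.4·10 = 5.4 of the initial states. With noise, the agreed value becomes random but stays close to that average, which matches the qualitative claim in the abstract.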