Deep reinforcement learning is a focus research area in artificial intelligence. The principle of optimality in dynamic programming is a key to the success of reinforcement learning methods. The principle of adaptive ...
详细信息
The key of designing MCMC algorithm is the choice of acceptance function. In this work, Selection criteria of acceptance function is given, and an improved Multi-point Metropolis algorithm with generic acceptance func...
详细信息
In recent years, with the rapid development of stereoscopic display technology, its applications have become increasingly popular in many fields, and, meanwhile, the number of audiences is also growing. The problem of...
详细信息
Face verification is the task of determining whether two given face images represent the same person or not. It is a very challenging task, as the face images, captured in the uncontrolled environments, may have large...
详细信息
This paper addresses the novel design of a biomimetic underwater vehicle (BUV) propelled by undulatory fins and its heading control problems. Inspired by the cuttlefish, which can perform flexible motions by undulator...
详细信息
作者:
Xu, RuijieChen, ShichaoSun, WenqiaoLv, YishengLuo, JialiangTang, YingInstitute of Automation
Chinese Academy of Sciences College of Information Science & Technology Beijing University of Chemical Technology The State Key Laboratory for Management and Control of Complex System State Key Laboratory of Multimodal Artificial Intelligence Systems Beijing China Institute of Automation
Chinese Academy of Sciences The Center of National Railway Intelligent Transportation System Engineering and Technology China Academy of Railway Sciences Corporation Limited The State Key Laboratory for Management and Control of Complex System State Key Laboratory of Multimodal Artificial Intelligence Systems Beijing China Transportation and Economics Research Institute
The Center of National Railway Intelligent Transportation System Engineering and Technology China Academy of Railway Sciences Corporation Limited Beijing China Institute of Automation
Chinese Academy of Sciences The State Key Laboratory for Management and Control of Complex System State Key Laboratory of Multimodal Artificial Intelligence Systems Beijing China Institute of Automation
Chinese Academy of Sciences China University of Geosciences Beijing School of Information Engineering The State Key Laboratory for Management and Control of Complex System State Key Laboratory of Multimodal Artificial Intelligence Systems Beijing China Rowan University
Department of Electrical and Computer Engineering Glassboro United States
Global Navigation Satellite systems (GNSS) can provide real-time positioning information for outdoor users, but cannot for indoor scenarios or heavily occluded outdoor scenarios. Strap-down Inertial Navigation system ...
详细信息
This paper proposes to obtain high-level, domain-robust representations for cross-view face recognition. Specially, we introduce Convolutional Deep Belief Networks (CDBN) as the feature learning model, and an CDBN bas...
详细信息
Online interactions,especially user generated contents on social events,reveal a variety of communicative purposes ranging from expressing feelings to proposing *** intents in users' online interactive behavior fr...
详细信息
ISBN:
(纸本)9781509036202
Online interactions,especially user generated contents on social events,reveal a variety of communicative purposes ranging from expressing feelings to proposing *** intents in users' online interactive behavior from massive social media data can effectively identify users' motives and intents behind communication and provide important information to aid monitoring,analysis and decision making for a variety of ***,user intents recognition from online communication is inherently challenging due to the ambiguity in semantic processing and diversity of syntax ***,the massive online data are usually unlabeled,which greatly hinders the usage of typical machine learning based methods that can automate the recognition *** this paper,we tackle this problem by proposing a Speech Act Theory guided classification scheme,which regards online communication as performative actions of users and classifies user utterances according to their pragmatic *** the basis of this,we construct a dictionary of performative words,expand it using external knowledge sources and refine it by word embedding and similarity *** then use this dictionary to automatically label the online textual data with *** a large amount of the labeled data,we train feature based classifiers to identify user intents in their online *** experimental study using a microblog dataset on social events from Sina Weibo shows the effectiveness of our proposed method.
A sphere-based list forwarding scheme for multiple-input multiple-output(MIMO) relay networks is proposed and analyzed. Firstly, an estimate forwarding(EF) method is proposed, which forwards the minimum mean squared e...
详细信息
A sphere-based list forwarding scheme for multiple-input multiple-output(MIMO) relay networks is proposed and analyzed. Firstly, an estimate forwarding(EF) method is proposed, which forwards the minimum mean squared error(MMSE) estimate of the source data to the destination. Since it performs like amplify-and-forward(AF) and decode-and-forward(DF) for the low and high signal-to-noise ratio(SNR) regions, respectively, the EF relay thus outperforms conventional AF and DF across all SNRs without the need for switching algorithms for different SNRs. Because computational complexity is however high for relays with a large number of antennas(large MIMO) and/or high order constellations, list EF for large MIMO relay networks is proposed. It computes a list sphere decoder based MMSE estimate and retains the advantages of the exact EF relay at a negligible performance loss. The proposed list EF could offer a flexible trade-off between the performance and computational complexity.
The soft continuum arm has extensive application in industrial production and human life due to its superior safety and flexibility. Reinforcement learning is a powerful technique for solving soft arm continuous contr...
详细信息
The soft continuum arm has extensive application in industrial production and human life due to its superior safety and flexibility. Reinforcement learning is a powerful technique for solving soft arm continuous control problems, which can learn an effective control policy with an unknown system model. However, it is often affected by the high sample complexity and requires huge amounts of data to train, which limits its effectiveness in soft arm control. An improved policy gradient method, policy gradient integrating long and short-term rewards denoted as PGLS, is proposed in this paper to overcome this issue. The shortterm rewards provide more dynamic-aware exploration directions for policy learning and improve the exploration efficiency of the algorithm. PGLS can be integrated into current policy gradient algorithms, such as deep deterministic policy gradient(DDPG). The overall control framework is realized and demonstrated in a dynamics simulation environment. Simulation results show that this approach can effectively control the soft arm to reach and track the targets. Compared with DDPG and other model-free reinforcement learning algorithms, the proposed PGLS algorithm has a great improvement in convergence speed and performance. In addition, a fluid-driven soft manipulator is designed and fabricated in this paper, which can verify the proposed PGLS algorithm in real experiments in the future.
暂无评论