With the rapid development of deep neural networks, underwater vision plays an increasingly important role in the underwater robotic operation. However, the scarce underwater datasets greatly limit the performance of ...
详细信息
Reinforcement learning has gained a lot of attention and applications in the field of autonomous driving in recent years. However, the actual scenarios of automatic driving applications are often diverse, so the reinf...
详细信息
ISBN:
(纸本)9780738133669
Reinforcement learning has gained a lot of attention and applications in the field of autonomous driving in recent years. However, the actual scenarios of automatic driving applications are often diverse, so the reinforcement learning algorithm using only a single driving strategy is difficult to meet the multiple requirements of efficiency and safety in the multi-scenarios autonomous driving task. To solve this challenge, we propose a hierarchical reinforcement learning structure to learn a unified top-level switching master policy between different driving styles policies. The whole framework uses a bottom-up training manner with diverse reward function designing. Through experimental comparison, our method exceeds the performance of single policy and rule-based switching strategy. Based on this framework, we won the first place in the DAI 2020 Autonomous Driving Workshop single-agent track competition.
In recent years, control under urban intersection scenarios has become an emerging research topic. In such scenarios, the autonomous vehicle confronts complicated situations since it must deal with the interaction wit...
详细信息
ISBN:
(纸本)9781728190488
In recent years, control under urban intersection scenarios has become an emerging research topic. In such scenarios, the autonomous vehicle confronts complicated situations since it must deal with the interaction with social vehicles timely while obeying the traffic rules. Generally, the autonomous vehicle is supposed to avoid collisions while pursuing better efficiency. The existing work fails to provide a framework that emphasizes the integrity of the scenarios while deploying and testing reinforcement learning(RL) methods. Specifically, we propose a benchmark for training and testing RL-based autonomous driving agents in complex intersection scenarios, which is called RL-CIS. Then, a set of baselines consisting various algorithms are deployed. The test benchmark and baselines provide a fair and comprehensive training and testing platform for the study of RL for autonomous driving in the intersection scenario, advancing RL-based methods for autonomous driving control. The code of our proposed framework can be found at https://***/liuyuqi123/complexUrbanScenarios.
Automatically recognizing surgical gestures plays a key role in computer-assisted surgery and automatic skill assessment. Solutions to this task that only rely on video without additional sensor hardware have attracte...
详细信息
Traffic data analysis and mining are elemental functions of Intelligent Transportation systems. In recent year, tremendous sensors are deployed in order to collect big data, and equipment maintenance costs a lot. With...
详细信息
ISBN:
(纸本)9781728191423
Traffic data analysis and mining are elemental functions of Intelligent Transportation systems. In recent year, tremendous sensors are deployed in order to collect big data, and equipment maintenance costs a lot. With the development of deep learning, especially especially Generative Adversarial Networks, we can generate realistic big artificial traffic flow data and use small real traffic data and synthesized traffic data in traffic data mining tasks. In this paper, we focus on discovering the semantics embedded in latent codes which are fed into Generative Adversarial Networks, and propose to use the interpolation of semantic latent code to generate semantic manipulation of traffic flow. We evaluate our approach using the publicly available data from Caltrans Performance Measurements systems (PeMS), and experimental results show the the effectiveness of the proposed method.
The development of autonomous driving has attracted extensive attention in recent years, and it is essential to evaluate the performance of autonomous driving. However, testing on the road is expensive and inefficient...
详细信息
ISBN:
(纸本)9781450385855
The development of autonomous driving has attracted extensive attention in recent years, and it is essential to evaluate the performance of autonomous driving. However, testing on the road is expensive and inefficient. Virtual testing is the primary way to validate and verify self-driving cars, and the basis of virtual testing is to build simulation scenarios. In this paper, we propose a training, testing, and evaluation pipeline for the lane-changing task from the perspective of deep reinforcement learning. First, we design lane change scenarios for training and testing, where the test scenarios include stochastic and deterministic parts. Then, we deploy a set of benchmarks consisting of learning and non-learning approaches. We train several state-of-the-art deep reinforcement learning methods in the designed training scenarios and provide the benchmark metrics evaluation results of the trained models in the test scenarios. The designed lane-changing scenarios and benchmarks are both opened to provide a consistent experimental environment for the lane-changing task(1).
The deep learning methods trained with large-scale manually annotated datasets have led to significant breakthroughs in medical image community. However, obtaining such datasets remains a challenging work in medical d...
详细信息
Hand gesture recognition plays a significant role in human-computer interaction for understanding various human gestures and their intent. However, most prior works can only recognize gestures of limited labeled class...
详细信息
ISBN:
(纸本)9781728188089
Hand gesture recognition plays a significant role in human-computer interaction for understanding various human gestures and their intent. However, most prior works can only recognize gestures of limited labeled classes and fail to adapt to new categories. The task of Generalized Zero-Shot Learning (GZSL) for hand gesture recognition aims to address the above issue by leveraging semantic representations and detecting both seen and unseen class samples. In this paper, we propose an end-to-end prototype-based GZSL framework for hand gesture recognition which consists of two branches. The first branch is a prototype-based detector that learns gesture representations and determines whether an input sample belongs to a seen or unseen category. The second branch is a zero-shot label predictor which takes the features of unseen classes as input and outputs predictions through a learned mapping mechanism between the feature and the semantic space. We further establish a hand gesture dataset that specifically targets this GZSL task, and comprehensive experiments on this dataset demonstrate the effectiveness of our proposed approach on recognizing both seen and unseen gestures.
Image-based driver fatigue detection remains a challenging problem due to occlusion of face, the variation of head poses and illuminations. This paper implements an effectual technique for investigating the driver'...
详细信息
ISBN:
(纸本)9781665422482
Image-based driver fatigue detection remains a challenging problem due to occlusion of face, the variation of head poses and illuminations. This paper implements an effectual technique for investigating the driver's fatigue state by using infrared image of an eye in the open or closed condition. In this method we use the deep learning technique to monitor the change, i.e., open and closed conditions of eye state. We integrate ResNet and depthwise convolution network together and use as the core of the structure of the network to perform face and facial landmark detection tasks. After acquiring the eye region, we perform the eye state identification task by using its coordinates of feature points. To determine fatigue, we use PERCLOS method and the results confirm accuracy and effectiveness of the algorithm by comparing with other existing methods. We can reach an accuracy of 97.2% and the average time is 31.20 milliseconds, represent that this driver monitoring inference system has significant importance for both the traffic and driver's safety.
暂无评论