Deep reinforcement learning (DRL) has been widely applied in autonomous exploration and mapping tasks, but often struggles with the challenges of sampling efficiency, poor adaptability to unknown map sizes, and slow s...
Deep reinforcement learning (DRL) has been widely applied in autonomous exploration and mapping tasks, but often struggles with the challenges of sampling efficiency, poor adaptability to unknown map sizes, and slow simulation speed. To speed up convergence, we combine curriculum learning (CL) with DRL, and first propose a Cumulative Curriculum Reinforcement Learning (CCRL) training framework to alleviate the issue of catastrophic forgetting faced by general CL. Besides, we present a novel state representation, which considers a local egocentric map and a global exploration map resized to the fixed dimension, so as to flexibly adapt to environments with various sizes and shapes. Additionally, for facilitating the fast training of DRL models, we develop a lightweight grid-based simulator, which can substantially accelerate simulation compared to popular robot simulation platforms such as Gazebo. Based on the customized simulator, comprehensive experiments have been conducted, and the results show that the CCRL framework not only mitigates the catastrophic forgetting problem, but also improves the sample efficiency and generalization of DRL models, compared to general CL as well as without a curriculum. Our code is available at https://***/BeamanLi/CCRL_Exploration.
Indoor distance measurement plays a critical role in positioning services. The complexity of indoor environments, coupled with a limited and small sample size, results in suboptimal distance accuracy. This paper propo...
详细信息
Oral history involves conveying experienced events through spoken narratives, typically preserved and later studied by converting spoken language into text. Named entity recognition (NER) is the recognition of meaning...
详细信息
ISBN:
(数字)9798350380347
ISBN:
(纸本)9798350380354
Oral history involves conveying experienced events through spoken narratives, typically preserved and later studied by converting spoken language into text. Named entity recognition (NER) is the recognition of meaningful named entities in text, such as person names, locations, organizations, etc. Conducting NER research on oral history text is significant as it not only facilitates the verification of named entities by researchers but also allows for the extraction of keyinformation from texts through named entities. By collecting oral history corpus, we establish an oral history text dataset, manually annotating all texts with six entity labels and repeatedly verifying them manually. Utilizing the concept of machine reading comprehension (MRC), a semantic fusion module was added, integrating label information into text information using a cross-attention mechanism, and decoding the corresponding entity labels through span decoding. The final model performs well on the oral history text dataset and also showed good performance on general Chinese datasets.
This paper proposes a building energy optimization strategy based on artifical intelligence technology modeling ***,the data set generated by Energy Plus energy consumption simulation software is used as the training ...
详细信息
This paper proposes a building energy optimization strategy based on artifical intelligence technology modeling ***,the data set generated by Energy Plus energy consumption simulation software is used as the training set and test set of the Biased ReLU neural network(BRNN).Secondly,the building energy consumption prediction model and indoor temperature prediction model are built based on the Biased ReLU neural ***,model predictive control(MPC) is uesd to achieve energy saving by controlling the set temperature of the building’s Heating,Ventilation and Air Conditioning(HVAC) ***,the joint simulation of MATLAB and EnergyPlus is realized by introducing the building control virtual test bed(BCVTB).The results show that our method can effectively reduce building energy consumption.
Continual semantic segmentation (CSS) has risen as a popular field, which aims to acquire new skills constantly without forgetting past knowledge catastrophically. In CSS, we identify that there is a severe imbalance ...
详细信息
ISBN:
(数字)9798350390155
ISBN:
(纸本)9798350390162
Continual semantic segmentation (CSS) has risen as a popular field, which aims to acquire new skills constantly without forgetting past knowledge catastrophically. In CSS, we identify that there is a severe imbalance between new classes and old classes, leading to the classifier weight toward new classes. In this paper, we deal with the continual semantic segmentation problem from the class imbalance perspective via mask-based class rebalancing, avoiding the model suffering from catastrophic forgetting. More specifically, the mask-based class rebalancing depends on a mask to combine resampling with reweighting ingenuously, which mitigates the classifier bias toward new classes. Besides, we also propose a frequency knowledge distillation, leveraging multiple frequency components information to maintain the feature representation space for old classes. We demonstrate the effectiveness of our approach with an extensive evaluation of the Pascal-VOC 2012 and ADE20K datasets, significantly outperforming the state-of-the-art method.
A more precise sparse region convolutional neural network algorithm is developed to increase the accuracy of remote sensing object recognition. Firstly, a new intersection over union method is proposed to solve the pr...
详细信息
Constraints on the control input and system states as well as uncertain disturbances are the practical challenges in many motion controlsystems. In this paper, a disturbance observer (DO) with two-step disturbance co...
详细信息
ISBN:
(纸本)9781665478977
Constraints on the control input and system states as well as uncertain disturbances are the practical challenges in many motion controlsystems. In this paper, a disturbance observer (DO) with two-step disturbance compensation algorithm is proposed to deal with disturbances and enhance the robustness of the controller, while a discrete sliding mode predictive control (DSMPC) scheme is designed to handle saturation constraints on the control input and system states with the application to an autonomous underwater vehicle (AUV). The DSMPC combines the model predictive control (MPC) with the discrete sliding mode control (DSMC) using linear matrix inequalities approach. The stability of the proposed DSMPC are proved via Lyapunov stability theorem. Finally, simulation results are presented to illustrate the effectiveness of the whole control scheme.
Most current representations of protein pockets are the atom-pair graph, which ignore the global structural information of amino acids. Therefore, we propose a new molecular generation model, which uses the hypergraph...
Most current representations of protein pockets are the atom-pair graph, which ignore the global structural information of amino acids. Therefore, we propose a new molecular generation model, which uses the hypergraph to represent protein pocket structure, and combines the structural features obtained by atom-pair graph representation. These two levels of graphs are more capable of representing the complex structural information of protein pockets. Then, the graphs of the two levels of the protein pockets are input into the improved network model, which is named Hypergraph Graph Attention Fusion (HGAF), to obtain the embedding representation of the protein pockets, which is used as a condition to constrain the molecule generation. The molecules sampled by HGAF are subjected to quality assessment and docking targeting validation. Experimental results show that the molecules generated by proposed method can achieve better results in both of these assessment approaches.
Computer-aided drug design and artificial intelligence-driven drug design have accelerated drug discovery. However, how to design effective drugs that have strong interaction ability with target proteins to further im...
Computer-aided drug design and artificial intelligence-driven drug design have accelerated drug discovery. However, how to design effective drugs that have strong interaction ability with target proteins to further improve the efficacy of drugs in treating diseases remains a key issue. This paper proposes a new target-specific drug generative model 3CLpro2mol to generate new drug molecules, which uses features of drug-target interactions (DTIs) to constrain the correlation between the drug and the target protein. To obtain as many drug-target interaction features as possible from a small amount of data, a small molecule extraction strategy is proposed to ensure the diversity of small molecules in the training samples. To improve the efficiency and accuracy of the generative model, a TOP K sampling strategy is used to generate tokens, which can improve the rationality and diversity of the generated molecules. The experimental results show that the proposed model has the potential to generate small molecules that interact better with the target protein.
As a metric to measure the information freshness, Age of information (AoI) is widely used to evaluate low-latency high-reliability communication. Since the communication in Industrial Internet of Things (IIoT) often h...
详细信息
ISBN:
(数字)9798350363760
ISBN:
(纸本)9798350363777
As a metric to measure the information freshness, Age of information (AoI) is widely used to evaluate low-latency high-reliability communication. Since the communication in Industrial Internet of Things (IIoT) often has the special requirements, the relationship among AoI, reliability and block length in time-critical scenarios with bounded latency and non-time-critical scenarios without bounded latency is explored respectively in this paper, based on erasure channels using rateless codes. We build a model using LT codes over erasure channels and consider two metrics: average AoI and bounded AoI. Theoretical analysis and simulation results show that AoI has a minimum value with different block lengths. Furthermore, we draw a conclusion that the minimum value of AoI under different reliability is monotonically decreasing functioned with reliability. Finally, the impact of overhead on reliability and AoI of LT codes in time-critical scenarios is studied.
暂无评论