In most high-risk applications, interpretability is crucial for ensuring system safety and trust. However, existing research often relies on hard-to-understand, highly parameterized models, such as neural networks. In...
详细信息
ISBN:
(纸本)9783031569562;9783031569579
In most high-risk applications, interpretability is crucial for ensuring system safety and trust. However, existing research often relies on hard-to-understand, highly parameterized models, such as neural networks. In this paper, we focus on the problem of policy search in continuous observations and actions spaces. We leverage two graph-based genetic programming (GP) techniques-Cartesian geneticprogramming (CGP) and Linear geneticprogramming (LGP)-to develop effective yet interpretable control policies. Our experimental evaluation on eight continuous robotic control benchmarks shows competitive results compared to state-of-the-art Reinforcement Learning (RL) algorithms. Moreover, we find that graph-based GP tends towards small, interpretable graphs even when competitive with RL. By examining these graphs, we are able to explain the discovered policies, paving the way for trustworthy AI in the domain of continuous control.
graph-based genetic programming (GGP) can create interpretable control policies in graph form, but faces challenges such as local optima and solution fragility, which undermine its efficacy. Quality-Diversity (QD) has...
详细信息
ISBN:
(纸本)9798400704949
graph-based genetic programming (GGP) can create interpretable control policies in graph form, but faces challenges such as local optima and solution fragility, which undermine its efficacy. Quality-Diversity (QD) has been effective in addressing similar issues, traditionally in Artificial Neural Network (ANN) optimization. In this paper, we introduce a general graph Quality-Diversity (G-QD) framework to enhance the performance of GGP with QD optimization, obtaining a variety of interpretable, effective, and resilient policies. Using Cartesian geneticprogramming (CGP) as the GGP technique and MAP-Elites (ME) as the QD algorithm, we leverage a combination of behavior and graph structural descriptors. Experimenting on two navigation and two locomotion continuous control tasks, our framework yields an array of effective yet behaviorally and structurally diverse policies, surpassing the performance of a standard genetic Algorithm (GA). The resulting solution set also increases interpretability, allowing for insight into the control tasks. Additionally, our experiments demonstrate the robustness of the solutions to faults such as sensor damage.
We introduce a form of neutral horizontal gene transfer (HGT) to evolving graphs by graphprogramming (EGGP). We introduce the mu x lambda evolutionary algorithm (EA), where mu parents each produce lambda children who...
详细信息
We introduce a form of neutral horizontal gene transfer (HGT) to evolving graphs by graphprogramming (EGGP). We introduce the mu x lambda evolutionary algorithm (EA), where mu parents each produce lambda children who compete only with their parents. HGT events then copy the entire active component of one surviving parent into the inactive component of another parent, exchanging genetic information without reproduction. Experimental results from symbolic regression problems show that the introduction of the mu x lambda EA and HGT events improve the performance of EGGP. Comparisons with geneticprogramming and Cartesian geneticprogramming strongly favour our proposed approach. We also investigate the effect of using HGT events in neuroevolution tasks. We again find that the introduction of HGT improves the performance of EGGP, demonstrating that HGT is an effective cross-domain mechanism for recombining graphs.
A number of representation schemes have been presented for use within learning classifier systems, ranging from binary encodings to artificial neural networks. This paper presents results from an investigation into us...
详细信息
A number of representation schemes have been presented for use within learning classifier systems, ranging from binary encodings to artificial neural networks. This paper presents results from an investigation into using a temporally dynamic symbolic representation within the XCSF learning classifier system. In particular, dynamical arithmetic networks are used to represent the traditional condition-action production system rules to solve continuous-valued reinforcement learning problems and to perform symbolic regression, finding competitive performance with traditional geneticprogramming on a number of composite polynomial tasks. In addition, the network outputs are later repeatedly sampled at varying temporal intervals to perform multistep-ahead predictions of a financial time series.
Today, a lot of Automatic programming techniques have been proposed and applied various fields. graph Structured Program Evolution (GRAPE) is one of the recent Automatic programming techniques. GRAPE succeeds in gener...
详细信息
ISBN:
(纸本)9781605581316
Today, a lot of Automatic programming techniques have been proposed and applied various fields. graph Structured Program Evolution (GRAPE) is one of the recent Automatic programming techniques. GRAPE succeeds in generating the complex programs automatically. In this paper, a new generation alternation model for GRAPE, called Evolutionary Algorithm Considering Program Size (EACP), is proposed. EACP maintains the diversity of program size in the population by using particular fitness assignment and generation alternation. We apply EACP to three test problems, factorial, exponentiation and sorting a list. And we show the effectiveness of EACP and confirm evolution of maintaining the diversity of program size.
In recent Years a lot of Automatic programming techniques have developed. A typical example of Automatic programming is geneticprogramming (GP), and various extensions and representations for GP have been proposed so...
详细信息
ISBN:
(纸本)9781595936974
In recent Years a lot of Automatic programming techniques have developed. A typical example of Automatic programming is geneticprogramming (GP), and various extensions and representations for GP have been proposed so far. However, it seems that more improvements are necessary to obtain complex programs automatically. lit this paper we proposed a new method called graph Structured Program Evolution (GRAPE). The representation of GRAPE is graph structure, therefore it can represent complex programs (e.g. branches and loops) using its graph structure. Each program is constructed as an arbitrary directed graph of nodes and data set. The GRAPE program handles multiple data types rising the data set for each type, and the genotype of GRAPE is the form of a linear string of integers. We apply GRAPE to four test problems, factorial, Fibonacci sequence, exponentiation and reversing a list, and demonstrate that the optimum solution in each problem is obtained by the GRAPE system.
暂无评论