Commit 54ed8f19 authored by Philip Maas

Merge remote-tracking branch 'origin/main'

parents 7bcec2fe e70e9139
......@@ -62,17 +62,20 @@ After 1000 episodes, which is about 1h of learning, it will reach ~250 reward.\
✅ Best score until now: 304/300 in under 7000 episodes with a decaying learning rate and mutation factor. \
\
Learning curve:\
![Rewards while learning](./EvolutionStrategies/Experiments/12_1_50_decaying_decaying_300/12_1_50_decaying_decaying_300.png)
![Rewards while learning](./EvolutionStrategies/learningCurve.png)
\
\
Rewards of fully learned agent in 50 episodes:\
![Rewards 50 Episodes](./EvolutionStrategies/Experiments/12_1_50_decaying_decaying_300/50episodes.png)
![Rewards 50 Episodes](./EvolutionStrategies/50episodes.png)
## How it works
1. Generate a randomly weighted neural net
2. Create a population of neural nets with mutated weights
3. Let every net finish an episode and reward it accordingly
4. The better the reward, the higher the chance to pass weights to next gen
Also: decay alpha and sigma to 0.05 after 1000 gens and to 0.01 after 5000 gens, for more precise learning once the agent has passed the local extremum of just standing around.
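
The steps above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the repository's code: it assumes alpha is `LEARNING_RATE` and sigma is the scale of the mutation noise, it perturbs every weight with Gaussian noise (rather than mutating only the fraction given by `MUTATION_FACTOR`), and it combines the mutants with a reward-weighted average. The names `decayed`, `evolve_step` and `train` are hypothetical, and `fitness` stands in for "play one episode with these weights and return the reward".

```python
import random


def decayed(gen, initial):
    """Step schedule from the note above (assumed to apply from gen 1000 / 5000 on).

    Never raises the value above its initial setting.
    """
    if gen >= 5000:
        return min(initial, 0.01)
    if gen >= 1000:
        return min(initial, 0.05)
    return initial


def evolve_step(weights, fitness, pop_size, alpha, sigma, rng):
    """One generation: mutate, evaluate, and pass on weights in proportion to reward."""
    # Step 2: one Gaussian noise vector per mutant.
    noises = [[rng.gauss(0.0, 1.0) for _ in weights] for _ in range(pop_size)]
    # Step 3: every mutant plays one episode and receives a reward.
    rewards = [
        fitness([w + sigma * n for w, n in zip(weights, noise)])
        for noise in noises
    ]
    mean = sum(rewards) / pop_size
    std = (sum((r - mean) ** 2 for r in rewards) / pop_size) ** 0.5
    if std == 0.0:
        return list(weights)  # all mutants scored the same: no signal to follow
    # Step 4: standardized rewards decide how strongly each mutant contributes.
    scores = [(r - mean) / std for r in rewards]
    return [
        w + alpha / (pop_size * sigma)
        * sum(s * noise[i] for s, noise in zip(scores, noises))
        for i, w in enumerate(weights)
    ]


def train(weights, fitness, gens, pop_size=50, alpha=0.1, sigma=0.1, seed=0):
    """Run `gens` generations, decaying alpha and sigma along the way."""
    rng = random.Random(seed)
    for gen in range(gens):
        weights = evolve_step(
            weights, fitness, pop_size,
            decayed(gen, alpha), decayed(gen, sigma), rng,
        )
    return weights
```

In this sketch the starting `weights` list plays the role of the randomly weighted net from step 1, flattened into a single vector. Standardizing the rewards keeps the update size independent of the reward scale, which is one common design choice; the project itself may weight mutants differently.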
## Hyperparameters
| Parameter | Description | Interval | Our Choice |
|-------------------|-------------------------------------------------------------|-----------|------------|
......@@ -80,7 +83,7 @@ Rewards of fully learned agent in 50 episodes:\
| `BIAS` | Add a bias neuron to the input layer. | {0,1} | 0 |
| `POP_SIZE` | Size of population. | [0;∞[ | 50 |
| `MUTATION_FACTOR` | Percentage of weights that will be mutated for each mutant. | [0;1] | 0.1 |
| `LEARNING_RATE` | This is the rate of learning. | [0;1] | 0.03 |
| `LEARNING_RATE` | This is the rate of learning. | [0;1] | 0.1 |
| `GENS` | Number of generations. | [0;∞[ | 2000 |
| `MAX_STEPS` | Number of steps that are played in one episode. | [0; 1600] | 300 |
......