Skip to content
Snippets Groups Projects
Commit bc97e9ab authored by Philip Maas's avatar Philip Maas
Browse files

Update README.md

parent cc34f10b
Branches
No related tags found
No related merge requests found
......@@ -7,7 +7,7 @@ Coming soon
# Action Mutation
Will get 0 reward, which is basically learning to prevent falling on it's head. The more actions the walker can use, the worse the reward.
This is because the walker tries to generate movement by trembling with it's legs. The covered distance doesn't cover the punishment for doing actions. So after 1600 moves the walker will get a reward arounf -60
This is because the walker tries to generate movement by trembling with it's legs. The covered distance doesn't cover the punishment for doing actions. So after 1600 moves the walker will get a reward around -60.
![Reward](./MutateActions/5_50_50_0.2.png)
## How it works
......@@ -61,6 +61,6 @@ We use Windows, Anaconda and Python 3.7 \
# Important Sources
Environment: https://github.com/openai/gym/wiki/BipedalWalker-v2 \
Table of all Environments: https://github.com/openai/gym/wiki/Table-of-environments
Table of all Environments: https://github.com/openai/gym/wiki/Table-of-environments \
OpenAI Website: https://gym.openai.com/envs/BipedalWalker-v2/ \
More on evolution strategies: https://openai.com/blog/evolution-strategies/
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment