diff --git a/README.md b/README.md
index 2f7bd37897063f0762c7bcc02dbbe11b3de164d5..c830f8e8332adf20b71e1efdeb96630f237fde07 100644
--- a/README.md
+++ b/README.md
@@ -27,7 +27,7 @@ Will get 0 reward, which is basically learning to prevent falling on it's head.
 # Evolution Strategies
 After 1000 episodes, which is about 1h of learning, it will reach ~250 reward.\
 Best score until now: 292/300 in 7000 episodes \
-
+
 ## How it works
 1. Generate a randomly weighted neural net