Hindsight experience replay (HER) was introduced by OpenAI researchers in 2017. It was designed so that a reinforcement learning agent can learn from every episode, including those in which it fails to reach its goal. While it is similar to the algorithm behind AlphaZero, HER was designed for tasks with multiple goals.
The attached article was written in early 2019 by Or Rivlin. It explains how HER works and its implications for machine learning.