The Deep Reinforcement Learning algorithm Advantage Weighted Regression (AWR) has been around since October 2019.
The original implementation of the novel algorithm in software was done in Python, using the library TensorFlow.
My own implementation of AWR is written with PyTorch, making it very useful for anyone using this library in their DRL project.
My implementation is the first of its kind on GitHub. It also entails a comprehensive training framework which allows for customisation on multiple levels (hyper-parameters, problems, network architectures) and spares the user the usual overhead of setting all that up.
Below you can see the result of my AWR implementation as applied to the Pendulum problem: