PGQ is an approach that combines policy gradient and Q-learning. This repository will contain an implementation of PGQ.
# PGQ
A summary of the paper can be found here.
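
To make the combination concrete, here is a minimal sketch of the core idea as we understand it from the PGQ formulation: an entropy-regularized policy implies a Q-value estimate, `Q(s, a) = alpha * (log pi(a|s) + H(pi(s))) + V(s)`, and that estimate can then be plugged into an ordinary Q-learning target. This is not the code in this repository. The function names `q_from_policy` and `q_learning_target`, the parameters, and the default values of `alpha` and `gamma` are all assumptions made for illustration.

```python
import math


def q_from_policy(logits, value, alpha=0.1):
    """Q-value estimate implied by a softmax policy and a state value.

    Assumed relation: Q(s, a) = alpha * (log pi(a|s) + H(pi(s))) + V(s).
    `logits` are unnormalized action preferences, `value` is V(s).
    """
    # Numerically stable log-softmax.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    log_pi = [x - log_z for x in logits]
    # Entropy of the policy at this state.
    entropy = -sum(math.exp(lp) * lp for lp in log_pi)
    return [alpha * (lp + entropy) + value for lp in log_pi]


def q_learning_target(reward, next_logits, next_value,
                      gamma=0.99, alpha=0.1, done=False):
    """One-step Q-learning target using the policy-implied Q-values."""
    if done:
        return reward
    next_q = q_from_policy(next_logits, next_value, alpha)
    return reward + gamma * max(next_q)
```

A useful sanity check on this sketch is that the policy-weighted average of the implied Q-values equals `value`, because the log-probability and entropy terms cancel in expectation under the policy.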