DRQN
===============================

Algorithm description
-------------------------------

Deep Q-networks (DQN) is a value-based DRL algorithm.

Paper link: `Deep recurrent q-learning for partially observable mdps <https://cdn.aaai.org/ocs/11673/11673-51288-1-PB.pdf />`_



BibTex citations:

::

    @inproceedings{hausknecht2015deep,
        title={Deep recurrent q-learning for partially observable mdps},
        author={Hausknecht, Matthew and Stone, Peter},
        booktitle={2015 aaai fall symposium series},
        year={2015}
    }

