Base Off-policy Algorithms
==========================

.. currentmodule:: omnisafe.algorithms.off_policy

.. autosummary::

    DDPG
    TD3
    SAC

Deep Deterministic Policy Gradient
----------------------------------

.. card::
    :class-header: sd-bg-success sd-text-white
    :class-card: sd-outline-success sd-rounded-1

    Documentation
    ^^^

    .. autoclass:: DDPG
        :members:
        :private-members:


Twin Delayed DDPG
-----------------

.. card::
    :class-header: sd-bg-success sd-text-white
    :class-card: sd-outline-success sd-rounded-1

    Documentation
    ^^^

    .. autoclass:: TD3
        :members:
        :private-members:

Soft Actor-Critic
-----------------

.. card::
    :class-header: sd-bg-success sd-text-white
    :class-card: sd-outline-success sd-rounded-1

    Documentation
    ^^^

    .. autoclass:: SAC
        :members:
        :private-members:
