Base On-policy Algorithms
=========================

.. currentmodule:: omnisafe.algorithms.on_policy

.. autosummary::

    PolicyGradient
    NaturalPG
    TRPO
    PPO

Policy Gradient
---------------

.. card::
    :class-header: sd-bg-success sd-text-white
    :class-card: sd-outline-success  sd-rounded-1

    Documentation
    ^^^

    .. autoclass:: PolicyGradient
        :members:
        :private-members:


Natural Policy Gradient
-----------------------

.. card::
    :class-header: sd-bg-success sd-text-white
    :class-card: sd-outline-success  sd-rounded-1

    Documentation
    ^^^

    .. autoclass:: NaturalPG
        :members:
        :private-members:

.. _trpoapi:

Trust Region Policy Optimization
--------------------------------

.. card::
    :class-header: sd-bg-success sd-text-white
    :class-card: sd-outline-success  sd-rounded-1

    Documentation
    ^^^

    .. autoclass:: TRPO
        :members:
        :private-members:




.. _ppoapi:


Proximal Policy Optimization
----------------------------

.. card::
    :class-header: sd-bg-success sd-text-white
    :class-card: sd-outline-success  sd-rounded-1

    Documentation
    ^^^

    .. autoclass:: PPO
        :members:
        :private-members:
