Multi-Agent Learner
===================

.. toctree::
  :hidden:

  IQL_Learner <marl/iql>
  VDN_Learner <marl/vdn>
  QMIX_Learner <marl/qmix>
  WQMIX_Learner <marl/wqmix>
  QTRAN_Learner <marl/qtran>
  DCG_Learner <marl/dcg>
  IDDPG_Learner <marl/iddpg>
  MADDPG_Learner <marl/maddpg>
  ISAC_Learner <marl/isac>
  MASAC_Learner <marl/masac>
  IPPO_Learner <marl/ippo>
  MAPPO_Learner <marl/mappo>
  MATD3_Learner <marl/matd3>
  VDAC_Learner <marl/vdac>
  COMA_Learner <marl/coma>
  MFQ_Learner <marl/mfq>
  MFAC_Learner <marl/mfac>


.. list-table:: 
   :header-rows: 1

   * - Multi-Agent Learner
     - PyTorch 
     - TensorFlow
     - MindSpore
   * - :doc:`IQL <marl/iql>`: Independent Q-Learning
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`VDN <marl/vdn>`: Value-Decomposition Networks
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`QMIX <marl/qmix>`: VDN with Q-Mixer
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`WQMIX <marl/wqmix>`: Weighted QMIX
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`QTRAN <marl/qtran>`: Q-Transformation
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`DCG <marl/dcg>`: Deep Coordination Graph
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`IDDPG <marl/iddpg>`: Independent DDPG
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`MADDPG <marl/maddpg>`: Multi-Agent DDPG
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`ISAC <marl/isac>`: Independent SAC
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`MASAC <marl/masac>`: Multi-Agent SAC
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`IPPO <marl/ippo>`: Independent PPO
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`MAPPO <marl/mappo>`: Multi-Agent PPO
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`MATD3 <marl/matd3>`: Multi-Agent TD3
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`VDAC <marl/vdac>`: Value-Decomposition Actor-Critic
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`COMA <marl/coma>`: Counterfactual Multi-Agent PG
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`MFQ <marl/mfq>`: Mean-Field Q-Learning
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
   * - :doc:`MFAC <marl/mfac>`: Mean-Field Actor-Critic
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`
     - .. centered:: :math:`\checkmark`