API Guide
=========

.. toctree::
   :maxdepth: 4

   models
   tensor_parallel
   context_parallel
   pipeline_parallel
   custom_fsdp
   fusions
   transformer
   moe
   dist_checkpointing
   dist_optimizer
   distributed
   datasets
   multi_latent_attention
   num_microbatches_calculator
   optimizer_param_scheduler
   optimizer_cpu_offload
   multi_token_prediction
