<!DOCTYPE html>
<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>API · NormalizingFlows.jl</title><script data-outdated-warner src="../assets/warner.js"></script><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script></head><body><div id="documenter"><nav class="docs-sidebar"><div class="docs-package-name"><span class="docs-autofit"><a href="../">NormalizingFlows.jl</a></span></div><form class="docs-search" action="../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../">Home</a></li><li class="is-active"><a class="tocitem" href>API</a><ul class="internal"><li><a class="tocitem" href="#API"><span>API</span></a></li><li><a class="tocitem" href="#Main-Function"><span>Main Function</span></a></li><li><a class="tocitem" href="#Variational-Objectives"><span>Variational Objectives</span></a></li><li><a class="tocitem" href="#Training-Loop"><span>Training Loop</span></a></li><li><a class="tocitem" href="#Utility-Functions-for-Taking-Gradient"><span>Utility Functions for Taking Gradient</span></a></li></ul></li><li><a class="tocitem" href="../example/">Example</a></li><li><a class="tocitem" href="../customized_layer/">Customize your own flow layer</a></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>API</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>API</a></li></ul></nav><div class="docs-right"><a class="docs-edit-link" href="https://github.com/TuringLang/NormalizingFlows.jl/blob/main/docs/src/api.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h2 id="API"><a class="docs-heading-anchor" href="#API">API</a><a id="API-1"></a><a class="docs-heading-anchor-permalink" href="#API" title="Permalink"></a></h2><ul><li><a href="#NormalizingFlows.elbo"><code>NormalizingFlows.elbo</code></a></li><li><a href="#NormalizingFlows.grad!"><code>NormalizingFlows.grad!</code></a></li><li><a href="#NormalizingFlows.loglikelihood"><code>NormalizingFlows.loglikelihood</code></a></li><li><a href="#NormalizingFlows.optimize"><code>NormalizingFlows.optimize</code></a></li><li><a href="#NormalizingFlows.train_flow"><code>NormalizingFlows.train_flow</code></a></li><li><a href="#NormalizingFlows.value_and_gradient!"><code>NormalizingFlows.value_and_gradient!</code></a></li></ul><h2 id="Main-Function"><a class="docs-heading-anchor" href="#Main-Function">Main Function</a><a id="Main-Function-1"></a><a class="docs-heading-anchor-permalink" href="#Main-Function" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="NormalizingFlows.train_flow" href="#NormalizingFlows.train_flow"><code>NormalizingFlows.train_flow</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">train_flow([rng::AbstractRNG, ]vo, flow, args...; kwargs...)</code></pre><p>Train the given normalizing flow <code>flow</code> by calling <code>optimize</code>.</p><p><strong>Arguments</strong></p><ul><li><code>rng::AbstractRNG</code>: random number generator</li><li><code>vo</code>: variational objective</li><li><code>flow</code>: normalizing flow to be trained, we recommend to define flow as <code>&lt;:Bijectors.TransformedDistribution</code> </li><li><code>args...</code>: additional arguments for <code>vo</code></li></ul><p><strong>Keyword Arguments</strong></p><ul><li><code>max_iters::Int=1000</code>: maximum number of iterations</li><li><code>optimiser::Optimisers.AbstractRule=Optimisers.ADAM()</code>: optimiser to compute the steps</li><li><code>ADbackend::ADTypes.AbstractADType=ADTypes.AutoZygote()</code>:    automatic differentiation backend, currently supports   <code>ADTypes.AutoZygote()</code>, <code>ADTypes.ForwardDiff()</code>, and <code>ADTypes.ReverseDiff()</code>. </li><li><code>kwargs...</code>: additional keyword arguments for <code>optimize</code> (See <a href="#NormalizingFlows.optimize"><code>optimize</code></a> for details)</li></ul><p><strong>Returns</strong></p><ul><li><code>flow_trained</code>: trained normalizing flow</li><li><code>opt_stats</code>: statistics of the optimiser during the training process    (See <a href="#NormalizingFlows.optimize"><code>optimize</code></a> for details)</li><li><code>st</code>: optimiser state for potential continuation of training</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/TuringLang/NormalizingFlows.jl/blob/c53f306796efe735f63aa9a3af9638d45a540c5c/src/NormalizingFlows.jl#L16-L41">source</a></section></article><p>The flow object can be constructed by <code>transformed</code> function in <code>Bijectors.jl</code> package. For example of Gaussian VI, we can construct the flow as follows:</p><pre><code class="language- hljs">using Distributions, Bijectors
T= Float32
q₀ = MvNormal(zeros(T, 2), ones(T, 2))
flow = Bijectors.transformed(q₀, Bijectors.Shift(zeros(2)) ∘ Bijectors.Scale(ones(T, 2)))</code></pre><p>To train the Gaussian VI targeting at distirbution <span>$p$</span> via ELBO maiximization, we can run</p><pre><code class="language- hljs">using NormalizingFlows

sample_per_iter = 10
flow_trained, stats, _ = train_flow(
    elbo,
    flow,
    logp,
    sample_per_iter;
    max_iters=2_000,
    optimiser=Optimisers.ADAM(0.01 * one(T)),
)</code></pre><h2 id="Variational-Objectives"><a class="docs-heading-anchor" href="#Variational-Objectives">Variational Objectives</a><a id="Variational-Objectives-1"></a><a class="docs-heading-anchor-permalink" href="#Variational-Objectives" title="Permalink"></a></h2><p>We have implemented two variational objectives, namely, ELBO and the log-likelihood objective.  Users can also define their own objective functions, and pass it to the <a href="#NormalizingFlows.train_flow"><code>train_flow</code></a> function. <code>train_flow</code> will optimize the flow parameters by maximizing <code>vo</code>. The objective function should take the following general form:</p><pre><code class="language-julia hljs">vo(rng, flow, args...) </code></pre><p>where <code>rng</code> is the random number generator, <code>flow</code> is the flow object, and <code>args...</code> are the additional arguments that users can pass to the objective function.</p><h4 id="Evidence-Lower-Bound-(ELBO)"><a class="docs-heading-anchor" href="#Evidence-Lower-Bound-(ELBO)">Evidence Lower Bound (ELBO)</a><a id="Evidence-Lower-Bound-(ELBO)-1"></a><a class="docs-heading-anchor-permalink" href="#Evidence-Lower-Bound-(ELBO)" title="Permalink"></a></h4><p>By maximizing the ELBO, it is equivalent to minimizing the reverse KL divergence between <span>$q_\theta$</span> and <span>$p$</span>, i.e., </p><p class="math-container">\[\begin{aligned}
&amp;\min _{\theta} \mathbb{E}_{q_{\theta}}\left[\log q_{\theta}(Z)-\log p(Z)\right]  \quad \text{(Reverse KL)}\\
&amp; = \max _{\theta} \mathbb{E}_{q_0}\left[ \log p\left(T_N \circ \cdots \circ
T_1(Z_0)\right)-\log q_0(X)+\sum_{n=1}^N \log J_n\left(F_n \circ \cdots \circ
F_1(X)\right)\right] \quad \text{(ELBO)} 
\end{aligned}\]</p><p>Reverse KL minimization is typically used for <strong>Bayesian computation</strong>,  where one only has access to the log-(unnormalized)density of the target distribution <span>$p$</span> (e.g., a Bayesian posterior distribution),  and hope to generate approximate samples from it.</p><article class="docstring"><header><a class="docstring-binding" id="NormalizingFlows.elbo" href="#NormalizingFlows.elbo"><code>NormalizingFlows.elbo</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">elbo(flow, logp, xs) 
elbo([rng, ]flow, logp, n_samples)</code></pre><p>Compute the ELBO for a batch of samples <code>xs</code> from the reference distribution <code>flow.dist</code>.</p><p><strong>Arguments</strong></p><ul><li><code>rng</code>: random number generator</li><li><code>flow</code>: variational distribution to be trained. In particular  <code>flow = transformed(q₀, T::Bijectors.Bijector)</code>,  q₀ is a reference distribution that one can easily sample and compute logpdf</li><li><code>logp</code>: log-pdf of the target distribution (not necessarily normalized)</li><li><code>xs</code>: samples from reference dist q₀</li><li><code>n_samples</code>: number of samples from reference dist q₀</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/TuringLang/NormalizingFlows.jl/blob/c53f306796efe735f63aa9a3af9638d45a540c5c/src/objectives/elbo.jl#L9-L24">source</a></section></article><h4 id="Log-likelihood"><a class="docs-heading-anchor" href="#Log-likelihood">Log-likelihood</a><a id="Log-likelihood-1"></a><a class="docs-heading-anchor-permalink" href="#Log-likelihood" title="Permalink"></a></h4><p>By maximizing the log-likelihood, it is equivalent to minimizing the forward KL divergence between <span>$q_\theta$</span> and <span>$p$</span>, i.e., </p><p class="math-container">\[\begin{aligned}
&amp; \min_{\theta} \mathbb{E}_{p}\left[\log q_{\theta}(Z)-\log p(Z)\right] \quad \text{(Forward KL)} \\
&amp; = \max_{\theta} \mathbb{E}_{p}\left[\log q_{\theta}(Z)\right] \quad \text{(Expected log-likelihood)}
\end{aligned}\]</p><p>Forward KL minimization is typically used for <strong>generative modeling</strong>,  where one is given a set of samples from the target distribution <span>$p$</span> (e.g., images) and aims to learn the density or a generative process that outputs high quality samples.</p><article class="docstring"><header><a class="docstring-binding" id="NormalizingFlows.loglikelihood" href="#NormalizingFlows.loglikelihood"><code>NormalizingFlows.loglikelihood</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">loglikelihood(flow::Bijectors.TransformedDistribution, xs::AbstractVecOrMat)</code></pre><p>Compute the log-likelihood for variational distribution flow at a batch of samples xs from  the target distribution p. </p><p><strong>Arguments</strong></p><ul><li><code>flow</code>: variational distribution to be trained. In particular  &quot;flow = transformed(q₀, T::Bijectors.Bijector)&quot;,  q₀ is a reference distribution that one can easily sample and compute logpdf</li><li><code>xs</code>: samples from the target distribution p.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/TuringLang/NormalizingFlows.jl/blob/c53f306796efe735f63aa9a3af9638d45a540c5c/src/objectives/loglikelihood.jl#L4-L16">source</a></section></article><h2 id="Training-Loop"><a class="docs-heading-anchor" href="#Training-Loop">Training Loop</a><a id="Training-Loop-1"></a><a class="docs-heading-anchor-permalink" href="#Training-Loop" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="NormalizingFlows.optimize" href="#NormalizingFlows.optimize"><code>NormalizingFlows.optimize</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">optimize(
    rng::AbstractRNG, 
    ad::ADTypes.AbstractADType, 
    vo, 
    θ₀::AbstractVector{T}, 
    re, 
    args...; 
    kwargs...
)</code></pre><p>Iteratively updating the parameters <code>θ</code> of the normalizing flow <code>re(θ)</code> by calling <code>grad!</code>  and using the given <code>optimiser</code> to compute the steps.</p><p><strong>Arguments</strong></p><ul><li><code>rng::AbstractRNG</code>: random number generator</li><li><code>ad::ADTypes.AbstractADType</code>: automatic differentiation backend</li><li><code>vo</code>: variational objective</li><li><code>θ₀::AbstractVector{T}</code>: initial parameters of the normalizing flow</li><li><code>re</code>: function that reconstructs the normalizing flow from the flattened parameters</li><li><code>args...</code>: additional arguments for <code>vo</code></li></ul><p><strong>Keyword Arguments</strong></p><ul><li><code>max_iters::Int=10000</code>: maximum number of iterations</li><li><code>optimiser::Optimisers.AbstractRule=Optimisers.ADAM()</code>: optimiser to compute the steps</li><li><code>show_progress::Bool=true</code>: whether to show the progress bar. The default information printed in the progress bar is the iteration number, the loss value, and the gradient norm.</li><li><code>callback=nothing</code>: callback function with signature <code>cb(iter, opt_state, re, θ)</code> which returns a dictionary-like object of statistics to be displayed in the progress bar. re and θ are used for reconstructing the normalizing flow in case that user  want to further axamine the status of the flow.</li><li><code>hasconverged = (iter, opt_stats, re, θ, st) -&gt; false</code>: function that checks whether the training has converged. The default is to always return false.</li><li><code>prog=ProgressMeter.Progress(           max_iters; desc=&quot;Training&quot;, barlen=31, showspeed=true, enabled=show_progress       )</code>: progress bar configuration</li></ul><p><strong>Returns</strong></p><ul><li><code>θ</code>: trained parameters of the normalizing flow</li><li><code>opt_stats</code>: statistics of the optimiser</li><li><code>st</code>: optimiser state for potential continuation of training</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/TuringLang/NormalizingFlows.jl/blob/c53f306796efe735f63aa9a3af9638d45a540c5c/src/train.jl#L67-L110">source</a></section></article><h2 id="Utility-Functions-for-Taking-Gradient"><a class="docs-heading-anchor" href="#Utility-Functions-for-Taking-Gradient">Utility Functions for Taking Gradient</a><a id="Utility-Functions-for-Taking-Gradient-1"></a><a class="docs-heading-anchor-permalink" href="#Utility-Functions-for-Taking-Gradient" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="NormalizingFlows.grad!" href="#NormalizingFlows.grad!"><code>NormalizingFlows.grad!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">grad!(
    rng::AbstractRNG,
    ad::ADTypes.AbstractADType,
    vo,
    θ_flat::AbstractVector{&lt;:Real},
    reconstruct,
    out::DiffResults.MutableDiffResult,
    args...
)</code></pre><p>Compute the value and gradient for negation of the variational objective <code>vo</code>  at <code>θ_flat</code> using the automatic differentiation backend <code>ad</code>.  </p><p>Default implementation is provided for <code>ad</code> where <code>ad</code> is one of <code>AutoZygote</code>,  <code>AutoForwardDiff</code>, <code>AutoReverseDiff</code> (with no compiled tape), and <code>AutoEnzyme</code>. The result is stored in <code>out</code>.</p><p><strong>Arguments</strong></p><ul><li><code>rng::AbstractRNG</code>: random number generator</li><li><code>ad::ADTypes.AbstractADType</code>: automatic differentiation backend, currently supports   <code>ADTypes.AutoZygote()</code>, <code>ADTypes.ForwardDiff()</code>, and <code>ADTypes.ReverseDiff()</code>. </li><li><code>vo</code>: variational objective</li><li><code>θ_flat::AbstractVector{&lt;:Real}</code>: flattened parameters of the normalizing flow</li><li><code>reconstruct</code>: function that reconstructs the normalizing flow from the flattened parameters</li><li><code>out::DiffResults.MutableDiffResult</code>: mutable diff result to store the value and gradient</li><li><code>args...</code>: additional arguments for <code>vo</code></li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/TuringLang/NormalizingFlows.jl/blob/c53f306796efe735f63aa9a3af9638d45a540c5c/src/train.jl#L16-L43">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="NormalizingFlows.value_and_gradient!" href="#NormalizingFlows.value_and_gradient!"><code>NormalizingFlows.value_and_gradient!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">value_and_gradient!(
    ad::ADTypes.AbstractADType,
    f,
    θ::AbstractVector{T},
    out::DiffResults.MutableDiffResult
) where {T&lt;:Real}</code></pre><p>Compute the value and gradient of a function <code>f</code> at <code>θ</code> using the automatic differentiation backend <code>ad</code>.  The result is stored in <code>out</code>.  The function <code>f</code> must return a scalar value. The gradient is stored in <code>out</code> as a vector of the same length as <code>θ</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/TuringLang/NormalizingFlows.jl/blob/c53f306796efe735f63aa9a3af9638d45a540c5c/src/train.jl#L1-L13">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../">« Home</a><a class="docs-footer-nextpage" href="../example/">Example »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.25 on <span class="colophon-date" title="Saturday 19 August 2023 23:38">Saturday 19 August 2023</span>. Using Julia version 1.9.2.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
