\section{Background}
\input{related}
\subsection{Hybrid Markov Logic Networks}

Markov Logic Networks (MLNs)~\citep{domingos&lowd09} compactly represent probabilistic graphical models (PGMs) in the form of first-order logic formulas to define a distribution over possible worlds, where a world is an assignment to all the ground atoms (an atom substituted with constants) in the MLN. 
Hybrid Markov Logic Networks (HMLNs) generalize MLNs to include both continuous and discrete variables. Specifically, a HMLN consists of pairs $\{(F_i,\Theta_i)\}_{i=1}^n$, where $F_i$ is a first-order formula that can contain one or more real-valued terms and $\Theta_i$ is its weight.

Each ground formula of the HMLN (substituting variables in a formula with constants from their respective domains) represents a potential function in a PGM, where the ground atoms are nodes and a clique between them represents the potential function. Symbolic ground predicates are binary random variables and real-valued ground predicates have a continuous value. The probability distribution is a log-linear model defined as follows.
\begin{equation}\label{eq:loglinearmodel}
    P_{\Theta}({\bf \omega}) = \frac{1}{Z}\exp\left(\sum_i \Theta_is_i({\bf \omega})\right)
\end{equation}
where $\omega$ is a world, i.e., an assignment to all ground predicates and $Z$ is the {\em partition function}, i.e., $Z=\sum_{\omega'}\exp\left(\sum_i \theta_is_i({\bf \omega'})\right)$. $s_i(\omega)$ is the sum of values over all groundings of $F_i$. 
 
We can perform marginal inference in HMLNs using Gibbs sampling~\citep{geman&geman84}. Specifically, given a query variable $Y$ (we assume this to be a single variable) and observed evidence ${\bf X}$ $=$ ${\bf x}$ (that includes real-valued terms), we compute $P(Y=y|{\bf X})$ as follows. We initialize the assignments to all non-evidence variables (${\bf Y}$) as
${\bf y}^{(0)}$. In each iteration, we pick a single variable $Y'\in{\bf Y}$ and sample an assignment to $Y'$ $=$ $y'$ from the conditional HMLN distribution ${P}_{\Theta}(Y'|{\bf y}_{-y'},{\bf x})$ to obtain the next state of the Markov Chain ${\bf y}^{(1)}$. Here, ${\bf y}_{-y'}$ is the assignment to all variables other than $Y'$ and therefore the conditional distribution is typically easy to compute. We estimate the marginal probabilities from samples collected after a {\em burn-in} period (initial samples are ignored to allow the MCMC chain to mix) using the following estimator.
\begin{equation}\label{eq:margest}
    \hat{P}(Y=y|{\bf x}) = \frac{1}{T}\sum_{t=1}^T{P}_{\theta}(Y=y|{\bf y}_{-y}^{(t)},{\bf x})
\end{equation}
It can be shown that as $T\rightarrow\infty$, $\hat{P}(Y=y|{\bf x})$ converges to the true marginal probability $P(Y=y|{\bf x})$.
