Our goal is to find the parameters
which maximizes the
incomplete data likelihood
. Directly marginalizing the
complete data likelihood
cannot give us a close-form
solution to the maximization problem. However, it fits well into
the category of the problems which can be solved by the EM
algorithm [2]. Our target function is given by
| (6) | |||
| (7) |
which can be decoupled into two parts,
![]() |
(8) | ||
![]() |
(9) |
The first term is not dependent on the parameters
, we only need
to maximize the second term in the maximization step. A further
derivation shows it is equivalent to maximize,
![]() |
(10) |
The EM is iteratively performed by two steps. In the
``expectation'' step, we compute the posterior probabilities
based on the current parameters,
In the ``maximization'' step, we take the derivatives with respect
to
and
and let them be zero, we have the estimated
epitome means and variances as follows,
![]() |
(12) | ||
![]() |
(13) |