Mixture of distributions | MonolixSuite Documentation

Objectives: learn how to implement a mixture of distributions for the individual parameters.

Projects: PKgroup_project, PKmixt_project

Introduction

Mixed effects models allow us to take into account between-subject variability.

One complicating factor arises when data is obtained from a population with some underlying heterogeneity. If we assume that the population consists of several homogeneous subpopulations, a straightforward extension of mixed effects models is a finite mixture of mixed effects models.

There are two approaches to define a mixture of models:

defining a mixture of structural models (via a regressor or via the bsmm function),
introducing a categorical covariate (known or latent). This approach is detailed here.

The second approach assumes that the probability distribution of some individual parameters vary from one subpopulation to another one. The introduction of a categorical covariate (e.g., sex, phenotype, treatment, status, etc.) into such a model already supposes that the whole population can be decomposed into subpopulations. The covariate then serves as a label for assigning each individual to a subpopulation.

In practice, the covariate can either be known or not. If it is unknown, the covariate is called a latent covariate and is defined as a random variable with a user-defined number of modalities in the statistical model. Differences in estimation and diagnosis methods appear to deal with this additional random variable: this difference represents a task of unsupervised classification.
Mixture models usually refer to models for which the categorical covariate is unknown and unsupervised classification is needed.
For the sake of simplicity, we will consider a basic model that involves individual parameters and observations . Then, the easiest way to model a finite mixture model is to introduce a label sequence that takes its values in such that if subject i belongs to subpopulation m.
In some situations, the label sequence is known and can be used as a categorical covariate in the model. If is unknown, it can be modeled as a set of independent random variables taking their values in where for , is the probability that individual i belongs to group m. We will assume furthermore that the are identically distributed, i.e., does not depend on i for .

Mixture of distributions based on a categorical covariate

PKgroup_project (data = ‘PKmixt_data.txt’, model = ‘lib:oral1_1cpt_kaVCl.txt’)

The sequence of labels is known as GROUP in this project and comes from the dataset. It is therefore defined as a categorical covariate that classifies We can then assume, for instance different population values for the volume in the two groups and estimate the population parameters using this covariate model.

Then, this covariate GROUP can be used as a stratification variable and is very important in the modeling.

Mixture of distributions based on unsupervised classification with a latent covariate

A latent covariate is defined as a random variable , and the probability of each modality is part of the statistical model and is estimated as well. Methods for estimation and diagnosis are different.

The latent covariate belongs to a discrete set where is the number of modalities. For each individual, indicates which modality they belong to.

Estimation Process

The SAEM algorithm handles latent covariates in the following way:

The algorithm starts sampling individual parameters based on initial estimates of population parameters . Initial probabilities are assigned uniformly to each latent modality

At each iteration, for each individual the algorithm calculates the conditional probability of each latent category (the posterior distribution of ) given the data and the current individual parameters :

This formula is based on Baye’s theorem, where

is the probability of the observed data given the current individual parameter and latent covariate modality
is the prior probability of the individual parameters given that the latent covariate modality
is the prior probability of the modality
is the joint probability density of the observed data and individual parameter

To ensure probabilities sum to 1 across all possible modalities , the posterior probability distribution needs to be normalized over the sum of probability distribution of all modalities:

Notice, due to the normalization the term cancels out.

Assuming a normal distribution for , and a log-normal distribution for the calculation admits an analytical solution, as all terms in the expression above have a closed-form representation:

(at initialization)

From this closed form the algorithm draws a new value of for each individual from the calculated conditional distribution . Using the newly sampled values, the algorithm generates new random effects, transforms them into individual parameters, and updates the population parameters following standard SAEM procedures. As the iterations progress, population parameters and the latent category probabilities are progressively refined.

After the estimation, for each individual the categorical covariate is not perfectly known, only the probabilities of each modality are estimated.

Note also that latent covariates can be useful to model statistical mixtures of populations, but they provide no biological interpretation for the cause of the heterogeneity in the population since they do not come from the dataset.

Latent covariates can not be handled with IOV.

PKmixt_project (data = ‘PKmixt_data.txt’, model = ‘lib:oral1_1cpt_kaVCl.txt’)

We will use the same data with this project but ignoring the column GROUP (which is equivalent to assuming that the label is unknown). If we suspect some heterogeneity in the population, we can introduce a “latent covariate” by clicking on the grey button LATENT.

It is possible to change the name and the number of modalities of this latent covariate.

Several latent covariates can be introduced in the model, with different number of categories.

We can then use this latent covariate lcat as any observed categorical covariate. Again, we can assume again different population values for the volume in the two groups by applying it on the volume random effect and estimating the population parameters using this covariate model. Proportions of each group are also estimated, plcat_1 which is the probability to have modality 1:

Once the population parameters are estimated, the sequence of latent covariates, i.e. the group to which belongs each subject, can be estimated together with the individual parameters, as the modes of the conditional distributions.

The sequence of estimated latent covariates lcat can be used as a stratification variable. We can for example display the VPC in the 2 groups:

By plotting the distribution of the individual parameters, we see that V has a bimodal distribution:

Last updated: March 13, 2026