
Laplace Diagonal Fisher

posteriors.laplace.diag_fisher.build(log_posterior, per_sample=False, init_prec_diag=0.0)

Builds a transform for diagonal empirical Fisher information Laplace approximation.

The empirical Fisher is defined here as: $$ F(θ) = \sum_i ∇_θ \log p(y_i, θ | x_i) ∇_θ \log p(y_i, θ | x_i)^T $$ where \(p(y_i, θ | x_i)\) is the joint model distribution (equivalent to the posterior up to proportionality) with parameters \(θ\), inputs \(x_i\) and labels \(y_i\).

More info on empirical Fisher matrices can be found in [Martens, 2020](https://jmlr.org/papers/volume21/17-678/17-678.pdf) and their use within a Laplace approximation in [Daxberger et al, 2021](https://arxiv.org/abs/2106.14806).
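As a concrete illustration of the formula above (a toy sketch, not the posteriors API), the diagonal of the empirical Fisher is the sum over samples of the squared per-sample score, shown here for a hypothetical linear-Gaussian model with a flat prior:

```python
import torch

# Toy sketch: the diagonal of the empirical Fisher is the sum over samples
# of the squared per-sample score grad log p(y_i, theta | x_i).
torch.manual_seed(0)
theta = torch.tensor([0.5, -1.0], requires_grad=True)
xs = torch.randn(8, 2)
ys = xs @ torch.tensor([1.0, 2.0]) + 0.1 * torch.randn(8)

def log_joint(theta, x, y):
    # Unit-variance Gaussian likelihood, flat prior (joint proportional to posterior)
    return -0.5 * (y - x @ theta) ** 2

diag_fisher = torch.zeros(2)
for x, y in zip(xs, ys):
    (g,) = torch.autograd.grad(log_joint(theta, x, y), theta)
    diag_fisher += g.square()
```

Note the outer product \(∇\log p \, ∇\log p^T\) reduces to an elementwise square when only the diagonal is retained, which is what makes this approximation cheap to accumulate.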

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| `log_posterior` | `LogProbFn` | Function that takes parameters and input batch and returns the log posterior value (which can be unnormalised) as well as auxiliary information, e.g. from the model call. | *required* |
| `per_sample` | `bool` | If `True`, `log_posterior` is assumed to return a vector of log posteriors, one per sample in the batch. If `False`, `log_posterior` is assumed to return a scalar log posterior for the whole batch; in this case `torch.func.vmap` will be called, which is typically slower than writing `log_posterior` to be per-sample directly. | `False` |
| `init_prec_diag` | `TensorTree \| float` | Initial diagonal precision matrix. Can be a tree matching `params` or a scalar. | `0.0` |

Returns:

| Type | Description |
| --- | --- |
| `Transform` | Diagonal empirical Fisher information Laplace approximation transform instance. |

Source code in posteriors/laplace/diag_fisher.py
def build(
    log_posterior: LogProbFn,
    per_sample: bool = False,
    init_prec_diag: TensorTree | float = 0.0,
) -> Transform:
    """Builds a transform for diagonal empirical Fisher information
    Laplace approximation.

    The empirical Fisher is defined here as:
    $$
    F(θ) = \\sum_i ∇_θ \\log p(y_i, θ | x_i) ∇_θ \\log p(y_i, θ | x_i)^T
    $$
    where $p(y_i, θ | x_i)$ is the joint model distribution (equivalent to the posterior
    up to proportionality) with parameters $θ$, inputs $x_i$ and labels $y_i$.

    More info on empirical Fisher matrices can be found in
    [Martens, 2020](https://jmlr.org/papers/volume21/17-678/17-678.pdf) and
    their use within a Laplace approximation in [Daxberger et al, 2021](https://arxiv.org/abs/2106.14806).

    Args:
        log_posterior: Function that takes parameters and input batch and
            returns the log posterior value (which can be unnormalised)
            as well as auxiliary information, e.g. from the model call.
        per_sample: If True, then log_posterior is assumed to return a vector of
            log posteriors for each sample in the batch. If False, then log_posterior
            is assumed to return a scalar log posterior for the whole batch, in this
            case torch.func.vmap will be called, this is typically slower than
            directly writing log_posterior to be per sample.
        init_prec_diag: Initial diagonal precision matrix.
            Can be tree like params or scalar.

    Returns:
        Diagonal empirical Fisher information Laplace approximation transform instance.
    """
    init_fn = partial(init, init_prec_diag=init_prec_diag)
    update_fn = partial(update, log_posterior=log_posterior, per_sample=per_sample)
    return Transform(init_fn, update_fn)

posteriors.laplace.diag_fisher.DiagLaplaceState

Bases: NamedTuple

State encoding a diagonal Normal distribution over parameters.

Attributes:

| Name | Type | Description |
| --- | --- | --- |
| `params` | `TensorTree` | Mean of the Normal distribution. |
| `prec_diag` | `TensorTree` | Diagonal of the precision matrix of the Normal distribution. |
| `aux` | `Any` | Auxiliary information from the `log_posterior` call. |

Source code in posteriors/laplace/diag_fisher.py
class DiagLaplaceState(NamedTuple):
    """State encoding a diagonal Normal distribution over parameters.

    Attributes:
        params: Mean of the Normal distribution.
        prec_diag: Diagonal of the precision matrix of the Normal distribution.
        aux: Auxiliary information from the log_posterior call.
    """

    params: TensorTree
    prec_diag: TensorTree
    aux: Any = None
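A quick sketch of how such a `NamedTuple` state behaves (using a hypothetical `DiagState` stand-in): fields are immutable, and `_replace` returns a copy with selected fields updated, which is how the update step refreshes `aux` without mutating the rest of the state:

```python
import torch
from typing import Any, NamedTuple

# Hypothetical stand-in mirroring DiagLaplaceState's shape.
class DiagState(NamedTuple):
    params: torch.Tensor
    prec_diag: torch.Tensor
    aux: Any = None

state = DiagState(params=torch.zeros(3), prec_diag=torch.ones(3))
# _replace copies the tuple, swapping only the named field.
updated = state._replace(aux={"logits": None})
```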

posteriors.laplace.diag_fisher.init(params, init_prec_diag=0.0)

Initialise diagonal Normal distribution over parameters.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| `params` | `TensorTree` | Mean of the Normal distribution. | *required* |
| `init_prec_diag` | `TensorTree \| float` | Initial diagonal precision matrix. Can be a tree matching `params` or a scalar. | `0.0` |

Returns:

| Type | Description |
| --- | --- |
| `DiagLaplaceState` | Initial DiagLaplaceState. |

Source code in posteriors/laplace/diag_fisher.py
def init(
    params: TensorTree,
    init_prec_diag: TensorTree | float = 0.0,
) -> DiagLaplaceState:
    """Initialise diagonal Normal distribution over parameters.

    Args:
        params: Mean of the Normal distribution.
        init_prec_diag: Initial diagonal precision matrix.
            Can be tree like params or scalar.

    Returns:
        Initial DiagLaplaceState.
    """
    if is_scalar(init_prec_diag):
        init_prec_diag = tree_map(
            lambda x: torch.full_like(x, init_prec_diag, requires_grad=x.requires_grad),
            params,
        )

    return DiagLaplaceState(params, init_prec_diag)
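The scalar branch above broadcasts a single precision value across the whole parameter tree. A minimal sketch of that behaviour, using torch's pytree `tree_map` and `torch.full_like` as in the source:

```python
import torch
from torch.utils._pytree import tree_map

# A scalar initial precision is expanded to a tensor tree with the same
# structure and shapes as params, one full_like tensor per leaf.
params = {"weight": torch.randn(3, 2), "bias": torch.randn(2)}
prec_diag = tree_map(lambda p: torch.full_like(p, 0.1), params)
```

Passing a full `TensorTree` instead skips this branch, letting you set per-parameter initial precisions (e.g. encoding a prior).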

posteriors.laplace.diag_fisher.update(state, batch, log_posterior, per_sample=False, inplace=False)

Adds the diagonal empirical Fisher information matrix, summed over the given batch, to the diagonal precision of the state.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| `state` | `DiagLaplaceState` | Current state. | *required* |
| `batch` | `Any` | Input data to `log_posterior`. | *required* |
| `log_posterior` | `LogProbFn` | Function that takes parameters and input batch and returns the log posterior value (which can be unnormalised) as well as auxiliary information, e.g. from the model call. | *required* |
| `per_sample` | `bool` | If `True`, `log_posterior` is assumed to return a vector of log posteriors, one per sample in the batch. If `False`, `log_posterior` is assumed to return a scalar log posterior for the whole batch; in this case `torch.func.vmap` will be called, which is typically slower than writing `log_posterior` to be per-sample directly. | `False` |
| `inplace` | `bool` | If `True`, the state is updated in place; otherwise a new state is returned. | `False` |

Returns:

| Type | Description |
| --- | --- |
| `DiagLaplaceState` | Updated DiagLaplaceState. |

Source code in posteriors/laplace/diag_fisher.py
def update(
    state: DiagLaplaceState,
    batch: Any,
    log_posterior: LogProbFn,
    per_sample: bool = False,
    inplace: bool = False,
) -> DiagLaplaceState:
    """Adds diagonal empirical Fisher information matrix of covariance summed over
    given batch.

    Args:
        state: Current state.
        batch: Input data to log_posterior.
        log_posterior: Function that takes parameters and input batch and
            returns the log posterior value (which can be unnormalised)
            as well as auxiliary information, e.g. from the model call.
        per_sample: If True, then log_posterior is assumed to return a vector of
            log posteriors for each sample in the batch. If False, then log_posterior
            is assumed to return a scalar log posterior for the whole batch, in this
            case torch.func.vmap will be called, this is typically slower than
            directly writing log_posterior to be per sample.
        inplace: If True, then the state is updated in place, otherwise a new state
            is returned.

    Returns:
        Updated DiagLaplaceState.
    """
    if not per_sample:
        log_posterior = per_samplify(log_posterior)

    with torch.no_grad(), CatchAuxError():
        jac, aux = jacrev(log_posterior, has_aux=True)(state.params, batch)
        batch_diag_score_sq = tree_map(lambda j: j.square().sum(0), jac)

    def update_func(x, y):
        return x + y

    prec_diag = flexi_tree_map(
        update_func, state.prec_diag, batch_diag_score_sq, inplace=inplace
    )

    if inplace:
        return state._replace(aux=aux)
    return DiagLaplaceState(state.params, prec_diag, aux)
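The core computation of the update step can be sketched in isolation (with a hypothetical per-sample `log_posterior` for a linear-Gaussian model): `jacrev` yields one score vector per sample, and squaring then summing over the batch dimension gives the diagonal Fisher contribution:

```python
import torch
from torch.func import jacrev

# Hypothetical per-sample log posterior: returns one value per batch element.
def log_posterior(params, batch):
    x, y = batch
    return -0.5 * (y - x @ params) ** 2  # shape (batch_size,)

params = torch.zeros(3)
batch = (torch.randn(5, 3), torch.randn(5))

# Jacobian of a (5,)-valued function w.r.t. (3,)-shaped params: shape (5, 3),
# i.e. one score vector per sample.
jac = jacrev(log_posterior)(params, batch)
batch_diag_score_sq = jac.square().sum(0)  # shape (3,)
```

This added quantity only grows as batches accumulate, so the resulting precision tightens with the amount of data seen, as expected for a Laplace approximation.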

posteriors.laplace.diag_fisher.sample(state, sample_shape=torch.Size([]))

Sample from diagonal Normal distribution over parameters.

Parameters:

| Name | Type | Description | Default |
| --- | --- | --- | --- |
| `state` | `DiagLaplaceState` | State encoding mean and diagonal precision. | *required* |
| `sample_shape` | `Size` | Shape of the desired samples. | `Size([])` |

Returns:

| Type | Description |
| --- | --- |
| `TensorTree` | Sample(s) from Normal distribution. |

Source code in posteriors/laplace/diag_fisher.py
def sample(
    state: DiagLaplaceState, sample_shape: torch.Size = torch.Size([])
) -> TensorTree:
    """Sample from diagonal Normal distribution over parameters.

    Args:
        state: State encoding mean and diagonal precision.
        sample_shape: Shape of the desired samples.

    Returns:
        Sample(s) from Normal distribution.
    """
    sd_diag = tree_map(lambda x: x.sqrt().reciprocal(), state.prec_diag)
    return diag_normal_sample(state.params, sd_diag, sample_shape=sample_shape)
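The precision-to-scale conversion above is simple enough to sketch directly: for a diagonal Normal, the standard deviation of each coordinate is one over the square root of its precision, after which sampling reduces to scaling and shifting standard normals (a sketch of the idea, not the `diag_normal_sample` internals):

```python
import torch

mean = torch.zeros(4)
prec_diag = torch.full((4,), 25.0)

# Standard deviation = 1 / sqrt(precision); here every entry is 0.2.
sd_diag = prec_diag.sqrt().reciprocal()

# Three independent draws from N(mean, diag(sd_diag**2)).
samples = mean + sd_diag * torch.randn(3, 4)
```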