Convergence Diagnostics

This page documents the C++ implementation of convergence diagnostics in bgms. The ESS and R-hat computations are implemented in C++ for speed, enabling fast diagnostics even with long chains and many parameters.

Overview

After sampling, bgms computes convergence diagnostics for all parameters:

Effective sample size (ESS) — equivalent number of independent draws
R-hat — between-chain vs within-chain variance ratio
Monte Carlo standard error (MCSE) — precision of the posterior mean estimate

The C++ implementation matches coda::effectiveSize for ESS and coda::gelman.diag for R-hat, but runs in parallel across parameters for speed.

C++ implementation

The diagnostic computations are in src/mcmc_diagnostics.cpp. The main entry points are:

ESS via AR spectral density

// [[Rcpp::export(.compute_ess_cpp)]]
Rcpp::NumericVector compute_ess_cpp(Rcpp::NumericVector array3d);

Computes ESS for a 3D array [niter x nchains x nparam]. Multi-chain ESS is the sum of per-chain ESS values.

The per-chain algorithm follows Plummer et al. (2006):

Compute biased autocovariance \(c[0..L]\) where \(L = \lfloor 10 \log_{10}(n) \rfloor\)
Levinson-Durbin recursion for AR orders \(1..L\)
AIC order selection: \(\arg\min_k \{ n \log(v_k) + 2k \}\)
Spectral density at 0: \(\text{spec}_0 = v_{\text{best}} / (1 - \sum a_{\text{best}})^2\)
ESS = \(n \cdot \text{var}_{\text{unbiased}} / \text{spec}_0\)

This matches coda::spectrum0.ar + coda::effectiveSize.

Gelman-Rubin R-hat

// [[Rcpp::export(.compute_rhat_cpp)]]
Rcpp::NumericVector compute_rhat_cpp(Rcpp::NumericVector array3d);

Computes R-hat for a 3D array [niter x nchains x nparam]. Returns NA for single-chain input.

The algorithm follows Gelman & Rubin (1992) with the degrees-of-freedom adjustment from coda::gelman.diag:

Compute per-chain means \(\bar{x}_c\) and variances \(s^2_c\)
Within-chain variance: \(W = \frac{1}{m} \sum_c s^2_c\)
Between-chain variance: \(B = \frac{n}{m-1} \sum_c (\bar{x}_c - \bar{x})^2\)
Pooled variance estimate: \(\hat{V} = \frac{n-1}{n} W + \frac{1}{n} B\)
Degrees-of-freedom adjustment via estimated variance of \(\hat{V}\)
R-hat = \(\sqrt{\frac{df + 3}{df + 1} \cdot \hat{R}^2}\)

Mixture ESS for indicators

// [[Rcpp::export(.compute_indicator_ess_cpp)]]
Rcpp::NumericMatrix compute_indicator_ess_cpp(Rcpp::NumericVector array3d);

Computes ESS for binary indicator sequences using the two-state Markov chain approach:

Counts transitions: \(n_{01}\) (off→on), \(n_{10}\) (on→off), \(n_{00}\), \(n_{11}\)
Estimates transition probabilities: \(\hat{a} = n_{01} / (n_{00} + n_{01})\), \(\hat{b} = n_{10} / (n_{10} + n_{11})\)
Returns ESS = \(T \cdot (a + b) / (2 - a - b)\)

This formula derives from the integrated autocorrelation time of a two-state Markov chain where \(\rho(k) = (1-a-b)^k\) (van den Bergh et al., 2026).

When the total number of transitions (\(n_{01} + n_{10}\)) is below 5, the estimate is unreliable and flagged in the output.

Parallelization

Diagnostics are computed in parallel across parameters using RcppParallel (TBB). Each parameter’s ESS and R-hat can be computed independently:

struct ESSWorker : public RcppParallel::Worker {
  void operator()(std::size_t begin, std::size_t end) {
    for(std::size_t j = begin; j < end; j++) {
      double total_ess = 0.0;
      for(int c = 0; c < nchains; c++) {
        const double* col = data + c * niter + j * niter * nchains;
        total_ess += compute_column_ess(col, niter, max_order);
      }
      ess[j] = total_ess;
    }
  }
};

R interface

The diagnostics are exposed to R through: - .compute_ess_cpp() — batch ESS for all parameters - .compute_rhat_cpp() — batch R-hat for all parameters - .compute_indicator_ess_cpp() — batch indicator ESS with transition counts

These are called from R/mcmc_summary.R which assembles the summary tables.

User-facing extractors: - extract_ess(fit) — returns ESS for all parameters - extract_rhat(fit) — returns R-hat for all parameters - summary(fit) — includes ESS, R-hat, and MCSE in tabular output

References

Plummer et al. (2006) — AR spectral density ESS (coda package)
Gelman & Rubin (1992) — original R-hat formulation
van den Bergh et al. (2026) — mixture ESS for spike-and-slab models

References

Gelman, A., & Rubin, D. B. (1992). Inference from iterative simulation using multiple sequences. Statistical Science, 7(4), 457–472. https://doi.org/10.1214/ss/1177011136

Plummer, M., Best, N., Cowles, K., & Vines, K. (2006). CODA: Convergence diagnosis and output analysis for MCMC. R News, 6(1), 7–11. https://journal.r-project.org/archive/2006-1/RNews_2006-1.pdf

van den Bergh, D., Clyde, M. A., Raftery, A. E., & Marsman, M. (2026). Reversible jump MCMC with no regrets: Bayesian variable selection using mixtures of mutually singular distributions. Manuscript in Preparation.