Model description
The skew-normal random-effects model allows the distribution of true study effects to be asymmetric. This is appropriate when effects are bounded below (or above) by a natural floor or ceiling, or when the literature is characterised by many small effects and a long tail of large effects in one direction.
The skew-normal distribution generalises the Gaussian with a shape parameter . Positive produces right skew; negative produces left skew; recovers the Gaussian.
Mathematical specification
Likelihood:
Random effects:
where is the location, is the scale, and is the shape. The mean and variance of the skew-normal are:
Priors:
Stan code
data {
int<lower=1> N;
int<lower=1> K;
vector[N] y;
vector<lower=0>[N] se;
array[N] int<lower=1> study;
}
parameters {
real xi;
real<lower=0> omega;
real alpha_sk;
vector[K] theta_raw;
}
transformed parameters {
vector[K] theta;
{
real delta = alpha_sk / sqrt(1 + square(alpha_sk));
real sigma1 = omega * sqrt(1 - square(delta));
vector[K] mu_sn = xi + omega * delta * abs(theta_raw);
theta = mu_sn + sigma1 * theta_raw;
}
}
model {
target += normal_lpdf(xi | 0, 1);
target += cauchy_lpdf(omega | 0, 0.5);
target += normal_lpdf(alpha_sk | 0, 1);
target += std_normal_lpdf(theta_raw);
target += skew_normal_lpdf(theta | xi, omega, alpha_sk);
target += normal_lpdf(y | theta[study], se);
}
generated quantities {
real b_Intercept = xi + omega * (alpha_sk / sqrt(1 + square(alpha_sk))) * sqrt(2.0 / pi());
}How bayesma calls this model
Selected by model_type = "random_effect" with re_dist = "skew_normal".
b_Intercept is set to the mean of the skew-normal distribution (not the location ), so that the reported pooled effect is comparable to the Gaussian and Student- models.
Parameterisation notes
Stan’s built-in skew_normal_lpdf is used for efficiency. The non-centred parameterisation for the skew-normal requires generating truncated normal auxiliary variables; the current implementation uses the direct skew-normal log-density instead, which is well-calibrated for moderate .
When is large (above 5), the skew-normal is highly asymmetric and MCMC can be slow. In practice, meaningful skewness is captured by .
Identifiability
The shape parameter is identified only when is sufficient to observe the distributional tail. Simulation studies suggest for reliable estimation. With smaller , set to a fixed value or use the Gaussian model.
Known sampling difficulties
The posterior for and can be multimodal when is small, because the data are consistent with both (wide symmetric) and (narrow skewed). Multiple chains and trace plot inspection are essential.
