Skip to contents

A simulated eight-trial systematic review used to demonstrate the full INSPECT-SR workflow. The eight trials are hand-crafted to span a range of trustworthiness profiles and to exercise every automated check.

Usage

inspect_sr_example

Format

A tibble with one row per study. Every INSPECT-SR item has its own column so that the expected layout is immediately obvious. Variable-length inputs to the automated Domain 4 checks (Table 1 baselines and reported test statistics) are stored in list-columns.

study

Character. Study identifier (must be unique).

n_randomised_int, n_randomised_ctrl, n_randomised_total, n_analysed_int, n_analysed_ctrl, n_lost_int, n_lost_ctrl

Integer participant-flow columns feeding check 4.6.

d1_1..d1_3, d2_1..d2_5, d3_1..d3_2, d4_1..d4_11

Character. Manual judgement for each INSPECT-SR item. Allowed values: "No concerns", "Some concerns", "Serious concerns", NA. The four automated items (d4_3, d4_6, d4_8, d4_9) are left as NA and filled in by inspect_sr().

d1_1_comment..d4_11_comment

Character. Optional free-text comment attached to each judgement.

baseline

List-column. Each element is a data frame of Table 1 variables with columns variable, mean_int, sd_int, mean_ctrl, sd_ctrl, p_value, integer_scale. Feeds GRIM (4.8) and Carlisle (4.3).

statistics

List-column. Each element is a data frame of reported tests with columns test_type, statistic, df, df2, reported_p, context. Feeds the p-value check (4.9).

outcome_estimate, outcome_ci_lower, outcome_ci_upper, outcome_se, outcome_log_scale

Primary outcome (odds ratio for delirium incidence) — values are on the log scale.

Source

Simulated.

Details

The eight trials, their expected behaviour under the automated checks, and the INSPECT-SR item they illustrate:

StudyProfileExpected flag
Callahan (1994)Clean small trialNone
Peek (2005)Clean large multi-centreNone
Clayton (2010)CleanNone
Singer (2003)Clean (large trial)None
Underwood (2013)Some concerns (late registration)Manual D2
Rosa (2008)Too-perfect baseline balanceCarlisle (4.3)
Doe (1995)Retracted; GRIM failures on integer scaleGRIM (4.8), manual D1/D2/D3
Kint (1995)CONSORT arithmetic offN consistency (4.6)

All values are fabricated for didactic purposes. The layout matches the schema expected by inspect_sr(), so the whole review can be assessed in one call.

Examples

data(inspect_sr_example)

# Frequentist assessment
res <- inspect_sr(inspect_sr_example, verbose = FALSE)
res
#> 
#> INSPECT-SR Trustworthiness Assessment
#> ================================================== 
#> 
#> Study                     D1    D2    D3    D4       Overall           
#> ---------------------------------------------------------------------- 
#> Callahan (1994)           OK    OK    OK    OK       No concerns       
#> Peek (2005)               OK    OK    OK    OK       No concerns       
#> Clayton (2010)            OK    OK    OK    OK       No concerns       
#> Singer (2003)             OK    OK    OK    OK       No concerns       
#> Underwood (2013)          OK    SOME  --    OK       Some concerns     
#> Rosa (2008)               OK    SOME  --    SERIOUS  Serious concerns  
#> Doe (1995)                SERIOUS SERIOUS SOME  SERIOUS  Serious concerns  
#> Kint (1995)               --    --    --    SERIOUS  Serious concerns  
#> 
#> Domains: D1 post-publication, D2 conduct/governance,
#>          D3 text/figures, D4 results (auto-filled for 4.3/4.6/4.8/4.9)
#> OK = No concerns, SOME = Some concerns, SERIOUS = Serious concerns
#> -- = Not assessed
#> 
#> For a per-check table, call inspect_sr_table().
#> 

# Per-check table
inspect_sr_table(res)
INSPECT-SR Automated Check Details
Domain 4 per-check results
Check Item Detail Result
Callahan (1994) Carlisle Baseline p-value distribution k = 3, fisher combined p = 0.3788, plausible Pass
GRIM ASA_score (Intervention) mean = 2.5, n = 46 Pass
ASA_score (Control) mean = 2.5, n = 46 Pass
N-consistency Total randomised = Intervention + Control expected = 92, observed = 92 Pass
Intervention: Randomised = Analysed + Lost expected = 46, observed = 46 Pass
Control: Randomised = Analysed + Lost expected = 46, observed = 46 Pass
Intervention: Lost <= Randomised expected = 46, observed = 3 Pass
Control: Lost <= Randomised expected = 46, observed = 2 Pass
P-value Delirium incidence reported p = 0.27, recalculated p = 0.2674 (diff 0.002593) Pass
Clayton (2010) Carlisle Baseline p-value distribution k = 3, fisher combined p = 0.46, plausible Pass
N-consistency Total randomised = Intervention + Control expected = 202, observed = 202 Pass
Intervention: Randomised = Analysed + Lost expected = 101, observed = 101 Pass
Control: Randomised = Analysed + Lost expected = 101, observed = 101 Pass
Intervention: Lost <= Randomised expected = 101, observed = 3 Pass
Control: Lost <= Randomised expected = 101, observed = 5 Pass
P-value Delirium incidence reported p = 0.1, recalculated p = 0.09545 (diff 0.004552) Pass
Doe (1995) Carlisle Baseline p-value distribution k = 4, fisher combined p = 0.05859, plausible Pass
GRIM ASA_score (Intervention) mean = 2.45, n = 90 Fail
ASA_score (Control) mean = 2.55, n = 90 Fail
Pain_VAS (Intervention) mean = 4.75, n = 90 Fail
Pain_VAS (Control) mean = 4.65, n = 90 Fail
N-consistency Total randomised = Intervention + Control expected = 180, observed = 180 Pass
Intervention: Randomised = Analysed + Lost expected = 90, observed = 90 Pass
Control: Randomised = Analysed + Lost expected = 90, observed = 90 Pass
Intervention: Lost <= Randomised expected = 90, observed = 2 Pass
Control: Lost <= Randomised expected = 90, observed = 3 Pass
P-value Delirium incidence reported p = 0.001, recalculated p = 0.000407 (diff 0.000593) Pass
Delirium duration reported p = 0.002, recalculated p = 0.001582 (diff 0.0004176) Pass
Kint (1995) Carlisle Baseline p-value distribution k = 3, fisher combined p = 0.4555, plausible Pass
GRIM Duration_surgery_min (Intervention) mean = 175, n = 150 Pass
Duration_surgery_min (Control) mean = 180, n = 150 Pass
N-consistency Total randomised = Intervention + Control expected = 300, observed = 300 Pass
Intervention: Randomised = Analysed + Lost expected = 150, observed = 147 Fail
Control: Randomised = Analysed + Lost expected = 150, observed = 146 Fail
Intervention: Lost <= Randomised expected = 150, observed = 5 Pass
Control: Lost <= Randomised expected = 150, observed = 8 Pass
P-value Delirium incidence reported p = 0.08, recalculated p = 0.07734 (diff 0.002663) Pass
Peek (2005) Carlisle Baseline p-value distribution k = 4, fisher combined p = 0.3872, plausible Pass
GRIM Duration_surgery_min (Intervention) mean = 185, n = 230 Pass
Duration_surgery_min (Control) mean = 190, n = 230 Pass
N-consistency Total randomised = Intervention + Control expected = 460, observed = 460 Pass
Intervention: Randomised = Analysed + Lost expected = 230, observed = 230 Pass
Control: Randomised = Analysed + Lost expected = 230, observed = 230 Pass
Intervention: Lost <= Randomised expected = 230, observed = 9 Pass
Control: Lost <= Randomised expected = 230, observed = 12 Pass
P-value Delirium incidence reported p = 0.04, recalculated p = 0.04238 (diff 0.002379) Pass
Duration of delirium reported p = 0.4, recalculated p = 0.3958 (diff 0.004209) Pass
Rosa (2008) Carlisle Baseline p-value distribution k = 6, fisher combined p = 4.265e-05, too_similar Fail
N-consistency Total randomised = Intervention + Control expected = 240, observed = 240 Pass
Intervention: Randomised = Analysed + Lost expected = 120, observed = 120 Pass
Control: Randomised = Analysed + Lost expected = 120, observed = 120 Pass
Intervention: Lost <= Randomised expected = 120, observed = 2 Pass
Control: Lost <= Randomised expected = 120, observed = 3 Pass
P-value Delirium incidence reported p = 0.005, recalculated p = 0.00511 (diff 0.0001103) Pass
Singer (2003) Carlisle Baseline p-value distribution k = 3, fisher combined p = 0.4289, plausible Pass
N-consistency Total randomised = Intervention + Control expected = 1232, observed = 1232 Pass
Intervention: Randomised = Analysed + Lost expected = 614, observed = 614 Pass
Control: Randomised = Analysed + Lost expected = 618, observed = 618 Pass
Intervention: Lost <= Randomised expected = 614, observed = 16 Pass
Control: Lost <= Randomised expected = 618, observed = 13 Pass
P-value Delirium incidence reported p = 0.67, recalculated p = 0.6714 (diff 0.001373) Pass
Underwood (2013) Carlisle Baseline p-value distribution k = 3, fisher combined p = 0.4773, plausible Pass
GRIM Duration_anaesthesia_min (Intervention) mean = 210, n = 80 Pass
Duration_anaesthesia_min (Control) mean = 198, n = 80 Pass
N-consistency Total randomised = Intervention + Control expected = 160, observed = 160 Pass
Intervention: Randomised = Analysed + Lost expected = 80, observed = 80 Pass
Control: Randomised = Analysed + Lost expected = 80, observed = 80 Pass
Intervention: Lost <= Randomised expected = 80, observed = 4 Pass
Control: Lost <= Randomised expected = 80, observed = 6 Pass
P-value Delirium incidence reported p = 0.02, recalculated p = 0.02506 (diff 0.005056) Pass
if (FALSE) { # \dontrun{ # Bayesian assessment bres <- inspect_sr(inspect_sr_example, bayes = TRUE, verbose = FALSE) bres } # }