Example INSPECT-SR Dataset: EEG-Guided Anaesthesia and Delirium

A simulated eight-trial systematic review used to demonstrate the full INSPECT-SR workflow. The eight trials are hand-crafted to span a range of trustworthiness profiles and to exercise every automated check.

Usage

inspect_sr_example

Format

A tibble with one row per study. Every INSPECT-SR item has its own column so that the expected layout is immediately obvious. Variable-length inputs to the automated Domain 4 checks (Table 1 baselines and reported test statistics) are stored in list-columns.

study: Character. Study identifier (must be unique).
n_randomised_int, n_randomised_ctrl, n_randomised_total, n_analysed_int, n_analysed_ctrl, n_lost_int, n_lost_ctrl: Integer. Participant-flow counts feeding the N-consistency check (4.6).
d1_1..d1_3, d2_1..d2_5, d3_1..d3_2, d4_1..d4_11: Character. Manual judgement for each INSPECT-SR item. Allowed values: "No concerns", "Some concerns", "Serious concerns", NA. The four automated items (d4_3, d4_6, d4_8, d4_9) are left as NA and filled in by inspect_sr().
d1_1_comment..d4_11_comment: Character. Optional free-text comment attached to each judgement.
baseline: List-column. Each element is a data frame of Table 1 variables with columns variable, mean_int, sd_int, mean_ctrl, sd_ctrl, p_value, integer_scale. Feeds GRIM (4.8) and Carlisle (4.3).
statistics: List-column. Each element is a data frame of reported tests with columns test_type, statistic, df, df2, reported_p, context. Feeds the p-value check (4.9).
outcome_estimate, outcome_ci_lower, outcome_ci_upper, outcome_se, outcome_log_scale: Primary outcome (odds ratio for delirium incidence). Values are on the log scale.
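
The list-column layout described above can be sketched in base R. This is only an illustration of the shape of one study row: the packaged dataset is a tibble, the study name and all numbers below are placeholders, and the "chisq" label for test_type and the logical type of integer_scale are assumptions rather than documented values.

```r
# Placeholder Table 1 for one hypothetical study (values are not from the dataset).
baseline_tbl <- data.frame(
  variable      = c("Age", "ASA_score"),
  mean_int      = c(71.2, 2.5),
  sd_int        = c(6.1, 0.6),
  mean_ctrl     = c(70.8, 2.5),
  sd_ctrl       = c(6.4, 0.7),
  p_value       = c(0.74, 0.98),
  integer_scale = c(FALSE, TRUE)   # assumed to be a logical flag
)

# Placeholder reported test; the "chisq" label is an assumed value of test_type.
statistics_tbl <- data.frame(
  test_type  = "chisq",
  statistic  = 1.23,
  df         = 1,
  df2        = NA_real_,
  reported_p = 0.27,
  context    = "Delirium incidence"
)

# One row per study: scalar columns first, then the two list-columns.
study_row <- data.frame(study = "Example (2000)", stringsAsFactors = FALSE)
study_row$n_randomised_int  <- 46L
study_row$n_randomised_ctrl <- 46L
study_row$d4_3              <- NA_character_  # automated item, left as NA
study_row$baseline          <- list(baseline_tbl)
study_row$statistics        <- list(statistics_tbl)

# Each list-column element is itself a data frame.
study_row$baseline[[1]]$variable
```

Only a few of the scalar columns are shown; a complete row would carry every column listed in this section.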

Source

Simulated.

Details

The eight trials, their expected behaviour under the automated checks, and the INSPECT-SR item they illustrate:

Study	Profile	Expected flag
Callahan (1994)	Clean small trial	None
Peek (2005)	Clean large multi-centre	None
Clayton (2010)	Clean	None
Singer (2003)	Clean (large trial)	None
Underwood (2013)	Some concerns (late registration)	Manual D2
Rosa (2008)	Too-perfect baseline balance	Carlisle (4.3)
Doe (1995)	Retracted; GRIM failures on integer scale	GRIM (4.8), manual D1/D2/D3
Kint (1995)	CONSORT arithmetic off	N consistency (4.6)
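
The GRIM flag for Doe (1995) follows from simple arithmetic: on an integer scale, a reported mean must equal some whole-number total divided by n, after rounding. The function below is a simplified sketch of that idea, not the package's implementation; the name grim_consistent and the digits argument (the number of decimals the mean was reported to) are assumptions made for illustration.

```r
# Simplified GRIM-style check: can any integer total reproduce the reported mean?
grim_consistent <- function(mean_reported, n, digits = 2) {
  target <- mean_reported * n
  # Only the integers either side of the implied total can round to the mean.
  candidates <- c(floor(target), ceiling(target))
  any(abs(round(candidates / n, digits) - mean_reported) < 1e-9)
}

# Doe (1995), ASA_score, intervention arm: 220/90 = 2.44 and 221/90 = 2.46.
grim_consistent(2.45, n = 90)

# Callahan (1994), ASA_score: 115/46 = 2.5 exactly.
grim_consistent(2.5, n = 46, digits = 1)
```

The first call returns FALSE and the second TRUE, which agrees with the Fail and Pass results shown for those rows in the per-check table.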

All values are fabricated for didactic purposes. The layout matches the schema expected by inspect_sr(), so the whole review can be assessed in one call.
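
The participant-flow identities behind check 4.6 can be written out directly. The sketch below is not the package's code, and check_n_consistency is a hypothetical helper. The Kint (1995) analysed counts (142 and 138) are back-calculated from the per-check output in the Examples, where Analysed + Lost is 147 and 146 with 5 and 8 participants lost.

```r
# Hypothetical helper: evaluate the five flow identities for one study.
check_n_consistency <- function(n_randomised_total,
                                n_randomised_int, n_randomised_ctrl,
                                n_analysed_int, n_analysed_ctrl,
                                n_lost_int, n_lost_ctrl) {
  c(
    total_matches_arms = n_randomised_total == n_randomised_int + n_randomised_ctrl,
    int_flow_adds_up   = n_randomised_int == n_analysed_int + n_lost_int,
    ctrl_flow_adds_up  = n_randomised_ctrl == n_analysed_ctrl + n_lost_ctrl,
    int_lost_in_range  = n_lost_int <= n_randomised_int,
    ctrl_lost_in_range = n_lost_ctrl <= n_randomised_ctrl
  )
}

# Kint (1995): 150 randomised per arm, but analysed + lost gives 147 and 146.
kint <- check_n_consistency(
  n_randomised_total = 300L,
  n_randomised_int = 150L, n_randomised_ctrl = 150L,
  n_analysed_int = 142L, n_analysed_ctrl = 138L,
  n_lost_int = 5L, n_lost_ctrl = 8L
)
kint
```

Only the two per-arm flow identities come back FALSE, matching the two N-consistency failures reported for this study.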

Examples

data(inspect_sr_example)

# Frequentist assessment
res <- inspect_sr(inspect_sr_example, verbose = FALSE)
res
#> 
#> INSPECT-SR Trustworthiness Assessment
#> ================================================== 
#> 
#> Study                     D1    D2    D3    D4       Overall           
#> ---------------------------------------------------------------------- 
#> Callahan (1994)           OK    OK    OK    OK       No concerns       
#> Peek (2005)               OK    OK    OK    OK       No concerns       
#> Clayton (2010)            OK    OK    OK    OK       No concerns       
#> Singer (2003)             OK    OK    OK    OK       No concerns       
#> Underwood (2013)          OK    SOME  --    OK       Some concerns     
#> Rosa (2008)               OK    SOME  --    SERIOUS  Serious concerns  
#> Doe (1995)                SERIOUS SERIOUS SOME  SERIOUS  Serious concerns  
#> Kint (1995)               --    --    --    SERIOUS  Serious concerns  
#> 
#> Domains: D1 post-publication, D2 conduct/governance,
#>          D3 text/figures, D4 results (auto-filled for 4.3/4.6/4.8/4.9)
#> OK = No concerns, SOME = Some concerns, SERIOUS = Serious concerns
#> -- = Not assessed
#> 
#> For a per-check table, call inspect_sr_table().
#> 

# Per-check table
inspect_sr_table(res)


INSPECT-SR Automated Check Details
Domain 4 per-check results
Study	Check	Item	Detail	Result
Callahan (1994)	Carlisle	Baseline p-value distribution	k = 3, fisher combined p = 0.3788, plausible	Pass
	GRIM	ASA_score (Intervention)	mean = 2.5, n = 46	Pass
		ASA_score (Control)	mean = 2.5, n = 46	Pass
	N-consistency	Total randomised = Intervention + Control	expected = 92, observed = 92	Pass
		Intervention: Randomised = Analysed + Lost	expected = 46, observed = 46	Pass
		Control: Randomised = Analysed + Lost	expected = 46, observed = 46	Pass
		Intervention: Lost <= Randomised	expected = 46, observed = 3	Pass
		Control: Lost <= Randomised	expected = 46, observed = 2	Pass
	P-value	Delirium incidence	reported p = 0.27, recalculated p = 0.2674 (diff 0.002593)	Pass
Clayton (2010)	Carlisle	Baseline p-value distribution	k = 3, fisher combined p = 0.46, plausible	Pass
	N-consistency	Total randomised = Intervention + Control	expected = 202, observed = 202	Pass
		Intervention: Randomised = Analysed + Lost	expected = 101, observed = 101	Pass
		Control: Randomised = Analysed + Lost	expected = 101, observed = 101	Pass
		Intervention: Lost <= Randomised	expected = 101, observed = 3	Pass
		Control: Lost <= Randomised	expected = 101, observed = 5	Pass
	P-value	Delirium incidence	reported p = 0.1, recalculated p = 0.09545 (diff 0.004552)	Pass
Doe (1995)	Carlisle	Baseline p-value distribution	k = 4, fisher combined p = 0.05859, plausible	Pass
	GRIM	ASA_score (Intervention)	mean = 2.45, n = 90	Fail
		ASA_score (Control)	mean = 2.55, n = 90	Fail
		Pain_VAS (Intervention)	mean = 4.75, n = 90	Fail
		Pain_VAS (Control)	mean = 4.65, n = 90	Fail
	N-consistency	Total randomised = Intervention + Control	expected = 180, observed = 180	Pass
		Intervention: Randomised = Analysed + Lost	expected = 90, observed = 90	Pass
		Control: Randomised = Analysed + Lost	expected = 90, observed = 90	Pass
		Intervention: Lost <= Randomised	expected = 90, observed = 2	Pass
		Control: Lost <= Randomised	expected = 90, observed = 3	Pass
	P-value	Delirium incidence	reported p = 0.001, recalculated p = 0.000407 (diff 0.000593)	Pass
		Delirium duration	reported p = 0.002, recalculated p = 0.001582 (diff 0.0004176)	Pass
Kint (1995)	Carlisle	Baseline p-value distribution	k = 3, fisher combined p = 0.4555, plausible	Pass
	GRIM	Duration_surgery_min (Intervention)	mean = 175, n = 150	Pass
		Duration_surgery_min (Control)	mean = 180, n = 150	Pass
	N-consistency	Total randomised = Intervention + Control	expected = 300, observed = 300	Pass
		Intervention: Randomised = Analysed + Lost	expected = 150, observed = 147	Fail
		Control: Randomised = Analysed + Lost	expected = 150, observed = 146	Fail
		Intervention: Lost <= Randomised	expected = 150, observed = 5	Pass
		Control: Lost <= Randomised	expected = 150, observed = 8	Pass
	P-value	Delirium incidence	reported p = 0.08, recalculated p = 0.07734 (diff 0.002663)	Pass
Peek (2005)	Carlisle	Baseline p-value distribution	k = 4, fisher combined p = 0.3872, plausible	Pass
	GRIM	Duration_surgery_min (Intervention)	mean = 185, n = 230	Pass
		Duration_surgery_min (Control)	mean = 190, n = 230	Pass
	N-consistency	Total randomised = Intervention + Control	expected = 460, observed = 460	Pass
		Intervention: Randomised = Analysed + Lost	expected = 230, observed = 230	Pass
		Control: Randomised = Analysed + Lost	expected = 230, observed = 230	Pass
		Intervention: Lost <= Randomised	expected = 230, observed = 9	Pass
		Control: Lost <= Randomised	expected = 230, observed = 12	Pass
	P-value	Delirium incidence	reported p = 0.04, recalculated p = 0.04238 (diff 0.002379)	Pass
		Duration of delirium	reported p = 0.4, recalculated p = 0.3958 (diff 0.004209)	Pass
Rosa (2008)	Carlisle	Baseline p-value distribution	k = 6, fisher combined p = 4.265e-05, too_similar	Fail
	N-consistency	Total randomised = Intervention + Control	expected = 240, observed = 240	Pass
		Intervention: Randomised = Analysed + Lost	expected = 120, observed = 120	Pass
		Control: Randomised = Analysed + Lost	expected = 120, observed = 120	Pass
		Intervention: Lost <= Randomised	expected = 120, observed = 2	Pass
		Control: Lost <= Randomised	expected = 120, observed = 3	Pass
	P-value	Delirium incidence	reported p = 0.005, recalculated p = 0.00511 (diff 0.0001103)	Pass
Singer (2003)	Carlisle	Baseline p-value distribution	k = 3, fisher combined p = 0.4289, plausible	Pass
	N-consistency	Total randomised = Intervention + Control	expected = 1232, observed = 1232	Pass
		Intervention: Randomised = Analysed + Lost	expected = 614, observed = 614	Pass
		Control: Randomised = Analysed + Lost	expected = 618, observed = 618	Pass
		Intervention: Lost <= Randomised	expected = 614, observed = 16	Pass
		Control: Lost <= Randomised	expected = 618, observed = 13	Pass
	P-value	Delirium incidence	reported p = 0.67, recalculated p = 0.6714 (diff 0.001373)	Pass
Underwood (2013)	Carlisle	Baseline p-value distribution	k = 3, fisher combined p = 0.4773, plausible	Pass
	GRIM	Duration_anaesthesia_min (Intervention)	mean = 210, n = 80	Pass
		Duration_anaesthesia_min (Control)	mean = 198, n = 80	Pass
	N-consistency	Total randomised = Intervention + Control	expected = 160, observed = 160	Pass
		Intervention: Randomised = Analysed + Lost	expected = 80, observed = 80	Pass
		Control: Randomised = Analysed + Lost	expected = 80, observed = 80	Pass
		Intervention: Lost <= Randomised	expected = 80, observed = 4	Pass
		Control: Lost <= Randomised	expected = 80, observed = 6	Pass
	P-value	Delirium incidence	reported p = 0.02, recalculated p = 0.02506 (diff 0.005056)	Pass

if (FALSE) { # \dontrun{
# Bayesian assessment
bres <- inspect_sr(inspect_sr_example, bayes = TRUE, verbose = FALSE)
bres
} # }
