This R package provides two datasets of ClinicalTrials.gov NCT
numbers corresponding to clinical trials that were "stopped" (had
their overall status changed to "Terminated", "Suspended" or
"Withdrawn"). The `c19stoppedtrials` dataset contains NCT numbers for
all trials that stopped between 2019-12-01 (the month of the first
human cases of SARS-CoV-2) and 2022-11-30 (three-year data
cutoff). In December 2023, trials that stopped during the pandemic
were checked for whether they had started again; checking at that
point allows at least one year of follow-up for the `restart_date`
and `restart_status` columns. The `comparator` dataset contains NCT
numbers for trials that stopped in the three years before the
`c19stoppedtrials` window (2016-12-01 to 2019-11-30).
The `c19stoppedtrials` dataset indicates the date that a trial was
stopped, whether it was started again and on what date, and the
contents of the "why stopped?" field on the date the trial
stopped. This dataset also includes columns with manually coded data
for whether the "why stopped?" field explicitly indicates that the
reason for stopping included the SARS-CoV-2 pandemic.
Manually extracted data columns were single-coded by Dr Benjamin Gregory Carlisle. To ensure data quality, a random sample of 100 trials was triple-coded: two other independent raters also coded each trial in the sample. Light's kappa across the three sets of ratings was 1, indicating perfect agreement.
The `comparator` dataset, which covers trials stopped in the three
years prior, includes all the same columns as the `c19stoppedtrials`
dataset, except for the manually coded fields indicating whether the
trial stopped due to the SARS-CoV-2 pandemic.
This package is not available on CRAN and must be installed from GitHub:
install.packages("devtools")
library(devtools)
install_github("bgcarlisle/ctcovidstop")
After installation, the package and both data sets can be loaded as follows:
library(ctcovidstop)
data(c19stoppedtrials)
data(comparator)
This package provides two data frames, `c19stoppedtrials` and
`comparator`, which can be loaded via the R package with
`data(c19stoppedtrials)` and `data(comparator)`, respectively. The
same data frames are also provided as CSV files in this repository as
`inst/extdata/c19stoppedtrials.csv` and `inst/extdata/comparator.csv`.
`c19stoppedtrials` contains 13,323 rows and 8 columns. See below for
example rows:

nctid | stop_date | stop_status | restart_date | restart_status | why_stopped | covid19_explicit | restart_expected |
---|---|---|---|---|---|---|---|
NCT04007003 | 2019-12-02 | Terminated | NA | NA | Sponsor decision | FALSE | NA |
NCT03693833 | 2020-03-16 | Suspended | 2020-06-15 | Recruiting | COVID-19 | TRUE | FALSE |
NCT04161976 | 2020-04-20 | Suspended | 2020-06-03 | Recruiting | Enrollment on hold due to COVID-19 pandemic. | TRUE | TRUE |
Each row in this data frame contains an NCT Number from
ClinicalTrials.gov (`nctid` column) and the date on which the
corresponding clinical trial record's overall status was first changed
to "Terminated", "Suspended" or "Withdrawn" from any other overall
status between 2019-12-01 and 2022-11-30, inclusive (`stop_date`
column). The status that the trial was changed to on that date is
indicated in the `stop_status` column.
A trial is included only if its overall status changed to "Terminated", "Suspended" or "Withdrawn" from any other overall status between 2019-12-01 and 2022-11-30 (inclusive). If a trial's overall status was already "Terminated", "Suspended" or "Withdrawn" before 2019-12-01, and the trial did not subsequently become active and then stop again on or after 2019-12-01, it is not included, even if the "why stopped?" field was later updated to include a reference to Covid-19 (e.g. NCT03365921).
If the trial started again (overall status changed from "Terminated",
"Suspended" or "Withdrawn" to any other overall status) after being
"stopped" according to the definition above by the date that this data
set was last updated, the date that the trial restarted is recorded
under `restart_date`; otherwise this column contains NA. The status
that the stopped trial was changed to is indicated in the
`restart_status` column.
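
As a minimal sketch of how the `stop_date` and `restart_date` columns can be used together, the snippet below computes how long each trial stayed stopped. The data frame is built by hand from the example rows in this README so that the snippet runs without the package installed. The column names `restarted` and `days_stopped` are illustrative and not part of the dataset. It assumes the dates are ISO 8601 strings or `Date` values, both of which `as.Date()` accepts.

```r
# Toy rows copied from the example table in this README; with the package
# installed, data(c19stoppedtrials) would supply the full data frame instead.
trials <- data.frame(
  nctid = c("NCT04007003", "NCT03693833", "NCT04161976"),
  stop_date = c("2019-12-02", "2020-03-16", "2020-04-20"),
  restart_date = c(NA, "2020-06-15", "2020-06-03"),
  stringsAsFactors = FALSE
)

# A trial counts as restarted when restart_date is not NA.
trials$restarted <- !is.na(trials$restart_date)

# Days between the first stop and the restart (NA if the trial never restarted).
trials$days_stopped <- as.numeric(
  as.Date(trials$restart_date) - as.Date(trials$stop_date)
)

print(trials[, c("nctid", "restarted", "days_stopped")])
```

On these three rows, the first trial has `NA` for `days_stopped`, and the other two were stopped for 91 and 44 days respectively.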
The reason that the trial was stopped, as reported on the first
stopped historical version of the clinical trial registry entry on
ClinicalTrials.gov, is recorded in the `why_stopped` field. If no
reason is given, this column contains NA.
If `why_stopped` cites Covid-19 explicitly as a reason why the trial
was stopped, the `covid19_explicit` column is TRUE, otherwise
FALSE. In the case that there is no value for `why_stopped`,
`covid19_explicit` is FALSE. This data point was manually rated by
BGC.
Trials that cite waning levels of Covid-19 infections, etc. as their
rationale for stopping were not considered to be stopped because of
Covid-19, and so `covid19_explicit` is FALSE for these trials
(e.g. NCT04390191).
The `stop_date`, `stop_status` and `why_stopped` columns reflect only
the first time the trial was stopped after 2019-12-01. In cases where a
trial stops after 2019-12-01 without citing Covid-19 in the
`why_stopped` field, starts again, and then stops a second time,
citing Covid-19 as a reason why the study stopped
(e.g. NCT03728504), the trial's `covid19_explicit` column is FALSE.
In cases where a trial was stopped with no rationale reported in the
version of the trial history where it was stopped, and the
`why_stopped` field was updated in a later version to include a
rationale, the updated rationale is not included in this data set.
If `covid19_explicit` is FALSE, `restart_expected` is NA. If
`covid19_explicit` is TRUE and `why_stopped` also states an
expectation that the trial will start again, `restart_expected`
is TRUE, otherwise FALSE. Trials that mention the study is "on hold"
or "expected to resume" or that the stop was "temporary", etc. were
rated TRUE. This data point was manually rated by BGC.
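
The coding rules above imply a few invariants between `why_stopped`, `covid19_explicit` and `restart_expected`. The sketch below checks them on a toy data frame built from the example rows in this README. The object names `rule_na`, `rule_missing` and `share_expected` are illustrative only, and the same checks could be run on the full `c19stoppedtrials` data frame after `data(c19stoppedtrials)`.

```r
# Toy rows copied from the example table in this README.
trials <- data.frame(
  nctid = c("NCT04007003", "NCT03693833", "NCT04161976"),
  why_stopped = c("Sponsor decision", "COVID-19",
                  "Enrollment on hold due to COVID-19 pandemic."),
  covid19_explicit = c(FALSE, TRUE, TRUE),
  restart_expected = c(NA, FALSE, TRUE),
  stringsAsFactors = FALSE
)

# Rule 1: restart_expected is NA exactly when covid19_explicit is FALSE.
rule_na <- is.na(trials$restart_expected) == (!trials$covid19_explicit)

# Rule 2: a missing why_stopped implies covid19_explicit is FALSE.
rule_missing <- !is.na(trials$why_stopped) | !trials$covid19_explicit

# Stop with an error if any row violates either rule.
stopifnot(all(rule_na), all(rule_missing))

# Share of explicitly Covid-19-related stops that stated an expectation
# to restart (computed only over rows where covid19_explicit is TRUE).
share_expected <- mean(trials$restart_expected[trials$covid19_explicit])
print(share_expected)
```

On the toy rows, one of the two Covid-19-related stops states an expectation to restart, so `share_expected` is 0.5.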
`comparator` contains 9,665 rows and 6 columns. See below for example
rows:

nctid | stop_date | stop_status | restart_date | restart_status | why_stopped |
---|---|---|---|---|---|
NCT02464891 | 2017-03-02 | Terminated | NA | NA | NA |
NCT02424955 | 2017-05-10 | Suspended | 2017-06-29 | Recruiting | Logistics |
NCT02937402 | 2019-09-05 | Terminated | NA | NA | Slow accrual |
Each row in this data frame contains an NCT Number from
ClinicalTrials.gov (`nctid` column) and the date on which the
corresponding clinical trial record's overall status was first changed
to "Terminated", "Suspended" or "Withdrawn" from any other overall
status between 2016-12-01 and 2019-11-30, inclusive (`stop_date`
column). The status that the trial was changed to on that date is
indicated in the `stop_status` column.
The definitions for the columns are identical to those for the
`c19stoppedtrials` table, above.
The `covid19_explicit` and `restart_expected` columns are not included
in the `comparator` dataset.
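
Because the two data frames share all columns except the manually coded ones, they can be stacked on their common columns for a side-by-side comparison. The following is a minimal sketch using toy rows from the example tables in this README. The `period` label and the object names `c19`, `comp`, `shared` and `combined` are assumptions made for illustration, not part of the package.

```r
# Toy subsets of the two example tables in this README (fewer columns for brevity).
c19 <- data.frame(
  nctid = c("NCT04007003", "NCT03693833"),
  stop_date = c("2019-12-02", "2020-03-16"),
  stop_status = c("Terminated", "Suspended"),
  covid19_explicit = c(FALSE, TRUE),
  stringsAsFactors = FALSE
)
comp <- data.frame(
  nctid = c("NCT02464891", "NCT02424955"),
  stop_date = c("2017-03-02", "2017-05-10"),
  stop_status = c("Terminated", "Suspended"),
  stringsAsFactors = FALSE
)

# Columns present in both data frames (drops the manually coded columns).
shared <- intersect(names(c19), names(comp))

# Label each row with the period it came from, then stack the rows.
c19$period <- "pandemic"
comp$period <- "comparator"
combined <- rbind(c19[, c(shared, "period")], comp[, c(shared, "period")])

# Cross-tabulate stop status by period.
print(table(combined$period, combined$stop_status))
```

With the package installed, replacing the toy data frames with `c19stoppedtrials` and `comparator` should give the same kind of table for the full data.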
These data are provided under a Creative Commons by-attribution licence.
Here is a BibTeX entry for `ctcovidstop`:
@Manual{ctcovidstop-carlisle,
Title = {ctcovidstop},
Author = {Carlisle, Benjamin Gregory},
Organization = {The Grey Literature},
Address = {Montreal, Canada},
url = {https://github.com/bgcarlisle/ctcovidstop},
year = 2022
}
If you use this data set and you found it useful, I would take it as a kindness if you cited it.
Best,
Benjamin Gregory Carlisle PhD