General Practice Extraction Service (GPES) Data for Pandemic Planning and Research (GDPPR) Dataset

# define target dataset to document
schema = 'nhsd'
table = 'GDPPR'
version = 'v0003'
# import functions from script helper
import sys
script_fp = "../../../../scripts/"
sys.path.insert(0, script_fp)
from data_doc_helper import DocHelper
# create instance
document = DocHelper(schema, table, version, script_fp)
# markdown/code hybrid cell module requirement
from IPython.display import display, Markdown

1. Summary

display(document.style_table(document.get_dataset_info()))
| Dataset descriptor | Dataset-specific information |
| --- | --- |
| Name of dataset in TRE | GDPPR |
| Other name | None |
| Keywords | COVID-19, Primary care, GP records |
| Short description | Dataset comprises extracts from General Practices in England that have opted into contributing to the GDPPR dataset. Included in the extracts are all patients currently registered with a GP or with a date of death on or after 1 November 2019 whose health record contains coded information relevant to pandemic planning and research. |
| Owner | NHS England |
| Geographical coverage | England |
| Temporal coverage | 01/06/2020 onwards |
| TRE temporal coverage | 1940s onwards |
| Frequency of update | Annually |
| Date of last extract | None |
| DOI | None |
| Data resolution | None |
| Number of variables | None |
| Number of participants | None |
| Number of observations | None |
| Version | None |
| Key link | https://digital.nhs.uk/coronavirus/gpes-data-for-pandemic-planning-and-research/guide-for-analysts-and-users-of-the-data |
| Specific restrictions to data use | 1. Research must be related to COVID-19. For examples of research purposes acceptable to NHS England see the COVID-19 Public Health Directions 2020: https://digital.nhs.uk/about-nhs-digital/corporate-information-and-documents/directions-and-data-provision-notices/secretary-of-state-directions/covid-19-public-health-directions-2020 2. GDPPR data must not be used for any form of performance management of General Practices. |

2. Metrics

The tables below summarise the GDPPR dataset in the UK LLC TRE.

Table 1: The number of participants from each Longitudinal Population Study (LPS) represented in the GDPPR dataset in the UK LLC TRE.
(Note: numbers relate to the most recent extract of NHS England data.)

display(document.style_table(document.get_cohort_count()))
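| LPS | Participant count |
| --- | --- |
| ALSPAC | 5886 |
| BCS70 | 5789 |
| BIB | 27323 |
| ELSA | 6779 |
| EPICN | 14119 |
| EXCEED | 9417 |
| FENLAND | 10105 |
| GLAD | 64692 |
| MCS | 17540 |
| NCDS58 | 5924 |
| NEXTSTEP | 5169 |
| NIHRBIO_COPING | 16063 |
| NSHD46 | 2280 |
| TEDS | 8038 |
| TRACKC19 | 13738 |
| TWINSUK | 11711 |
| UKHLS | 6729 |
| TOTAL | 231302 |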
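
As a quick sanity check on Table 1, the per-LPS counts can be summed and compared with the TOTAL row. The following is a minimal sketch only: the counts are transcribed by hand from Table 1, and the names `lps_counts`, `reported_total` and `computed_total` are illustrative assumptions, not part of `DocHelper` or any UK LLC tooling.

```python
# Participant counts transcribed by hand from Table 1
# (most recent extract of NHS England data).
# The variable names here are illustrative, not part of DocHelper.
lps_counts = {
    "ALSPAC": 5886,
    "BCS70": 5789,
    "BIB": 27323,
    "ELSA": 6779,
    "EPICN": 14119,
    "EXCEED": 9417,
    "FENLAND": 10105,
    "GLAD": 64692,
    "MCS": 17540,
    "NCDS58": 5924,
    "NEXTSTEP": 5169,
    "NIHRBIO_COPING": 16063,
    "NSHD46": 2280,
    "TEDS": 8038,
    "TRACKC19": 13738,
    "TWINSUK": 11711,
    "UKHLS": 6729,
}
reported_total = 231302  # TOTAL row of Table 1

# Sum the per-LPS counts and compare with the reported total.
computed_total = sum(lps_counts.values())
totals_match = computed_total == reported_total

# Identify the LPS contributing the most participants and its share.
largest_lps = max(lps_counts, key=lps_counts.get)
largest_share = lps_counts[largest_lps] / computed_total

print(f"Computed total: {computed_total}")
print(f"Matches reported TOTAL: {totals_match}")
print(f"Largest contributor: {largest_lps} ({largest_share:.1%})")
```

In the TRE itself the same check could be run against the output of `document.get_cohort_count()` rather than hand-transcribed values, but the structure of that return value is not shown here, so this sketch does not assume it.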

3. Helpful syntax

This section will hold syntax that may be helpful to other researchers working in the UK LLC TRE. Longer scripts will appear as a short snippet, with a link to the Git repository that holds the full script.