NHS England hospital datasets#

Last modified: 01 Aug 2025

The hospital datasets cover all NHS funded hospital activity in England. This includes inpatient, outpatient and accident & emergency.

Summary:

../../../../_images/Hospital_datasets.jpg

Figure 1 The seven NHS England hospital datasets

For further details see the NHS England monthly provisional and annual reports, HES processing cycle and data quality checks and Processing cycle and ECDS data quality webpages.

Overview of hospital datasets#

For further information see the individual dataset guides.

Table 1 Names, geographical coverage and availability of hospital datasets in the UK LLC TRE (datasets in italics are retired)

Dataset

Dataset name

Grouping

Coverage

Data available in TRE

Data owner

HESOP

Hospital Episode Statistics (HES) Outpatients

Hospital

England

04/2003 onwards

NHSE

HESAPC

HES Admitted Patient Care (APC)

Hospital

England

04/1998 onwards

NHSE

HESAPC_ACP

HES APC Augmented Care Periods

Hospital

England

1997 to 2006

NHSE

HESAPC_MAT

HES APC Maternity

Hospital

England

2000 onwards

ONS

HESCC

HES Critical Care

Hospital

England

04/2008 onwards

NHSE

HESAE

HES Accident & Emergency

Hospital

England

04/2007 to 12/2020

NHSE

ECDS

Emergency Care Data Set

Hospital

England

10/2017 onwards

NHSE

Purpose of data collection#

The datasets were set up for the management and planning of hospital services and for the financial reimbursement to hospitals for the care they have provided to patients. Secondary uses of the hospital datasets, including research, are accommodated within the data collection system as described in the next section.

Processing cycle and frequency of data collection#

../../../../_images/Hospital_processing_Figure2.jpg

Figure 2 The compilation of the hospital datasets via NHS England’s Secondary Uses Service (SUS)

Stage 1: Collection of the Commissioning Data Sets (CDS)#

The NHS Standard Contract requires all providers of NHS hospital care in England to collect clinical and administrative information as part of the Commissioning Data Sets (CDS)(). These data are used by regional commissioners of healthcare for NHS patients – Integrated Care Boards (ICBs) – for payment and monitoring purposes.

Stage 2: Submission of the CDS to the Secondary Uses Service (SUS)#

NHS secondary care providers electronically submit their CDS to the Secondary Uses Service (SUS), a national data warehouse located within NHS England. Data from the SUS are made available to the ICBs who use the data to pay hospitals for the care they delivered. These same data can also be processed and used for non-clinical purposes, such as research. Data for these purposes are stored in the SUS Secure Data Warehouse. \

Stage 3: Extraction of hospital datasets from the SUS Secure Data Warehouse#

On a monthly basis, NHS England takes a provisional extract from the SUS Secure Data Warehouse and carries out basic data checks and cleaning, adds geographical fields and attaches pseudonymised patient identifiers (Person_ID) to each episode of care. Each extract is cumulative and contains data submitted for the financial year so far, i.e. month 6 will contain data from April to September. At the end of each financial year, providers have the opportunity to revise and update their submissions for the year via the ‘Annual Refresh’. The finalised hospital datasets are published and made available for research purposes and to NHS England statisticians around September each year.

Equivalent data held across the UK#

Equivalent information about hospital activity across the UK is recorded in the following datasets:

  • Northern Ireland: Hospital Inpatient Activity dataset, Hospital Outpatient Activity Dataset, Day case Activity Dataset and A&E Dataset.

  • Scotland: Outpatient Appointment (SMR00), Inpatient/daycase activity (SMR01), Maternity interaction/stay (SMR02), Mental health stay (SMR04) and Unscheduled care - A&E.

  • Wales: Outpatients Database for Wales (OPDW), Patient Episode Dataset for Wales (PEDW), Critical Care Dataset (CCDS), Emergency Department Dataset (EDDS) and Intensive Care National Audit and Research Centre (ICNC).

Measures#

  • Diagnoses based on ICD-10 and SNOMED codes.

  • Procedures based on OPCS-4 and SNOMED codes.

  • Admission type - e.g. elective, emergency, maternity.

  • Length of stay - calculated from admission and discharge dates.

  • Patient demographics - includes ethnicity, age and sex.

Strengths#

  • National coverage - all NHS-funded hospital care in England.

  • Longitudinal coverage - HESID allows patients to be tracked, enabling e.g. care pathway analysis and re-admissions tracking.

  • Linkage with other datasets - e.g. other hospital datasets, Mental Health Services Data Set, etc., enabling analysis across healthcare settings.

  • Detailed and standardised information - datasets include date of admission, and standardised diagnoses and procedures based on international clinical coding systems.

Weaknesses#

  • Quality of coding - quality varies between hospitals and over time, and some conditions are systematically under-recorded, e.g. frailty.

  • Missing data and administrative bias - certain fields, e.g. ethnicity may be incomplete, and capture of activity may be prioritised over clinical accuracy.

  • Structure of HESAPC is complex - see the HESAPC dataset guide.

  • Excludes private hospital care - unless the care is NHS-funded.