FAQs about HES data#
Last modified: 19 Nov 2025
What impact do the different levels of coding have on HES data?
The extent to which specific coding is used in HES data is important. For example, you may observe more records in your HESAPC (admitted patients) than in HESOP (outpatients) dataset, despite the national volume of HESOP records being typically ~5x greater per year. This is because HESAPC has meaningful diagnoses codes consistently provided, whereas generic codes are more often used in HESOP. This means when codes provided by a researcher are matched with HES data in the TRE, fewer matches ('hits') will be made on datasets with non-specific codes. Thus fewer records will be included in the project.Examples of non-specific codes include “R69=Not known” for diagnoses and “X997=Not known” for operations. These are used extensively in HESOP, but far less so in HESAPC.
UK LLC is considering the way it makes linked health records available, by initially making unfiltered views available to researchers (with particularly sensitive records removed) rather than asking for codelists upfront. This will allow codelists to be developed whilst working with the data, but will also allow exploration of records which do not have specific codes assigned.
What do the _ACP, _MAT and _OTR suffixes refer to in HES data?
_OTR is short for Other and is an extension of the HES record. There should be a 1:1 relationship between the main record found in HESAPC and its extension in HESAPC_OTR
_ACP is short for Augmented Care Period. This dataset was collected from 1997–2006. It was replaced by HESCC (critical care) in 2008
_MAT is short for Maternity and contains variables associated with maternity-related admissions.
See below for data and sub table lookup relationships. Note: HESCC is a subset of HESAPC
How can I link _ACP, _MAT, _OTR, and HESCC data to their main record?
These sub tables do not contain an individual-level identifier. They therefore need to be linked to the main HESAPC / HESOP / HESAE datsets. See below for the linkage keys for each dataset: