The processing cycle and HES data quality

Hospital Episode Statistics (HES) data comes from the routine submissions of data from providers to NHS Digital for the purposes of payment for and commissioning of healthcare in England.

Changes to HES, August 2021

Automatic data cleaning and derivation rules documentation and the HES Patient ID sections have been removed from this page. Information on data cleaning and derivations can now be found in the HES Technical Output Specification found on the HES Data Dictionary.

For activity occurring from April 2021, the patient identifier in HES will change from HESID to the Master Person Service (MPS) person Identifier (MPS Person ID). This change will take effect from the April to July 2021 year to date provisional HES data release in September 2021 and will be applied to all previous years of HES data .The MPS Person ID will be the biggest change to HES and is part of a wider strategy to move to a common patient identifier across all national patient level data sets.

A detailed analysis of the changes to HES and how this impacts data and statistics can be found in the methodological change paper and associated data tables accompanying this.

Emergency care data quality

Learn more about Emergency Care Data Quality at the data quality information and reports on the Emergency Care Data Set (ECDS).

HES processing cycle

Healthcare providers collect administrative and clinical data locally to support the care of the patient. The data is submitted to the Secondary Uses Service (SUS).

At pre-arranged dates during the year, SUS consolidates those submissions and compiles the data as HES. It is then validated and cleansed, before deriving new items and making the information available in a database. Data quality reports and checks are completed at various stages in the cleansing and processing cycle.

SUS and HES data quality

This interactive report provides a summary of high level counts of the extracts taken from SUS+. The first page gives a breakdown of number of records extracted by data set - Accident and Emergency (AE), Outpatient (OP) and Admitted Patient Care (APC) - and by organisation number and type (NHS and independent providers). The second page shows the number of records deleted from the SUS extract and the reason for the deletion. These removed records will not appear on the final HES data set. The third page offers some notes to clarify concepts or atypical trends.

Learn more about data quality checks performed on SUS and HES data.

Accessibility of this tool

This tool is in Microsoft PowerBI which does not fully support all accessibility needs.

If you need further assistance, please contact us for help.

HES data quality notes

Latest activity period: May 2026

Latest publication date: 9 July 2026

DQ Notes M2 2026-27 (publication)

zip 5 MB

Understanding volume of legally restricted codes in HES: April 2017 - March 2018

This document provides HES users a count of legally restricted records in HES from April 2017 - March 2018, by provider, for Admitted Patient Care and Outpatients. It highlights the differences in counts due to additional codes added by SUS+ that have since been reverted.

Understanding records affected by reverted legally restrictive codes

xlsx 51 KB

Duplicate methodology

How we identify and handle duplicate records within the HES dataset.

ARTICLE

Methodology for identifying and removing duplicate records from the HES data set

This guidance provides details of the methodology used to identify and remove duplicate records within the HES (Hospital Episode Statistics) data set.

Provider mapping methodology

How we handle records with an invalid provider code within the HES datasets.

ARTICLE

Provider mapping in HES

Guidance explaining how Hospital Episode Statistics (HES) provider mapping process works. This process ensures that every record in HES has a valid provider code attached to it.

SUS admitted patient care data

NHS Digital routinely collects data from hospital providers regarding a patient's time at hospital as part of the Commissioning Data Set (CDS). This is then processed and is returned to healthcare providers as the Secondary Uses Service (SUS) data set and is used by the NHS for operational purposes.

Most NHS hospital trusts submit data on a monthly basis by deadline following a two-phase reconciliation process to arrive at a final agreed position for each month's activity defined in the NHS Standard Contract for payment purposes. This data is consolidated, validated and cleaned and then used to create the Hospital Episode Statistics (HES) data set which is released on a monthly basis as official statistics.

Person_ID and Token_Person_ID in HES Outpatients 2022/23

In June 2024, an issue was identified in provisional HES data for the period of 2023/24 for a subset of activity reported in these fields, where records with unmatched one-time-use-id values for Person_ID and Token_Person_ID were being reported. These were being duplicated incorrectly where they should have been unique.

This issue has now been resolved for across all data products relating to 2023/24 activity. However further investigation of the issue for earlier years of HES data has identified an issue within the finalised HES Outpatient data for the year 2022/23.

ARTICLE

Person_ID and Token_Person_ID in HES outpatients 2022/23

Further information

Master Person Service (MPS)

The Master Person Service (MPS) helps us increase the amount of usable, better-quality data available to support research and planning.

Last edited: 8 July 2026 3:34 pm