Skip to main content
Creating a new NHS England: Health Education England, NHS Digital and NHS England have merged. More about the merger.

Hospital Episode Statistics data changes from 2021

From September 2021 we made changes to the way we process the Hospital Episode Statistics (HES) data set. These improvements provided data users with the tools to process larger volumes of data and manage incoming data to ensure that it is accurate, useful and secure.

September 2021 HES data changes

Extensive work was undertaken to change the way we process the Hospital Episode Statistics (HES) data set and to facilitate a wide range of benefits including but not limited to

  • improving matching of patients across all national patient-level datasets through the  Master Person Service
  • exploiting new analytical and data science capabilities
  • increasing efficiency through automation
  • reducing costs by retiring legacy systems

A small number of issues have been identified that we are working to address as quickly as possible. These may impact upon either the quality or availability of the specific data fields concerned. Details of the fields impacted and what this means in terms of the data are available in the consolidated technical output specification (ToS) document, under the ’Known Data Item Issues’ tab. 

Specifically, critical care has a number of data quality issues, particularly around how neonatal and paediatric are being grouped into a critical care period. The issues are understood and documented within the ToS. 

Consideration should be taken when using fields with a known data item issue as it could distort any time series analysis . 


Summary of changes

Data users saw the following changes to HES:

  • the introduction of the Person_ID to enable direct linkage of patient records across data sets, reducing the time and complexity of HES data analysis
  • the HES_ID field was retired along with a list of other fields that were no longer valid

We are aligned our handling and updating of reference data to other data sets. These changes increased the amount of linkable data available to support research and planning.

These changes helped us:

  • improve matching of patients across all national patient-level datasets through the Master Person Service
  • exploit new analytical and data science capabilities
  • increase efficiency through automation
  • reduce cost by retiring legacy systems

All users of HES data were affected by these changes but please note that the data submitted to SUS+ was not affected.

The following information is historical and included for information only


September 2021 planned HES data changes status

Person_ID

For activity occurring from April 2021, the patient identifier in HES will change from HESID to the Master Person Service (MPS) person Identifier (Person_ID). This change will take effect from the April to July 2021 year to date provisional HES data release in September 2021 and will be applied to all previous years of HES data. The Person_ID will be the biggest change to HES and is part of a wider strategy to move to a common patient identifier across all national patient level data sets.

MPS Person ID will support data users to

  • measure counts of patients rather than attendances, appointments, episodes
  • link together activity in the data relating to the same patient such as continuous inpatient spells
  • Identify and track specific patient cohorts across time

Person_ID confirms the individuals identity using PDS data on Spine and assigns a common ID across all NHS Digital data sets.

A detailed analysis of the changes to HES and how this impacts data and statistics can be found in the methodological change paper and associated data tables accompanying this.

Reference data

We are aligning our handling and updating of reference data to other data sets such as Mental Health Services Data set (MHSDS).

The areas of change are

  • the criteria for accepting values in some fields as valid
  • the frequency with which derivations will be rerun with latest reference data

HES_ID

All HES products, from the April to July 2021 year to date provisional HES data release in September 2021, will contain TOKEN_PERSON_ID only. 

Any HES data received from September 2021 will contain TOKEN_PERSON_ID instead of HES_ID.

The overall structure of HES data will not change. For a period of time, your HES data will still contain HES_ID field, but this will not be populated for activities recorded past March 2021 for the April to July 2021 year to date provisional HES data release in September. Instead, data referring to activities recorded from April 2021 onwards, will contain TOKEN_PERSON_ID. 

DARS Data Sharing Agreements (DSAs)

DARS customers will not need to make any changes outside their usual renewal schedule. However, if you are currently working on your affected Data Sharing Agreement (DSA) you may be able to update it. Please speak to your case officer for further information.

TOKEN_PERSON_ID is available to select on DARS Online from November 2020.

Should you need to, HES_ID will be available to select on DARS Online until February 2021.

To receive your first drop off date with TOKEN_PERSON_ID your DSA will need to be signed off and active by August 2021.

If you would like to make changes to your fields when you are renewing your DSA please speak to your case officer so they can arrange these changes for you, as you will not be able to make changes to your fields online. 

There will be no additional charge for adding TOKEN_PERSON_ID to your application.

We have reviewed the list of variables within HES data and identified fields that are no longer populated. Whilst those retired fields are still available to select on DARS Online as they are valid for historical data they are not populated. Therefore, we strongly advise you to review the list of retired fields before selecting your HES variables on DARS online.

Linking files to existing data

If you have received HES data (as extracts) before June 2021 and are requesting further HES data, you will receive a HES Person ID bridge file to enable you to attribute TOKEN_PERSON_ID to any historic data you hold. There is no intention to re-supply any HES data provided prior to June 2021.

If you received provisional HES data prior to the April to July 2021 year to date provisional HES data release in September 2021 then you will not receive a bridge file for this activity as this activity will be superseded with a year to date extract containing the new person identifier covering the the April to July 2021 year to date provisional HES data release in September 2021 onwards.

If you did not previously receive HES_ID the assumption will be that you also do not require TOKEN_PERSON_ID, unless you tell us otherwise during your application process in which case you will need to make an amendment to your application. You will continue to receive your local identifier or Study_ID to track patients as before.

HES admitted patient care bridge files are linkable to historical HES admitted patient care data using the EPIKEY field.

HES outpatient bridge files are linkable to historical HES outpatient data using the ATTENDKEY field. HES accident and emergency bridge files are linkable to historical HES accident and emergency data using the AEKEY field.

This will be supplied on a case by case basis, allowing you to assign TOKEN_PERSON_ID to your historical data, in place of HES_ID. From 2021/22 onwards data will be supplied with TOKEN_PERSON_ID so no further bridge files will be supplied.

If you receive new copies of historical HES data frequently, you will not need the bridge file as these will be supplied with TOKEN_PERSON_ID in future. You will therefore have a consistent person identifier (TOKEN_PERSON_ID) across all data years.

Bridge file

Data item  Description  Format 

EPIKEY

Unique record identifier

BIGINT

ATTENDKEY Unique record identifier

BIGINT

AEKEY

Unique record identifier

BIGINT

TOKEN_PERSON_ID

Pseudonymised Person Identifier

varchar(32)

FYEAR

Financial year activity is relating to

varchar(4)

Further information

internal Hospital Episode Statistics Data Dictionary

The Hospital Episode Statistics (HES) Data Dictionary is intended for use by all users of HES data. An NHS data dictionary works in the same way as a normal dictionary, but contains information about data items.

internal Data Access Request Service (DARS)

The Data Access Request Service (DARS) can offer clinicians, researchers and commissioners the data required to help improve NHS services. We handle applications, process data and provide access to specific sets of data when requests are approved.

internal Master Person Service (MPS)

The Master Person Service (MPS) helps us increase the amount of usable, better-quality data available to support research and planning.

Last edited: 18 May 2023 1:29 pm