Linking methodology


Cumulative submissions

Both HES and PROMs data are updated on a cumulative basis, and providers can resubmit data for any previous period within the current financial year. Q1 PROMs questionnaires may be administered at any time between the decision to admit and the episode itself, and completed Q2 questionnaires may arrive at any point after patients have received and returned them. Questionnaires can therefore arrive well after the period to which they relate. This is especially the case for Q2s, which are completed and returned several months after the episode in which the operation occurred.

For this reason, PROMs matching is run on a cumulative basis to ensure that any updated episode or recently returned questionnaire is considered for matching. Linkage rates will therefore increase retrospectively for PROMs data submitted in previous periods.


Matching pre-operative questionnaires (Q1) to HES

The methodology that matches a HES episode to a Q1 questionnaire is a four-stage process that looks at a combination of patient identifiable fields, provider codes, operation codes and dates. It uses a ‘match ranking’ system in which each stage of the linking process is given a rank and a corresponding score. The rank denotes the quality of the match, with the lowest rank (i.e. 1) being the highest quality match and carrying the highest score. The scores for each possible match are compared and the match with the highest score is chosen.

The four stages are:

  1. Patient matching
  2. Provider matching
  3. Date matching
  4. A ‘tie break’, which combines the scores from the first three stages to pick the best match if an episode matches more than one Q1.

See Appendices 4 and 5 in the accompanying Excel file for the clinical operation codes used to identify hip and knee replacements for PROMs.

1. Patient matching

Patient matching looks at a range of patient identifiable fields, including NHS number, sex and date of birth. These fields from the PROMs questionnaires are used to find a matching patient identifier in the HES data. This identifier is then used to find an associated HES episode. The episode must have the relevant clinical coding and match a compatible questionnaire; for example, a hip questionnaire must match a hip replacement episode.

Details of the patient matching ranks can be found in the table below.

| Match rank | Description | Score |
|---|---|---|
| 1 | Exact match of DOB, SEX, NHSNO and POSTCODE | 15 |
| 2 | Exact match of DOB, SEX, NHSNO | 14 |
| 3 | Partial match of DOB and exact match of SEX, NHSNO and POSTCODE | 13 |
| 4 | Partial match of DOB, and exact match of SEX, NHSNO | 12 |
| 5 | Exact match of POSTCODE and NHSNO | 11 |
| 6 | Exact match of DOB, SEX and POSTCODE (where NHSNO does not contradict the match and DOB is not 1 January and the POSTCODE is not in the 'ignore' list) | 10 |
| 7 | Exact match of DOB, SEX and POSTCODE (where NHSNO does not contradict the match and DOB is not 1 January) | 9 |
| 8 | Exact match of DOB, SEX and POSTCODE (where DOB is not 1 January) | 8 |

2. Provider matching

The provider matching compares the provider code of the episode in HES to the provider code submitted by the hospital where the Q1 questionnaire was administered. In many cases, the Q1 questionnaire may be completed in one hospital and the operation carried out in another due to patient choice, subcontracting arrangements between providers or shared pathways.

For non-perfect matches, the following groups are used:

  • green list - a list of provider codes mapped where a known subcontracting or shared pathway relationship exists between the two organisations
  • amber list - organisations where little information is known about the relationship between the two but it is feasible that a relationship could exist

Details of the provider matching ranks (ranks 1 to 3 are classified as allowable matches) can be found in the table below.

| Match rank | Description | Score |
|---|---|---|
| 1 | Exact match of provider | 10 |
| 2 | Provider in the matrix 'allowed list' (green) | 8 |
| 3 | Provider in the matrix 'possible list' (amber) | 5 |
| 4 | Provider in the matrix 'possible list' (amber) greater than 100 miles | 3 |
| 5 | Provider in the matrix 'exclude' list (red) / no provider match | 1 |
| 6 | No provider match | 1 |
| 9 | (Metadata - no episode to match / matches on patient only) | 0 |
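
The provider ranks above can be expressed as a small lookup. The sketch below is illustrative only: the function name, the `green`, `amber` and `red` sets of provider-code pairs, the `miles_apart` argument and the provider codes in the example are all assumptions, because the source does not describe how the matrix is stored or how distance is measured.

```python
# Scores copied from the provider matching table (rank -> score).
PROVIDER_SCORES = {1: 10, 2: 8, 3: 5, 4: 3, 5: 1, 6: 1, 9: 0}


def provider_match_rank(q1_provider, episode_provider,
                        green, amber, red, miles_apart=None):
    """Return the provider match rank for one Q1/episode pair.

    green, amber and red are hypothetical sets of
    (Q1 provider code, episode provider code) pairs.
    """
    if episode_provider is None:
        return 9  # metadata rank: no episode to match
    if q1_provider == episode_provider:
        return 1
    pair = (q1_provider, episode_provider)
    if pair in green:
        return 2
    if pair in amber:
        # Assumption: distance is supplied by the caller, in miles.
        if miles_apart is not None and miles_apart > 100:
            return 4
        return 3
    if pair in red:
        return 5
    return 6  # no provider match (same score as rank 5)


# Hypothetical provider codes, for illustration only.
green = {("AAA", "BBB")}
rank = provider_match_rank("AAA", "BBB", green, set(), set())
print(rank, PROVIDER_SCORES[rank])
```

Ranks 5 and 6 carry the same score in the table, so distinguishing them here only affects the reported rank, not the total score.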

 

3. Date matching

Date matching compares the episode start date in HES with the ‘Q1 completed date’. This is the date on which the patient completes and signs the pre-operative questionnaire. Any match where the completed date is more than 18 weeks before the operation is deemed unacceptable, because this threshold coincides with the 18-week target for the maximum wait between referral and treatment.

Details of the date matching ranks (ranks 1 to 5 are classified as allowable matches) can be found in the table below.

| Match rank | Description | Score |
|---|---|---|
| 1 | Completed date between one week prior to the episode start date and the episode end | 6 |
| 2 | Completed date within three weeks prior to the episode start date | 5 |
| 3 | Completed date within six weeks prior to the episode start date | 4 |
| 4 | Completed date within 12 weeks prior to the episode start date | 3 |
| 5 | Completed date within 18 weeks prior to the episode start date | 2 |
| 6 | Completed date over 18 weeks prior to the episode start date | 1 |
| 7 | Completed date after the end of the episode | 1 |
| 9 | (Metadata - no episode to match / matches on patient only) | 0 |
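
As a rough illustration of the date ranks, the sketch below assigns a rank from the Q1 completed date and the episode start and end dates. The function name and arguments are hypothetical. It also assumes that the week boundaries are inclusive and that rank 1 covers any completed date from one week before the episode start up to the episode end. The source does not state either point explicitly.

```python
from datetime import date, timedelta

# Scores copied from the date matching table (rank -> score).
DATE_SCORES = {1: 6, 2: 5, 3: 4, 4: 3, 5: 2, 6: 1, 7: 1, 9: 0}

# (weeks before episode start, rank), checked from narrowest to widest.
WEEK_LIMITS = [(1, 1), (3, 2), (6, 3), (12, 4), (18, 5)]


def date_match_rank(completed, episode_start, episode_end):
    """Return the date match rank for one Q1/episode pair."""
    if completed > episode_end:
        return 7  # completed after the end of the episode
    for weeks, rank in WEEK_LIMITS:
        # Assumption: a date exactly on the boundary counts as "within".
        if completed >= episode_start - timedelta(weeks=weeks):
            return rank
    return 6  # completed more than 18 weeks before the episode start


start, end = date(2021, 5, 10), date(2021, 5, 14)
rank = date_match_rank(date(2021, 5, 1), start, end)  # nine days before start
print(rank, DATE_SCORES[rank])
```

Per the text above, only ranks 1 to 5 are classified as allowable matches; ranks 6 and 7 are still returned so that the reason for a poor match stays visible.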

4. Tie-break

In cases where a Q1 can potentially link to more than one episode, the link with the highest score is chosen as a tie-breaker, thereby ensuring that the highest quality data are selected. The patient match has been given a higher weight than the provider and date matches because it involves matching on multiple fields (NHS number, sex, date of birth, postcode) and is therefore less likely to create a false positive link.

Each match (or potential match) between an episode and a Q1 has a combined match rank and a total score. For example, a patient match of rank 1 (score 15), a provider match of rank 2 (score 8) and a date match of rank 3 (score 4) give a combined match rank of 123 and a total score of 27.
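
The combined rank and total score can be reproduced with a few lines of code. This is a minimal sketch, not the production matching logic: the function names and the candidate structure are hypothetical, and the source does not say how links with equal total scores are separated, so the sketch simply keeps the first one encountered.

```python
# Scores copied from the three ranking tables (rank -> score).
PATIENT_SCORES = {1: 15, 2: 14, 3: 13, 4: 12, 5: 11, 6: 10, 7: 9, 8: 8}
PROVIDER_SCORES = {1: 10, 2: 8, 3: 5, 4: 3, 5: 1, 6: 1, 9: 0}
DATE_SCORES = {1: 6, 2: 5, 3: 4, 4: 3, 5: 2, 6: 1, 7: 1, 9: 0}


def combined_match(patient_rank, provider_rank, date_rank):
    """Return (combined match rank, total score) for one candidate link."""
    combined_rank = f"{patient_rank}{provider_rank}{date_rank}"
    total = (PATIENT_SCORES[patient_rank]
             + PROVIDER_SCORES[provider_rank]
             + DATE_SCORES[date_rank])
    return combined_rank, total


def best_link(candidates):
    """Pick the candidate with the highest total score.

    candidates maps a hypothetical episode key to its
    (patient rank, provider rank, date rank) triple.
    Assumption: on equal totals the first candidate wins.
    """
    return max(candidates, key=lambda k: combined_match(*candidates[k])[1])


print(combined_match(1, 2, 3))  # ('123', 27), the worked example above
print(best_link({"episode_a": (1, 2, 3), "episode_b": (2, 1, 1)}))
```

In the second call, `episode_b` scores 14 + 10 + 6 = 30 against 27 for `episode_a`, so it is selected even though its patient rank is lower in quality.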


Changes to linking methodology from 1 April 2021

In 2021, significant changes were made to the processing of HES data and to the associated data fields that are used to link the PROMs-HES data. The data in HES consists of information about individual consultant episodes, outpatient attendances and A&E attendances, with no links between them. However, several such records may relate to a single patient. To address this, NHS England has historically created a derived field known as the HES Patient ID (HESID) to provide a way of tracking patients through the HES database. This was designed to be resilient to data quality issues and remove the need to access personal.


We ran the two linking methodologies in parallel (one with HESID and one with Person ID), which confirmed that this change had a minimal effect on the linked PROMs-HES data, with around 0.1% of questionnaires being affected. We also saw a small increase in the number of questionnaires linking with episodes.

| 2019-20 | HES Run | MPS Run | Difference |
|---|---|---|---|
| Total records | 164,527 | 164,527 | |
| Person matches (HESID/MPSID) | 162,748 | 162,338 | 410 |
| Episode matches | 119,234 | 119,226 | 85 |

| 2020-21 | HES Run | MPS Run | Difference |
|---|---|---|---|
| Total records | 39,971 | 39,971 | |
| Person matches (HESID/MPSID) | 39,617 | 38,996 | 621 |
| Episode matches | 24,316 | 24,356 | -40 |

For both financial years, the number of records was the same and the number of final episode matches was similar. More patient matches were found with the HES processing run; however, these did not return different episode matches compared to the MPS run. Note that 2020-21 covers a period when health services were affected by the COVID-19 pandemic. This directly affected the volumes of hip and knee replacement activity reported in PROMs.


Matching the post-operative questionnaire (Q2) to HES/Q1

All Q1s have a unique number, the PROMs serial number, on the front sheet, which is scanned when the form is returned. When the Q2 questionnaire is returned, the Q1 and Q2 questionnaires are linked by PROMs serial number to create a match.

As an additional step, the patient’s date of birth (DOB) as submitted on the Q2 questionnaire is compared with the DOB submitted on the Q1 questionnaire. This check gives extra confidence that the match is correct.

Details of the questionnaire matching ranks can be found in the table below.

| Match rank | Description |
|---|---|
| 1 | Exact match of PROMs serial number and DOB as recorded on the Q2 |
| 2 | Exact match of PROMs serial number and partial match of DOB as recorded on the Q2 |
| 3 | Exact match of PROMs serial number, no match on DOB |
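
The questionnaire ranks can be sketched in the same way. The source does not define a ‘partial match’ of DOB, so the rule used below (exactly two of day, month and year agree) is purely an assumption for illustration, as are the function names and the serial numbers in the example.

```python
from datetime import date


def partial_dob_match(dob_a, dob_b):
    """Assumed rule: exactly two of day, month and year agree."""
    agreeing = sum([dob_a.day == dob_b.day,
                    dob_a.month == dob_b.month,
                    dob_a.year == dob_b.year])
    return agreeing == 2


def q2_match_rank(q1_serial, q1_dob, q2_serial, q2_dob):
    """Return the questionnaire match rank, or None if serials differ."""
    if q1_serial != q2_serial:
        return None  # no link without an exact serial number match
    if q1_dob is None or q2_dob is None:
        return 3  # assumption: a missing DOB counts as no DOB match
    if q1_dob == q2_dob:
        return 1
    if partial_dob_match(q1_dob, q2_dob):
        return 2
    return 3


# Hypothetical serial number; the month differs, day and year agree.
print(q2_match_rank("P0001", date(1950, 3, 4), "P0001", date(1950, 4, 4)))
```

Under these assumptions the example returns rank 2, because the serial numbers match exactly and only the month of birth differs.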

 


Last edited: 20 June 2023 4:15 pm