Once the data is collected from GP system suppliers (GPSS), validation on the file structure and contents will be carried out before files are accepted by NHS Digital’s GP Data Collector system. The data will then be processed by the Data Management Service (DMS) to create a data asset. The processing involves loading the files from GPSS into a secure database and linking to corporate reference data to provide additional demographic information. This process also removes any Personal Identifiable Data (PID) from the data asset.
Data quality will be checked against the standard six data quality characteristics, which are coverage, completeness, validity, default, integrity and timeliness, as per the requirements of NHSE and NHSI.
Many of the SNOMED CT codes used to specify the data items listed are also used in the Quality and Outcomes and Framework (QOF) and other payment extractions and therefore the data quality of these codes is expected to be high. For other SNOMED CT codes that are not used in payment extractions, the data quality may not be as high. Analysis conducted by NHSE and NHSI will determine this.
Other non-coded information will be validated against standard NHS Digital protocols. For example: patient NHS Number will be validated against the Modulus 11 algorithm and GP practice code will be validated against reference data held by NHS Digital.