Brighton & Sussex Medical School

Secondary Data Analysis - what can I access?

Secondary Data Analysis


Sources of Primary Care Data

(Note: these datasets carry a cost and a long lead time for access – speak to your supervisor.)

  1. CPRD.com (dataset costs range from £7,500 (<1,000 patients) to £30,000 (>300,000 patients), plus a £3-5k extraction fee)
  2. THIN – UCL
  3. QResearch – University of Nottingham
  4. ResearchOne – University of Leeds
  5. RCGP Research and Surveillance Centre – University of Surrey

Note: GPs record clinical events using Read Codes. Read Code training is available here >

Read prescribing data at GP practice and CCG level, reported monthly, here > 


Ethics around the use of big collected datasets 

  • Collected without individual-level consent (except research datasets).
    - Patients can usually opt out.
  • Legal framework for use:
    - Personally identifiable data is protected under the Data Protection Act.
    - Thus, data must be “anonymised” before use.
    - Where a patient ID is retained for linkage, the data is called “pseudonymised”.
    - For true anonymity, some data is aggregated, or as a researcher you must apply k-anonymity in your reports, so that every combination of identifying characteristics you report is shared by at least k individuals.
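
As a rough illustration of the k-anonymity point, the sketch below counts how many records share each combination of quasi-identifiers and masks any group smaller than k before it goes into a report. The field names (age_band, sex), the toy records and the threshold k=5 are all assumptions made for illustration; the threshold that applies to a real study is set by the data custodian.

```python
from collections import Counter


def group_counts(records, quasi_identifiers):
    """Count how many records share each combination of quasi-identifier values."""
    return Counter(tuple(r[q] for q in quasi_identifiers) for r in records)


def is_k_anonymous(records, quasi_identifiers, k=5):
    """True if every combination of quasi-identifiers covers at least k records."""
    return all(n >= k for n in group_counts(records, quasi_identifiers).values())


def suppress_small_cells(records, quasi_identifiers, k=5):
    """Return per-group counts for a report, masking any count below k."""
    counts = group_counts(records, quasi_identifiers)
    return {group: (n if n >= k else f"<{k}") for group, n in counts.items()}


# Hypothetical toy extract: six women aged 40-49 and two men aged 90+.
records = (
    [{"age_band": "40-49", "sex": "F"} for _ in range(6)]
    + [{"age_band": "90+", "sex": "M"} for _ in range(2)]
)
quasi_identifiers = ["age_band", "sex"]  # assumed column names

print(is_k_anonymous(records, quasi_identifiers))        # the 90+ group has only 2 records
print(suppress_small_cells(records, quasi_identifiers))  # small group is reported as "<5"
```

Masking small counts is only one way to reach k-anonymity; widening the categories (for example, merging age bands) until every group reaches k is a common alternative.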


 


Permissions for using the data:

  • Dataset-wide ethical approvals are often already in place; the data custodian will need to approve a study protocol before releasing data, but NHS/HRA ethics approval is not needed.
  • NHS CAG (Confidentiality Advisory Group) grants the permission needed for research that requires patient-identifiable data (e.g. for linkage) without seeking consent; apply to the REC at the same time.
  • If data is used with patient consent: follow the normal HRA route for REC approval.
  • Some proprietary datasets cannot leave their trusted safe haven – you may be given a login for remote access through a webpage.