SEM Healthcare Interventions within 30 days Dataset (2017-2018)
melior_sem_interventions_30d.Rd
This dataset contains information about healthcare interventions performed during healthcare contacts for patients in the SEM cohort from the Melior journal system. The data represents interventions performed primarily during 2017-2018 (98.8% of records), with a small number of records from 2013-2016 (1.2%) and 2019-2020 (<0.1%).
Format
A data frame with 472,935 observations and 9 variables:
- contact_id
Character. Unique identifier for each healthcare contact/encounter, serves as a foreign key to link with other datasets. Original field name: KontaktId
- patient_id
Integer. Patient pseudonym identifier, serves as a foreign key to link with other patient-level data. Original field name: Alias
- activity_type
Character. Type of healthcare activity or note. 165 unique values. Most common: "Epikris, tvärprofessionell" (35.3%), "Inskrivning Läk" (17.2%), "Akutkliniken Läk" (11.2%), "Op-berättelse" (7.9%), "Överflyttningsanteckning" (6.7%). Original field name: AktivitetTyp
- term_name
Character. Term name for the terminology/classification system used. 3 unique values: "KVÅ-kod" (99.997%), "KVÅ/Opkod" and "Tillägg KVÅ/Opkod" (combined <0.003%). Original field name: Term_Namn
- care_episode_start
POSIXct. Start date/time of the care episode for the intervention. Date range: 2013-02-11 to 2020-04-22. Distribution by year: 2017 (50.0%), 2018 (48.8%), 2016 (0.7%), 2015 (0.3%), 2013-2014 (0.2%), 2019-2020 (<0.1%). Original field name: VårdtillfälleFörÅtgärd_StartDatum
- care_episode_end
POSIXct. End date/time of the care episode for the intervention. Date range: 2014-12-23 to 2029-10-22 (with one future date likely erroneous). Distribution by year: 2018 (49.6%), 2017 (47.1%), 2019 (2.9%), 2020 (0.3%), 2014-2016 (<0.1%). Contains 1,530 NA values (0.3%). Original field name: VårdtillfälleFörÅtgärd_SlutDatum
- intervention_code
Character. Patient intervention/procedure code (KVÅ code). 3,363 unique values across the dataset. Original field name: PatientAtgard_Kod
- intervention_description
Character. Description of the patient intervention/procedure. 3,367 unique values across the dataset. Contains 366 NA values (0.1%). Original field name: PatientAtgard_Beskrivning
- intervention_modified_date
POSIXct. Date/time when the intervention was recorded/modified. Date range: 2013-05-31 to 2020-12-03. Distribution by year: 2018 (49.8%), 2017 (47.6%), 2019 (2.2%), 2016 (0.3%), 2013-2015 and 2020 (0.1%). Original field name: PatientAtgard_ModifieradDatum
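The yearly distributions and the foreign-key role of patient_id described above can be reproduced with base R. The following is only a minimal sketch on a small invented data frame that mimics the documented column names and types. The toy values, the `patients` lookup table and its `birth_year` column are assumptions made for illustration and are not part of this dataset.

```r
# Toy stand-in for the documented data frame (all values are invented).
interventions <- data.frame(
  contact_id = c("K1", "K2", "K3", "K4"),
  patient_id = c(101L, 101L, 102L, 103L),
  care_episode_start = as.POSIXct(
    c("2017-03-01 08:00:00", "2017-11-15 12:30:00",
      "2018-02-02 09:15:00", "2016-12-30 23:00:00"),
    tz = "UTC"
  ),
  stringsAsFactors = FALSE
)

# Share of records per calendar year of care_episode_start, in percent.
start_year <- format(interventions$care_episode_start, "%Y")
year_share <- round(100 * prop.table(table(start_year)), 1)
print(year_share)

# Hypothetical patient-level table keyed by patient_id (an assumption).
patients <- data.frame(
  patient_id = c(101L, 102L, 103L),
  birth_year = c(1950L, 1962L, 1948L)
)

# Left join: keep every intervention row and add the patient-level columns.
linked <- merge(interventions, patients, by = "patient_id", all.x = TRUE)
```

Using `all.x = TRUE` keeps intervention rows even when no matching patient record exists, which makes unmatched keys visible as NA rather than silently dropping them.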
Details
This file was extracted from the Melior electronic health record system. The original filename indicates that it contains information about interventions (Åtgärder) performed at healthcare contacts (VidVårdkontakt) in 2017-2018. KVÅ codes are standardized Swedish healthcare procedure codes.
Note
Several fields from the original dataset have been omitted for efficiency:
AktivitetTermId (activity_term_id): ID that almost perfectly corresponded to activity_type.
VårdtillfälleFörÅtgärd_VardformText: takes a single value ("Slutenvård"), with no exceptions.
All records in this dataset are for inpatient care ("Slutenvård"), which is consistent with SEM cohort patients being primarily inpatient cases.
The dataset contains one apparently erroneous future date (in 2029) in care_episode_end, which is likely a data entry error (probably 2019 with 10 years added); this is fixed below.
A small fraction of records have timestamps from 2013-2016 or 2020, which likely represent data entry errors.
POSIXct fields are stored in datetime format
366 intervention descriptions (0.1%) are missing despite having intervention codes
Original field names are preserved in the documentation for reference
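The data quality notes above can be checked, and the 2029 date corrected, along the lines of the sketch below. It runs on invented rows. The correction rule (subtract ten calendar years from any care_episode_end later than a cutoff) and the cutoff `2021-01-01` are assumptions about how such a fix might be done, not the actual cleaning code used for this dataset. The intervention codes and descriptions are placeholders as well.

```r
# Toy rows (invented) with one future end date and one missing description.
interventions <- data.frame(
  intervention_code = c("AA001", "AB002", "AC003"),
  intervention_description = c("Example A", NA, "Example C"),
  care_episode_end = as.POSIXct(
    c("2018-05-01 10:00:00", NA, "2029-10-22 00:00:00"),
    tz = "UTC"
  ),
  stringsAsFactors = FALSE
)

# Hypothetical cutoff: any end date after this is treated as erroneous.
cutoff <- as.POSIXct("2021-01-01", tz = "UTC")
is_future <- !is.na(interventions$care_episode_end) &
  interventions$care_episode_end > cutoff

# Shift the flagged timestamps back by ten calendar years via POSIXlt.
fixed <- as.POSIXlt(interventions$care_episode_end[is_future], tz = "UTC")
fixed$year <- fixed$year - 10L
interventions$care_episode_end[is_future] <- as.POSIXct(fixed)

# Count missing end dates and descriptions that are missing despite a code.
n_missing_end <- sum(is.na(interventions$care_episode_end))
missing_desc <- is.na(interventions$intervention_description) &
  !is.na(interventions$intervention_code)
n_missing_desc <- sum(missing_desc)
```

Going through POSIXlt changes only the year component, so the month, day and time of the original timestamp are preserved.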