Post-SEM Healthcare Interventions Dataset (30 Days After 2017-2018 Contacts)
melior_post_sem_interventions_30d.Rd
This dataset contains information about healthcare interventions performed within 30 days after healthcare contacts for patients in the SEM cohort, taken from the Melior journal (electronic health record) system. Most interventions were performed during 2017-2018, with a small fraction from adjacent years.
Format
A data frame with 457,186 observations and 10 variables:
- contact_id
Character. Unique identifier for each healthcare contact/encounter, serves as a foreign key to link with other datasets. Original field name: KontaktId
- patient_id
Integer. Patient pseudonym identifier, serves as a foreign key to link with other patient-level data. Original field name: Alias
- activity_type
Character. Type of healthcare activity or note. 256 unique values. Most common: "Epikris, tvärprofessionell" (12.4%), "Op-berättelse" (8.9%), "Inskrivning Läk" (6.9%), "Kontakt utan besök" (6.2%). Original field name: AktivitetTyp
- term_name
Character. Term name for the type of coded intervention. 3 unique values: "KVÅ-kod" (99.99%), "KVÅ/Opkod" (0.01%), "Tillägg KVÅ/Opkod" (0.01%). Original field name: Term_Namn
- care_episode_start
POSIXct. Start date/time of the care episode for the intervention. Date range: 2013-01-01 to 2019-02-13. Distribution by year: 2018 (50.6%), 2017 (47.3%), 2019 (1.5%), with a small fraction (<1%) from 2013-2016. 36 NA values (<0.01%). Original field name: VårdtillfälleFörÅtgärd_StartDatum
- care_episode_end
POSIXct. End date/time of the care episode for the intervention. Date range: 2013-09-09 to 2020-07-10. Distribution by year: 2018 (50.8%), 2017 (46.7%), 2019 (2.3%). 286,299 NA values (62.6%), primarily for outpatient episodes where this is expected. Original field name: VårdtillfälleFörÅtgärd_SlutDatum
- care_form
Character. Form of care ("Slutenvård" = Inpatient, "Öppenvård" = Outpatient). Distribution: Outpatient (63.9%), Inpatient (36.1%). 36 NA values (<0.01%). Original field name: VårdtillfälleFörÅtgärd_VardformText
- intervention_code
Character. Patient intervention/procedure code (KVÅ code). 3,489 unique values across the dataset. Original field name: PatientAtgard_Kod
- intervention_description
Character. Description of the patient intervention/procedure. 3,285 unique values across the dataset. 4,232 NA values (0.9%). Original field name: PatientAtgard_Beskrivning
- intervention_modified_date
POSIXct. Date/time when the intervention was recorded/modified. Date range: 2017-01-02 to 2019-01-30. Distribution by year: 2018 (51.2%), 2017 (46.9%), 2019 (1.9%). Original field name: PatientAtgard_ModifieradDatum
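As a quick check of the missingness described for care_episode_end, the share of missing end dates can be tabulated by care_form. The following is a minimal sketch in base R on invented toy rows; only the column names and the two care_form labels come from the documentation above, and the object name `interventions` is an assumption.

```r
# Toy rows using the documented column names; all values are invented.
interventions <- data.frame(
  contact_id = c("K1", "K2", "K3", "K4"),
  care_form = c("Öppenvård", "Öppenvård", "Slutenvård", NA),
  care_episode_end = as.POSIXct(
    c(NA, NA, "2018-03-05 14:00:00", NA),
    tz = "UTC"
  ),
  stringsAsFactors = FALSE
)

# TRUE where the care episode has no recorded end date/time.
end_missing <- is.na(interventions$care_episode_end)

# Cross-tabulate care form against missingness.
# useNA = "ifany" keeps rows where care_form itself is NA.
na_by_form <- table(
  care_form = interventions$care_form,
  end_missing = end_missing,
  useNA = "ifany"
)
print(na_by_form)

# Share of missing end dates within each care form
# (rows with NA care_form are dropped by tapply).
share_missing <- tapply(end_missing, interventions$care_form, mean)
print(share_missing)
```

On the real data, a share close to 1 for "Öppenvård" and close to 0 for "Slutenvård" would be consistent with the explanation that missing end dates mainly reflect outpatient episodes.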
Details
This file was extracted from the Melior electronic health record system. The original filename indicates it contains information about interventions (Åtgärder) performed within 30 days after healthcare contacts (30DagarEfterVårdkontakt) during 2017-2018. KVÅ codes represent standardized Swedish healthcare procedure codes.
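The 30-day window could in principle be re-checked by linking interventions to their originating contacts through contact_id. The sketch below is illustrative only: the `contacts` table and its `contact_date` column are hypothetical (they are not part of this dataset), the intervention codes are placeholders rather than real KVÅ codes, and measuring the window from care_episode_start is an assumption, since the extraction rule is not documented here.

```r
# Toy interventions with documented column names; values are invented.
interventions <- data.frame(
  contact_id = c("K1", "K1", "K2"),
  intervention_code = c("CODE_A", "CODE_B", "CODE_C"),  # placeholders
  care_episode_start = as.POSIXct(
    c("2017-05-02 09:00:00", "2017-05-20 10:30:00", "2018-01-15 08:00:00"),
    tz = "UTC"
  ),
  stringsAsFactors = FALSE
)

# Hypothetical contact-level table; `contact_date` is an assumed column.
contacts <- data.frame(
  contact_id = c("K1", "K2"),
  contact_date = as.POSIXct(
    c("2017-05-01 12:00:00", "2017-11-01 12:00:00"),
    tz = "UTC"
  ),
  stringsAsFactors = FALSE
)

# Left join: keep every intervention, attach the contact date when available.
linked <- merge(interventions, contacts, by = "contact_id", all.x = TRUE)

# Days elapsed from the contact to the start of the care episode.
linked$days_after_contact <- as.numeric(
  difftime(linked$care_episode_start, linked$contact_date, units = "days")
)

# Flag interventions that fall inside the assumed 0-30 day window.
linked$within_30d <- linked$days_after_contact >= 0 &
  linked$days_after_contact <= 30

print(linked[, c("contact_id", "intervention_code",
                 "days_after_contact", "within_30d")])
```

Rows flagged FALSE would merit a closer look before concluding that they violate the window, because a different date field may have been used when the file was extracted.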
Note
Several fields from the original dataset have been omitted for efficiency and clarity:
AktivitetTermId (activity_term_id): This numeric identifier corresponded almost perfectly to the activity_type field and added little analytical value.
TermId (term_id): This numeric identifier corresponded almost perfectly to the term_name field, with 99.99% of records having the value 1600019040 for "KVÅ-kod".
The care_episode_end field has a high proportion of NA values (62.6%), which is expected for outpatient interventions that do not have a formal end date/time.
Although the dataset primarily covers the 2017-2018 period (97.9% of interventions fall in these years), it contains a small number of records with dates outside this range, including some from 2013-2016 (0.6%) and others from 2019-2020 (1.5%).
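The 12 records with dates in 2020 are likely erroneous future dates; given the date ranges listed under Format, these can only occur in care_episode_end.
For such records, intervention_modified_date (range 2017-01-02 to 2019-01-30) can likely be used as a surrogate date.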
The POSIXct fields (care_episode_start, care_episode_end, intervention_modified_date) are stored as date-time values.
The dataset includes both inpatient and outpatient interventions, with a majority (63.9%) performed in outpatient settings.
This dataset complements post_sem_interventions.csv, which covers a broader time period; the present dataset is restricted to the 30-day window following each SEM cohort contact.
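The notes on out-of-range dates can be turned into a simple screening step. The sketch below, in base R on invented toy rows, flags care episode dates outside 2017-2018 and falls back to intervention_modified_date where the start date is missing or out of range. The helper `year_of()`, the `analysis_date` column, and the fallback rule itself are assumptions made for illustration, not part of the dataset or a documented procedure.

```r
# Toy rows with the documented date columns; all values are invented.
interventions <- data.frame(
  contact_id = c("K1", "K2", "K3"),
  care_episode_start = as.POSIXct(
    c("2017-06-01 08:00:00", "2013-01-01 00:00:00", "2018-12-20 09:00:00"),
    tz = "UTC"
  ),
  care_episode_end = as.POSIXct(
    c(NA, "2017-03-02 10:00:00", "2020-07-10 00:00:00"),
    tz = "UTC"
  ),
  intervention_modified_date = as.POSIXct(
    c("2017-06-01 09:00:00", "2017-03-02 11:00:00", "2018-12-21 15:00:00"),
    tz = "UTC"
  ),
  stringsAsFactors = FALSE
)

# Hypothetical helper: calendar year of a POSIXct vector as integer.
year_of <- function(x) as.integer(format(x, "%Y"))

# Flag start dates outside the 2017-2018 focus period (NA is not flagged here).
start_outside <- !is.na(interventions$care_episode_start) &
  !(year_of(interventions$care_episode_start) %in% 2017:2018)

# Flag end dates in 2020, which the notes describe as likely erroneous.
end_in_2020 <- !is.na(interventions$care_episode_end) &
  year_of(interventions$care_episode_end) == 2020L

# Assumed fallback rule: use the modified date when the start date is
# missing or outside the focus period.
use_surrogate <- start_outside | is.na(interventions$care_episode_start)
interventions$analysis_date <- interventions$care_episode_start
interventions$analysis_date[use_surrogate] <-
  interventions$intervention_modified_date[use_surrogate]

print(interventions[, c("contact_id", "care_episode_start", "analysis_date")])
```

Whether the modified date is an acceptable stand-in depends on the analysis, since it records when the entry was written or changed rather than when the intervention took place.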