Skip to content

Set of 6 SPARQL* queries for epidemiological research within hospitals and RDF* knowledge graph dataset generated for paper "Spatiotemporal Data Modelling for Epidemiological Research in Hospitals"

License

Notifications You must be signed in to change notification settings

LorenaPujante/HospitalKG

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 

Repository files navigation

HospitalKG

In this repository, we present the SPARQL* queries developed for "Spatiotemporal Data Modelling for Epidemiological Research in Hospitals" 10.1109/JBHI.2024.3417224 and the dataset in RDF* generated for testing them.

0. Related Repositories

Below, we present some other related repositories that may be of interest to you:

1. SPARQL* Queries

We have designed and implemented in SPARQL* a set of 6 queries based on patients' movements within a hospital that are meant to assist in several epidemiological research tasks. Each query is in a separate file, where the values of the parameters are the ones used in the experiments presented in 10.1109/JBHI.2024.3417224. The objectives of each query are the following:

  • Q1: Detection of an outbreak at a Service
  • Q2: Detection of an outbreak at a Location
  • Q3: Study of the spread of an outbreak from a patient via analysis of contacts
  • Q4: Study of the spread of an outbreak from a set of patients via analysis of contacts
  • Q5: Investigation of sources of contagion via analysis of contacts
  • Q6: Discovery of the index patient

2. RDF* dataset

We provide the dataset used for the experiments presented in 10.1109/JBHI.2024.3417224 This is an RDF* knowledge graph that follows the spatiotemporal data model presented in the paper. It is a synthetic dataset whose data have been generated using the H-Outbreak simulation model. The values for the parameters of H-Outbreak used to create the dataset are the following:

  • n_patients: 0.7
  • steps: 462 (462×8 = 3696 Hours → 3696/24 = 154 Days → 154/7 = 22 Weeks)
  • population: 170000
  • step_time: 8 (hours)
  • init_exposed: 1
  • init_infected: 0
  • arrival_rate: 17.55
  • prob_arrival_ER: 0.7
  • arrival_state_colonized: 0.076
  • arrival_state_S: 0.9973429
  • arrival_state_I: 0.001563
  • arrival_state_NS: 0.0010941
  • prob_p-env_min: 0.14
  • prob_p-env_max: 0.9
  • prob_p-env_mean: 0.52
  • prob_env-p_min: 0.3262
  • prob_env-p_max: 0.5437
  • prob_env-p_mean: 0.435
  • prob_pe_min: 0.18
  • prob_pe_max: 0.3
  • prob_pe_mean: 0.24
  • incubation_time_min: 48
  • incubation_time_max: 72
  • prob_quick_recov_min: 0.0
  • prob_quick_recov_max: 0.23
  • prob_quick_recov_mean: 0.115
  • prob_long_recov_min: 0.5985
  • prob_long_recov_max: 0.9975
  • prob_long_recov_mean: 0.7981
  • treatment_days_min: 5
  • treatment_days_max: 15
  • treatment_days_mean: 10
  • prob_death: 0.069
  • max_patients_rx: 3
  • max_patients_qx: 2
  • min_steps_rx: 10
  • min_steps_qx: 30
  • max_ward_movements: 2
  • max_steps_er_icu: 3
  • max_movements_room: 5
  • occupancy_icu: 0.46

To transform the output from H-Outbreak in an RDF* knowledge graph, we have used HospitalGeneratorRDF. The file Dataset/LogicZones.ttl has been created ad hoc and contains both the nodes and the edges related to the class LogicZone. The values for the parameters used to create the hospital and the temporal data are the following:

  • index: 1600
  • huPerService: 3
  • nFloors: 5
  • huPerFloor: 6
  • nRows: 3
  • nColumns: 4
  • startDateTime: 01/01/2023 08:00:00
  • optionFloorUH: None

About

Set of 6 SPARQL* queries for epidemiological research within hospitals and RDF* knowledge graph dataset generated for paper "Spatiotemporal Data Modelling for Epidemiological Research in Hospitals"

Topics

Resources

License

Stars

Watchers

Forks