
README - GENERAL INFORMATION

Title of Project: CompoundingPharma – Dynamic production sequencing to minimize patient delays for compounded medications 

Grant Agreement: Horizon Europe MSCA Postdoctoral Fellowship No. 101150714 
Repository Version: Final Project Update 2026-02-19

Project acronym: CompoundingPharma 

Principal Investigator: Kraig Delana <ORCID: 0000-0002-0485-9950>
Institution: IE Business School 
Email: kraig.delana@ie.edu


DATA PROVENANCE
The original data on prescription orders for compounded medications were collected during routine operations over four years, from April 2019 to March 2023, and supplied by a partner pharmacy in the United Kingdom. As agreed with the partner pharmacy and consistent with the principle of "as open as possible, as closed as necessary", this replication package provides the minimum data necessary to reproduce our results. Specifically, we have masked entries to hide each medication's actual form, ingredients, and dosages, as well as the exact date and time of each order. The masked data set is fully anonymized and contains no personal data under GDPR definitions. All identifiers are synthetic and non-reversible. The materials support full reproducibility of the numerical experiments reported in the associated manuscript.

DATASET CHARACTERISTICS
- File format: CSV (comma-separated values, UTF-8 encoding) 
- File size: 5,399,280 bytes (5.15 MiB)
- Observations: 67,563 order-level records 
- Variables: 7 structured fields 
- Structure: Flat rectangular table; machine-readable; non-proprietary format 


VARIABLE DICTIONARY
   sim_time_hours        Continuous arrival time (hours from origin)
   year_month            Calendar month indicator (YYYY-MM)
   masked_prod_id        Synthetic medication identifier
   product_form_hash     Masked product form category
   uni_ingcomb_id        Masked ingredient combination identifier
   uni_ingcombdose_id    Masked ingredient–dosage identifier
   masked_prod_id_count  Aggregation count variable
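
The characteristics and variable names listed above can be checked after download with a short script. The following is a minimal sketch in Python (standard library only), not part of the replication code itself. The function names and the file name in the usage note are hypothetical, and the check matches columns by name because this README does not state the column order in the file.

```python
import csv

# Column names as listed in the VARIABLE DICTIONARY of this README.
EXPECTED_COLUMNS = [
    "sim_time_hours",
    "year_month",
    "masked_prod_id",
    "product_form_hash",
    "uni_ingcomb_id",
    "uni_ingcombdose_id",
    "masked_prod_id_count",
]
# Number of order-level records stated under DATASET CHARACTERISTICS.
EXPECTED_ROWS = 67_563


def summarize_orders_csv(path):
    """Return (column_names, row_count) for the CSV file at `path`."""
    # "utf-8-sig" reads plain UTF-8 and also tolerates a leading byte-order mark.
    with open(path, newline="", encoding="utf-8-sig") as handle:
        reader = csv.reader(handle)
        header = next(reader)
        # Count data rows, skipping any completely blank lines.
        row_count = sum(1 for row in reader if row)
    return header, row_count


def check_orders_csv(path, expected_rows=EXPECTED_ROWS):
    """Return a list of problems found; an empty list means the file matches."""
    header, row_count = summarize_orders_csv(path)
    problems = []
    # Assumption: columns are matched by name only, since order is not documented.
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        problems.append(f"missing columns: {missing}")
    if row_count != expected_rows:
        problems.append(f"expected {expected_rows} rows, found {row_count}")
    return problems
```

For example, calling check_orders_csv("orders_masked.csv"), where the file name is a placeholder for the CSV shipped with this package, should return an empty list if the download is intact.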
