Data Generators
CdsDataGenerator
A class to generate CDS (Clinical Decision Support) data based on specified workflows and constraints.
This class provides functionality to generate synthetic FHIR resources for testing CDS systems. It uses registered data generators to create resources like Patients, Encounters, Conditions etc. based on configured workflows. It can also incorporate free text data from CSV files.
ATTRIBUTE | DESCRIPTION |
---|---|
registry |
A registry mapping generator names to generator classes.
TYPE:
|
mappings |
A mapping of workflow names to lists of required generators.
TYPE:
|
generated_data |
The most recently generated FHIR resources.
TYPE:
|
workflow |
The currently active workflow.
TYPE:
|
Example
generator = CdsDataGenerator() generator.set_workflow("encounter_discharge") data = generator.generate_prefetch( ... random_seed=42 ... )
Source code in healthchain/data_generators/cdsdatagenerator.py
18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 |
|
fetch_generator(generator_name)
Fetches a data generator class by its name from the registry.
PARAMETER | DESCRIPTION |
---|---|
generator_name
|
The name of the data generator to fetch (e.g. "PatientGenerator", "EncounterGenerator")
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
Callable
|
The data generator class that can be used to generate FHIR resources. Returns None if generator not found.
TYPE:
|
Example
generator = CdsDataGenerator() patient_gen = generator.fetch_generator("PatientGenerator") patient = patient_gen.generate()
Source code in healthchain/data_generators/cdsdatagenerator.py
free_text_parser(path_to_csv, column_name)
Parse free text data from a CSV file.
This method reads a CSV file and extracts text data from a specified column. The text data can later be used to create DocumentReference resources.
PARAMETER | DESCRIPTION |
---|---|
path_to_csv
|
Path to the CSV file containing the free text data.
TYPE:
|
column_name
|
Name of the column in the CSV file to extract text from.
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
List[str]
|
List[str]: List of text strings extracted from the specified column. |
RAISES | DESCRIPTION |
---|---|
FileNotFoundError
|
If the specified CSV file does not exist or is not a file. |
ValueError
|
If column_name is not provided. |
Exception
|
If any other error occurs while reading/parsing the CSV file. |
Source code in healthchain/data_generators/cdsdatagenerator.py
generate_prefetch(constraints=None, free_text_path=None, column_name=None, random_seed=None)
Generates CDS data based on the current workflow, constraints, and optional free text data.
This method generates FHIR resources according to the configured workflow mapping. For each resource type in the workflow, it uses the corresponding generator to create a FHIR resource. If free text data is provided via CSV, it will also generate a DocumentReference containing randomly selected text from the CSV.
PARAMETER | DESCRIPTION |
---|---|
constraints
|
A list of constraints to apply to the data generation. Each constraint should match the format expected by the individual generators.
TYPE:
|
free_text_path
|
Path to a CSV file containing free text data to be included as DocumentReferences. If provided, column_name must also be specified.
TYPE:
|
column_name
|
The name of the column in the CSV file containing the free text data to use. Required if free_text_path is provided.
TYPE:
|
random_seed
|
Seed value for random number generation to ensure reproducible results. If not provided, generation will be truly random.
TYPE:
|
RETURNS | DESCRIPTION |
---|---|
Prefetch
|
A dictionary mapping resource types to generated FHIR resources. The keys are lowercase resource type names (e.g. "patient", "encounter"). If free text is provided, includes a "document" key with a DocumentReference.
TYPE:
|
RAISES | DESCRIPTION |
---|---|
ValueError
|
If the configured workflow is not found in the mappings |
FileNotFoundError
|
If the free_text_path is provided but file not found |
ValueError
|
If free_text_path provided without column_name |
Source code in healthchain/data_generators/cdsdatagenerator.py
set_workflow(workflow)
Sets the current workflow to be used for data generation.
PARAMETER | DESCRIPTION |
---|---|
workflow
|
The name of the workflow to set.
TYPE:
|
ClassGenerator
Bases: BaseGenerator
A generator class for creating FHIR Class resources.
METHOD | DESCRIPTION |
---|---|
generate |
Generates a FHIR Class resource. |
Source code in healthchain/data_generators/encountergenerators.py
EncounterGenerator
Bases: BaseGenerator
A generator class for creating FHIR Encounter resources.
METHOD | DESCRIPTION |
---|---|
generate |
Optional[list] = None, random_seed: Optional[int] = None) -> Encounter: Generates a FHIR Encounter resource with optional constraints and random_seed. |
Source code in healthchain/data_generators/encountergenerators.py
EncounterLocationGenerator
Bases: BaseGenerator
A generator class for creating FHIR EncounterLocation resources.
METHOD | DESCRIPTION |
---|---|
generate |
Generates a FHIR EncounterLocation resource. |
Source code in healthchain/data_generators/encountergenerators.py
EncounterPriorityGenerator
Bases: BaseGenerator
A generator class for creating FHIR EncounterPriority resources.
METHOD | DESCRIPTION |
---|---|
generate |
Generates a FHIR EncounterPriority resource. |
Source code in healthchain/data_generators/encountergenerators.py
EncounterTypeGenerator
Bases: BaseGenerator
A generator class for creating FHIR EncounterType resources.
METHOD | DESCRIPTION |
---|---|
generate |
Generates a FHIR EncounterType resource. |