# Functions
EncryptElements encrypts the genomic ids.
GenerateDataFiles generates the .csv files that 'belong' to the dataset (demodata).
GenerateLoadingDataScript creates a load dataset .sql script.
GenerateLoadingOntologyScript creates a load ontology .sql script.
GenerateOntologyFiles generates the .csv files that 'belong' to the whole ontology (metadata & medco).
GenerateRandomBytes returns securely generated random bytes.
LoadDataFiles executes the loading script for the dataset.
LoadGenomicData initiates the loading process.
LoadOntologyFiles executes the loading script for the ontology.
ReplayDataset replays the dataset x number of times.
SanitizeHeader takes a header name and transforms it into the form Xxx Yyy Zzz.
TagElements tags the genomic ids to allow for the comparison.
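
Two of the helpers above are small enough to sketch. The block below is a minimal illustration only, not the package's implementation: `randomBytes` and `sanitizeHeader` are hypothetical stand-ins, and the assumption that input headers are ASCII words separated by underscores, hyphens, or whitespace is not stated in the source.

```go
package main

import (
	"crypto/rand"
	"fmt"
	"strings"
)

// randomBytes is a hypothetical stand-in for GenerateRandomBytes.
// It returns n bytes read from the operating system's secure random source.
func randomBytes(n int) ([]byte, error) {
	b := make([]byte, n)
	if _, err := rand.Read(b); err != nil {
		return nil, err
	}
	return b, nil
}

// sanitizeHeader is a hypothetical stand-in for SanitizeHeader.
// Assumption: words are ASCII and separated by '_', '-', spaces or tabs.
// Each word is lowercased and its first letter is capitalized.
func sanitizeHeader(header string) string {
	words := strings.FieldsFunc(header, func(r rune) bool {
		return r == '_' || r == '-' || r == ' ' || r == '\t'
	})
	for i, w := range words {
		lower := strings.ToLower(w)
		// FieldsFunc never yields empty strings, so slicing is safe here.
		words[i] = strings.ToUpper(lower[:1]) + lower[1:]
	}
	return strings.Join(words, " ")
}

func main() {
	b, err := randomBytes(16)
	if err != nil {
		panic(err)
	}
	fmt.Println(len(b))
	fmt.Println(sanitizeHeader("tumor_sample_barcode"))
}
```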
# Constants
ANNOTATIONS is the path to the genomic_annotations schema.
I2B2DEMODATA is the path to the i2b2demodata schema.
I2B2METADATA is the path to the i2b2metadata schema.
ONT is the path to the medco_ont schema.
# Variables
ToIgnore: defines the columns to be ignored (mostly the sample and patient IDs).
TranslationDic: defines the translation between the fields that are present in the different datafiles and their 'actual meaning' code-wise.
AnnotationsToQuery: defines the annotations to be queried (to speed up the query).
DefaultDataPath is the default path for the data folder.
The different paths and handlers for all the .sql files.
Support global variables.
NumElMap: defines an approximate size of the map (it avoids rehashing and speeds up the execution).
NumThreads: defines the number of goroutines to use when parallelizing the encryption.
Stores the concept path and the corresponding ID.
Testing environment.
Needed for the observation_fact table (counter).
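
The roles of NumElMap and NumThreads can be illustrated with a short sketch. It is not the package's code: the constant values, the `transformAll` helper, and the `fn` callback (a placeholder for the real encryption step) are all assumptions made for the example. The map is created with a capacity hint so that it does not need to grow repeatedly, and the work is split into at most `numThreads` contiguous chunks, one goroutine per chunk.

```go
package main

import (
	"fmt"
	"sync"
)

const (
	numElMap   = 1024 // hypothetical capacity hint, standing in for NumElMap
	numThreads = 4    // hypothetical goroutine count, standing in for NumThreads
)

// transformAll applies fn to every id using up to numThreads goroutines and
// collects the results in a map preallocated with numElMap entries.
// fn is a placeholder for the real per-element work (e.g. encryption).
func transformAll(ids []int64, fn func(int64) int64) map[int64]int64 {
	out := make([]int64, len(ids))
	chunk := (len(ids) + numThreads - 1) / numThreads // ceiling division

	var wg sync.WaitGroup
	for start := 0; start < len(ids); start += chunk {
		end := start + chunk
		if end > len(ids) {
			end = len(ids)
		}
		wg.Add(1)
		// Each goroutine writes to a disjoint range of out, so no lock is needed.
		go func(lo, hi int) {
			defer wg.Done()
			for i := lo; i < hi; i++ {
				out[i] = fn(ids[i])
			}
		}(start, end)
	}
	wg.Wait()

	// The capacity hint avoids repeated growth while the map is filled.
	result := make(map[int64]int64, numElMap)
	for i, id := range ids {
		result[id] = out[i]
	}
	return result
}

func main() {
	ids := []int64{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
	res := transformAll(ids, func(x int64) int64 { return x * 2 })
	fmt.Println(len(res), res[7])
}
```

Writing into a slice first and filling the map afterwards keeps the map single-threaded; Go maps are not safe for concurrent writes.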
# Structs
ConceptID defines its ID (e.g., E,1 for ENC_ID,1; C,1 for CLEAR_ID,1; sdasdcfsx,1432 for tagged_value,TAG_ID).
ConceptPath defines the end of the concept path tree; it is used in a map so that concepts are not repeated.
SensitiveIDValue contains both concept path and annotation which will be linked to a certain sensitive ID.
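
The deduplication idea behind ConceptPath and ConceptID can be sketched as follows. This is an illustration under assumptions, not the package's definitions: the struct fields, the `registry` type, and the choice to use the path as the map key and to always issue clear ("C") identifiers are all hypothetical.

```go
package main

import "fmt"

// conceptPath is a hypothetical stand-in for ConceptPath.
// A struct with only comparable fields can be used directly as a map key.
type conceptPath struct {
	Field string // assumed: column the concept comes from
	Value string // assumed: leaf value at the end of the path
}

// conceptID is a hypothetical stand-in for ConceptID, e.g. {"C", 1} for CLEAR_ID,1.
type conceptID struct {
	Identifier string
	Num        int64
}

// registry remembers which paths have already been assigned an ID.
type registry struct {
	ids  map[conceptPath]conceptID
	next int64
}

func newRegistry() *registry {
	return &registry{ids: make(map[conceptPath]conceptID), next: 1}
}

// idFor returns the ID for p and reports whether it was newly created.
// A path seen before gets its existing ID back, so no concept is repeated.
func (r *registry) idFor(p conceptPath) (conceptID, bool) {
	if id, ok := r.ids[p]; ok {
		return id, false
	}
	id := conceptID{Identifier: "C", Num: r.next}
	r.next++
	r.ids[p] = id
	return id, true
}

func main() {
	reg := newRegistry()
	a, createdA := reg.idFor(conceptPath{Field: "Cancer Type", Value: "Melanoma"})
	b, createdB := reg.idFor(conceptPath{Field: "Cancer Type", Value: "Melanoma"})
	fmt.Println(a, createdA)
	fmt.Println(b, createdB)
}
```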