A package for defining (and performing) tabular-data transformations with JSON.
created by Austin Poor
Under Construction
This library is intended to help develop a standardized data-transformation schema for tabular data.
Data transformation files are defined in JSON so they are human-readable, trackable in VC, and so they can be passed between languages. DataTransform
‘s goal is to promote reproducablity and clean transformations (rather than hack-y fixes, whenever possible).
You pass in a rectangular dataset and define a stack of transformations to apply — DataTransform.jl
does the rest!
…
…
Data transform files are defined in JSON.
if/else-if/else
or case-when
)FeatureEng
The basic schema structure is:
{
"version": "0.1", // Schema Version Number for Compatability
"transformations": [...] // List of Transformation Objects
}
Deciding how to handle errors. Set error handling globally.
Example file: sample-transform.json.
…
Any contribution would be greatly appreciated! You can help by suggestint additions to the package or changes you think would help. In addition, please feel free to submit an issue or PR!
Thanks!
Austin