Python package to load user-specified metadata models into Google Cloud Data Catalog, comprising Custom Entries, Tag Templates, and Tags
A Python package intended to load user-specified metadata models into Google Cloud Data Catalog,
comprising Custom Entries, Tag
Templates, and Tags.
It is powered by datacatalog-custom-entries-
manager, datacatalog-
tag-template-processor, and
datacatalog-tag-manager, leveraging
their features in the format of a single CLI.
Loading custom models from CSV requires a folder (--files-folder
argument in the below commands)
containing three files: entries.csv
, tag_templates.csv
, and tags.csv
. Please refer to the
following external resources for further details on their layouts:
FILE | LAYOUT DOCUMENTATION |
---|---|
entries.csv |
datacatalog-custom-entries-manager README |
tag_templates.csv |
datacatalog-tag-template-processor README |
tags.csv |
datacatalog-tag-manager README |
Python + virtualenv
pip install datacatalog-custom-model-manager
datacatalog-custom-model load \
--files-folder <CSV-FILES-PATH> \
--project-id <YOUR-PROJECT-ID> --location-id <YOUR-LOCATION-ID> \
[--delete-existing-tag-templates]
Docker
docker build --rm --tag datacatalog-custom-model-manager .
docker run --rm --tty \
--volume <CREDENTIALS-FILE-FOLDER>:/credentials --volume <CSV-FILE-FOLDER>:/data \
datacatalog-custom-model-manager load \
--files-folder <CSV-FILES-PATH> \
--project-id <YOUR-PROJECT-ID> --location-id <YOUR-LOCATION-ID> \
[--delete-existing-tag-templates]
Please make sure to take a moment and read the Code of
Conduct.
Please report bugs and suggest features via the GitHub
Issues.
Before opening an issue, search the tracker for possible duplicates. If you find a duplicate, please
add a comment saying that you encountered the problem as well.
Please make sure to read the Contributing
Guide
before making a pull request.