项目作者: vim89

项目描述 :
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
高级语言: Python
项目地址: git://github.com/vim89/datalake-etl-pipeline.git
创建时间: 2019-11-16T14:19:20Z
项目社区:https://github.com/vim89/datalake-etl-pipeline

开源协议:Apache License 2.0

下载