项目作者: pavelmaksimov

项目描述 :
ETL flow framework based on Yaml configs in Python
高级语言: Python
项目地址: git://github.com/pavelmaksimov/FlowMaster.git
创建时间: 2021-04-30T11:46:05Z
项目社区:https://github.com/pavelmaksimov/FlowMaster

开源协议:GNU General Public License v3.0

下载


logo

ETL framework based on Yaml configs in Python

Supported Python Versions
License
Code style: black

A light framework for creating data streams.
Setting up streams through configuration in the Yaml file.
There is a schedule, task pools, concurrency limitation.
Works quickly, does not require a lot of resources.
Runs on Windows and Linux.
Flow run in parallel via threading library.
Internally SQLite Database.
Native data transformation.
There is a web interface.

At the moment there are connectors to sources

  • CSV file
  • SQLite
  • Postgres
  • MySQL
  • Yandex Metrika Management API
  • Yandex Metrika Stats API
  • Yandex Metrika Logs API
  • Yandex Direct API
  • Yandex Direct Report API
  • Criteo
  • Google Sheets

Storages

  • Save to csv file
  • Clickhouse

Documentation

Requirements

  • python >=3.9
  • virtual environment

Settings

It is highly recommended to install in a virtual environment.

Flowmaster needs a home, ‘{HOME}/FlowMaster’ is the default,\
but you can lay foundation somewhere else if you prefer\
(optional)

For Windows

  1. setx FLOWMASTER_HOME "{YOUR_PATH}"

For Linux

  1. export FLOWMASTER_HOME={YOUR_PATH}

Installing

  1. pip install flowmaster==0.7.1
  2. # For install web UI.
  3. pip install flowmaster[webui]==0.7.1
  4. # Optional libraries.
  5. pip install flowmaster[clickhouse,postgres,mysql,yandexdirect,yandexmetrika,criteo,googlesheets]==0.7.1

Run

  1. flowmaster run --help
  2. flowmaster run

WEB UI

http://localhost:8822

CHANGELOG

Support

Telegram support chat

Author

Pavel Maksimov

My contacts
Telegram,
Facebook

Удачи тебе, друг! Поставь звездочку ;)