项目作者: hzdr

项目描述 :
Small collection of dvc tutorials
高级语言: Python
项目地址: git://github.com/hzdr/dvc_tutorial_series.git
创建时间: 2021-06-04T15:02:38Z
项目社区:https://github.com/hzdr/dvc_tutorial_series

开源协议:BSD 3-Clause "New" or "Revised" License

下载


This is the codepart of the article series I published on medium [here]. (https://nsultova.medium.com/exploring-dvc-for-machine-learning-pipelines-in-research-part-1-3ebc2ca35a18)

During the past months part of my job became looking at different tools to manage machine learning workflows for our team at HelmholtzAI.

A lot of material accumulated on the way, thus I decided to share some of the process and what I’ve learned.

This repository contains tutorials and code centered around DVC which became one of our favourite candidates.

Code structure

  1. .
  2. ├── README.md
  3. ├── content
  4. ├── PART_00.md
  5. └── EXAMPLE.md
  6. └── RESSOURCES.md
  7. └── src
  8. ├── assets
  9. ├── config.py
  10. ├── create_dataset.py
  11. ├── create_features.py
  12. ├── environment.yml
  13. ├── evaluate_model.py
  14. ├── params.yaml
  15. ├── train_model.py
  16. └── wine-quality.csv
  • /content: contains the articles and additional informations
  • /assets: directiory where (intermediate) results are being written to
  • environment.yml - can be used to create a conda environment or just to look up which dependencies are needed
  • config.py - handle paths and other variables, makes eventual expanding less cumbersome
  • params.yaml - used by .dvc, custom parameters can be set here

..the rest should be self-explanatory.

Setup

Make sure you have some recent python version installed. (I run Python 3.9.1 within an conda environment on an macOS Big Sur 11.4 as of this writing).

I’d highly recommend to use any flavour of virtual environments (conda, venv, ..) for following along with this tutorial. (Except you’re a *-BSD or NixOS user, in thus case I assume you know your way around these issues anyway ^^ ).

Clone the repo, make sure the dependencies are installed and you’re good to go!

Look into /content/PART_01, section DVC Tutorial for more information.