项目作者: shotarok

项目描述 :
luigi workflows to evaluate models trained by vowpal wabbit
高级语言: Python
项目地址: git://github.com/shotarok/vw-luigi.git
创建时间: 2016-05-01T23:22:01Z
项目社区:https://github.com/shotarok/vw-luigi

开源协议:MIT License

下载


vw-luigi

luigi workflows to evaluate models trained by vowpal wabbit.

gif

Installation of Prerequisite softwares

If you’d like to use vw-luigi, you need to install vowpal wabbit and some python modules.

vowpal wabbit

See https://github.com/JohnLangford/vowpal_wabbit.
If you use OSX, you can install vowpal-wabbit through homebrew.

  1. brew install vowpal-wabbit

python modules

Workflows in vw-luigi depend on luigi, numpy and scikit-learn. You can install required modules through pip.

  1. pip install -r requirements.txt

Usage Example

In case you use /tmp/work/space/train.vw as training data, /tmp/work/space/test.vw as test data and squared loss as loss function, you can get the evaluation result, which includes AUROC, AUPR and LossLoss calculated by scikit-learn, following to below commands.

  1. $ cd vw-luigi
  2. $ ls /tmp/work/space
  3. > train.vw test.vw
  4. $ python -m luigi --module vwluigi EvalTask --loss-func squared --work-dir /tmp/work/space --local-scheduler
  5. > ...
  6. $ ls /tmp/work/space
  7. > model.vw predict.vw result.txt train.vw
  8. $ cat /tmp/work/space/result.txt
  9. > AUROC:0.88060 AUPR:0.72192 LOGLOSS:0.36215

If you are interested in vw-luigi, please see this blog post “‘Kaggle Display Advertising Challenge’ working with vw-luigi”. I wrote another usage example using ‘Kaggle Display Advertising Challenge Dataset’ provided by Critio.

Release History

  • 0.1.0
    • The first proper release

Meta

Distributed under the MIT license. See LICENSE for more information.
Author: Shotaro Kohama - tw: @shotarok28