Project author: yandexdataschool

Project description:
Problem for IDAO 2019 on LHCb Muon Identification
Primary language: Jupyter Notebook
Project URL: git://github.com/yandexdataschool/IDAO-2019-muon-id.git
Created: 2018-11-05T22:49:57Z
Project community: https://github.com/yandexdataschool/IDAO-2019-muon-id

License: Apache License 2.0



Baseline solutions for the Online Round of IDAO-2019

Baseline - simple.ipynb contains a naive xgboost solution and creates a model file for Track 2

Baseline - advanced.ipynb contains a primer on jagged arrays processing and creates a model file for Track 2
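
As a rough illustration of what "jagged arrays" means here, the sketch below stores variable-length rows as one flat list plus an offsets list, then pads them to a fixed width so they can be used as model features. It is not taken from the notebook or from utils.py; the function names (to_flat, row_slice, pad_rows) and the choice of padding value are assumptions made for this example.

```python
from itertools import accumulate


def to_flat(rows):
    """Flatten variable-length rows into (values, offsets).

    Row i occupies values[offsets[i]:offsets[i + 1]].
    """
    values = [v for row in rows for v in row]
    offsets = [0] + list(accumulate(len(row) for row in rows))
    return values, offsets


def row_slice(values, offsets, i):
    """Return row i of the jagged layout as a list."""
    return values[offsets[i]:offsets[i + 1]]


def pad_rows(values, offsets, width, fill=0.0):
    """Convert the jagged layout to fixed-width rows.

    Rows longer than `width` are truncated; shorter rows are padded
    with `fill` (a hypothetical default chosen for this sketch).
    """
    padded = []
    for i in range(len(offsets) - 1):
        row = row_slice(values, offsets, i)[:width]
        padded.append(row + [fill] * (width - len(row)))
    return padded


# Three rows with 2, 0 and 3 entries respectively.
values, offsets = to_flat([[1.0, 3.0], [], [2.0, 4.0, 6.0]])
fixed = pad_rows(values, offsets, width=2)
```

The flat-plus-offsets layout keeps all values contiguous, which is the usual reason to prefer it over a list of lists when the same structure has to be reproduced in C++.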

track_2_baseline_advanced_cpp contains an example submission for
Track 2 with catboost and jagged arrays in C++. Run make all to
build the baseline binary, which accepts data as .csv (not
gzipped) on standard input and writes predictions to standard
output. Run make compute_features to build a utility binary that
preprocesses the data and outputs the computed features.
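
The standard input / standard output contract described above can be exercised from Python with a small helper such as the following sketch. The helper name run_on_csv is an assumption for illustration, and so is the binary path in the usage note; neither comes from the repository.

```python
import subprocess


def run_on_csv(command, csv_text):
    """Feed CSV text to `command` on stdin and return its stdout as text.

    `command` is an argument list such as ["./baseline"]; the exact
    binary name is an assumption and depends on what `make all` produces.
    """
    result = subprocess.run(
        command,
        input=csv_text,       # uncompressed CSV, as the binary expects
        capture_output=True,  # collect predictions from stdout
        text=True,
        check=True,           # raise if the process exits with an error
    )
    return result.stdout
```

A call might look like run_on_csv(["./baseline"], open("test.csv").read()), where both the binary path and the file name are placeholders.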

track_2_baseline_simple_python contains an example submission for
Track 2 with xgboost and Python. It is quite slow at reading the data and
does not use jagged arrays.

scoring.py contains an implementation of the quality metric

utils.py contains data reading and jagged arrays processing utilities

Evaluate advanced baseline.ipynb compares Python and C++ predictions for Track 2
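
One simple way to compare two sets of predictions is to match them by row identifier and report the largest absolute difference. The sketch below does that for CSV text; it is not the notebook's code, and the column names id and prediction are assumptions about the output format.

```python
import csv
import io


def read_predictions(text, id_col="id", pred_col="prediction"):
    """Parse CSV text into {id: prediction}; column names are assumed."""
    reader = csv.DictReader(io.StringIO(text))
    return {row[id_col]: float(row[pred_col]) for row in reader}


def max_abs_diff(a, b):
    """Largest absolute difference between two prediction mappings."""
    if a.keys() != b.keys():
        raise ValueError("prediction sets cover different ids")
    return max((abs(a[k] - b[k]) for k in a), default=0.0)


# Toy data standing in for the Python and C++ outputs.
python_out = "id,prediction\n1,0.25\n2,0.75\n"
cpp_out = "id,prediction\n1,0.25\n2,0.5\n"
diff = max_abs_diff(read_predictions(python_out), read_predictions(cpp_out))
```

In practice a small nonzero difference is expected from floating-point effects, so the result is usually checked against a tolerance rather than against zero.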