项目作者: AndrewReaganM

项目描述 :
Time Series Data Cleaning by Predicting Missing Data Values
高级语言: Jupyter Notebook
项目地址: git://github.com/AndrewReaganM/msci_ml.git
创建时间: 2019-10-28T04:39:08Z
项目社区:https://github.com/AndrewReaganM/msci_ml

开源协议:

下载


Time Series Data Cleaning by Predicting Missing Data Values

The objective of this project is to predict data values that are missing from
a stock dataset using various machine learning techniques.

Data Courtesy of borismarjanovic on Kaggle: https://www.kaggle.com/borismarjanovic/price-volume-data-for-all-us-stocks-etfs

Project Asset Structure

The files in this project are organized in the following structure:

src/preparation

This directory contains the data import code that is used to pull the dataset
from disk into memory.

src/processing

This directory contains the data processing code that prepares the data for use
with machine learning models. This code will likely be used after the code in
src/preparation is used to pull the data from disk.

data/raw

This diretory contains the raw data imported from the source.

data/processed

This directory contains intermediary pre-processed data.

in progress