Project author: Valdecy
Project description:
ID3 Algorithm - A Decision Tree for Categorical Data with Pruning Methods
Language: Python
Project URL: git://github.com/Valdecy/Iterative-Dichotomiser-3.git
Iterative-Dichotomiser-3
The ID3 algorithm uses entropy and information gain to build a decision tree from categorical data. The function returns: 1) the decision tree rules; 2) the total number of rules. Its parameters are:
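As a rough illustration of the two measures named above, the sketch below computes entropy and information gain for one categorical attribute. The helper names `entropy` and `information_gain` and the toy data are assumptions made for this example; they are not taken from the repository's code.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(attribute_values, labels):
    """Entropy reduction obtained by splitting `labels` on one categorical attribute."""
    total = len(labels)
    # Group the target labels by the attribute value they co-occur with.
    groups = {}
    for value, label in zip(attribute_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted average entropy of the subsets produced by the split.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Hypothetical toy data: "outlook" separates the classes perfectly, "windy" does not.
outlook = ["sunny", "sunny", "rain", "rain"]
windy = ["yes", "no", "yes", "no"]
play = ["no", "no", "yes", "yes"]

print(information_gain(outlook, play))
print(information_gain(windy, play))
```

ID3 picks, at each node, the attribute with the highest information gain, so in this toy case `outlook` would be chosen over `windy`.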
- Xdata = Dataset Attributes
- ydata = Dataset Target
- pre_pruning = "none" or "chi_2". If "chi_2" is selected, a pre-pruning method based on a chi-squared test is performed. If the contingency table is 2x2 and has fewer than 10,000 examples, Fisher's exact test is performed instead.
- Finally, a prediction function, prediction_dt_id3( ), is also included.
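To make the "chi_2" pre-pruning idea more concrete, here is a minimal sketch of a chi-squared independence check between one attribute and the target. It is not the repository's implementation: the names `chi2_statistic` and `keep_split`, the fixed 5% significance level, and the small critical-value table are all assumptions for illustration, and the Fisher's exact test fallback for small 2x2 tables is not shown.

```python
from collections import Counter

# Chi-squared critical values at the 5% level, keyed by degrees of freedom.
# Assumption: only tables with up to 4 degrees of freedom are covered here.
CHI2_CRITICAL_05 = {1: 3.841, 2: 5.991, 3: 7.815, 4: 9.488}


def chi2_statistic(attribute_values, labels):
    """Pearson chi-squared statistic and degrees of freedom for attribute vs. target."""
    n = len(labels)
    row_totals = Counter(attribute_values)
    col_totals = Counter(labels)
    observed = Counter(zip(attribute_values, labels))
    stat = 0.0
    for row, row_n in row_totals.items():
        for col, col_n in col_totals.items():
            expected = row_n * col_n / n  # count expected under independence
            stat += (observed[(row, col)] - expected) ** 2 / expected
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, dof


def keep_split(attribute_values, labels):
    """Hypothetical pre-pruning rule: keep the split only if it is significant at 5%."""
    stat, dof = chi2_statistic(attribute_values, labels)
    if dof == 0:
        return False  # a single attribute value or a single class: nothing to split
    return stat > CHI2_CRITICAL_05[dof]


# Hypothetical data: `informative` tracks the target exactly, `noise` is unrelated to it.
target = ["yes"] * 20 + ["no"] * 20
informative = ["a"] * 20 + ["b"] * 20
noise = ["a", "b"] * 20

print(keep_split(informative, target))
print(keep_split(noise, target))
```

Under this rule a branch is only grown when the attribute and the target are unlikely to be independent, which stops the tree from adding rules that fit noise.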