This repository contains the working files and notebook used as mentioned in the article published on Medium.
Link to the article: https://towardsdatascience.com/high-number-of-unique-values-and-tree-based-models-c30167b63edd
Due to some error in rendering on github, you may not be able to view the notebooks. A workaround to view the notebook is to use a nbviewer: https://nbviewer.jupyter.org/
- Workbook for creating the original income classifcation tree.
- Workbook for creating the dummy decision tree example.
- Full tree schema from the income classification tree.
- Data source used from https://www.kaggle.com/lodetomasi1995/income-classification