An analysis of data limitation for Predictive Analysis decision making course.
The code proposes an analysis of multiple dataset and the multiple decision trees which arise from different testing datasets, in order to determine their evolution and mainly insights regarding the lack of data.
Calculation have been performed using all possible splitting or the law of big numbers applied to random picking.