Data Science

How to determine informative attributes of a data set?

start with the parent/root and calculate the entropy based on the occurance of the target variable in the data set.
Iteratively split the data set on each attribute and calculate the information gain that results from that split.
Choose the attribute which decreases the entropy the most.
Repeat

Diskussion