Build a decision tree on the data set. Does the tree capture the “+” and “?” concepts?
Following is a data set that contains two attributes, X and Y , and two class
labels, “+” and “?”. Each attribute can take three different values: 0, 1, or 2.
The concept for the “+” class is Y = 1 and the concept for the “?” class is
X = 0 ? X = 2.
There are 30 positive and 600 negative examples in the data. Therefore,
at the root node, the error rate is
Eorig = 1 ? max(30/630, 600/630) = 30/630.
If we split on X, the gain in error rate is:
If we split on Y , the gain in error rate is:
Therefore, X is chosen to be the first splitting attribute. Since the
X = 1 child node is pure, it does not require further splitting. We may
use attribute Y to split the impure nodes, X = 0 and X = 2, as follows:
• The Y = 0 and Y = 2 nodes contain 100 ? instances.
• The Y = 1 node contains 100 ? and 10 + instances.
In all three cases for Y , the child nodes are labeled as ?. The resulting
concept is
You might also like to view...
In Access SQL statements, any table name or field name that has spaces must be enclosed in ________
A) Double Angle Brackets << >> B) Single Angle Brackets < > C) Parentheses ( ) D) Square brackets [ ]
How can change adversely affect your network?
What will be an ideal response?