Create a data set that contains only the following asymmetric binary at- tributes: (Weather = Bad, Driver’s condition = Alcohol-impaired, Traffic violation = Yes, Seat Belt = No, Crash Severity = Major). For Traffic violation, only None has a value of 0. The rest of the attribute values are assigned to 1. Assuming that support threshold is 30%, how many candidate and frequent itemsets will be generated?

onsider the traffic accident data set shown in Table 7.1.

The binarized data is shown in Table 7.3.



The number of candidate itemsets from size 1 to size 3 is 5+10+0 = 15.

The number of frequent itemsets from size 1 to size 3 is 5 + 3 + 0 = 8.

Computer Science & Information Technology

You might also like to view...

Which of the following is true?

a. Pseudocode is used to describe an algorithm. b. Pseudocode is not an actual computer programming language. c. Pseudocode is used to describe executable statements that will eventually be translated by the programmer into a program. d. All of the above.

Computer Science & Information Technology

A note inserted into the text of a research paper that refers the reader to a source in the bibliography.

What will be an ideal response?

Computer Science & Information Technology