Instead of using the formula derived in CURE—see Equation 9.19—we could run a Monte Carlo simulation to directly estimate the probability that a sample of size s would contain at least a certain fraction of the points from a cluster. Using a Monte Carlo simulation compute the probability that a sample of size s contains 50% of the elements of a cluster of size 100, where the total number of points is 1000, and where s can take the values 100, 200, or 500.
What will be an ideal response?
This question should have said “contains at least 50%.”
The results of our simulation consisting of 100,000 trials was 0, 0, and 0.54
for the sample sizes 100, 200, and 500 respectively.
Computer Science & Information Technology
You might also like to view...
The default sort order in reports is ________
Fill in the blank(s) with correct word
Computer Science & Information Technology
The keyboard shortcut for the Time format is ________
A) Ctrl + Shift + % B) Ctrl + Shift + # C) Ctrl + Shift + @ D) Ctrl + Shift + -
Computer Science & Information Technology