Consider the (relative distance) K-means scheme for outlier detection described in Section 10.5 and the accompanying figure, Figure 10.10.

(a) The points at the bottom of the compact cluster shown in Figure 10.10
have a somewhat higher outlier score than those points at the top of
the compact cluster. Why?
(b) Suppose that we choose the number of clusters to be much larger, e.g., 10.
Would the proposed technique still be effective in finding the most extreme
outlier at the top of the figure? Why or why not?
(c) The use of relative distance adjusts for differences in density. Give an
example of where such an approach might lead to the wrong conclusion.

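(a) Point D pulls the mean of the points (the cluster centroid) somewhat upward
from the center of the compact cluster, so the points at the bottom lie
farther from the centroid than the points at the top.
(b) No. This point would become a cluster by itself, so it would coincide with
its own centroid and would no longer stand out.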
(c) When absolute distances are important. For example, consider heart rate
monitors for patients. If the heart rate goes above or below a specified
range of values, this has a physical meaning. It would be incorrect not to
flag every patient outside that range as abnormal, even if there is a group
of patients who are similar to each other and all have abnormal heart rates.
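
The effects described in (a) and (b) can be checked with a small sketch. It assumes the relative distance of a point is its distance to the cluster centroid divided by the median distance of all points in that cluster to the centroid. The coordinates are made up for illustration and are not the data behind Figure 10.10; the function name `relative_distance_scores` and the rule that a zero median gives a score of 0 are likewise assumptions.

```python
import math
from statistics import median


def relative_distance_scores(points):
    """Score the points of ONE cluster: distance to centroid / median distance.

    Hypothetical helper; assumes all points already belong to the same cluster.
    """
    dims = len(points[0])
    centroid = tuple(sum(p[d] for p in points) / len(points) for d in range(dims))
    dists = [math.dist(p, centroid) for p in points]
    med = median(dists)
    # Assumption: if the median distance is zero (e.g. a singleton cluster),
    # every point gets score 0 instead of dividing by zero.
    scores = [d / med if med > 0 else 0.0 for d in dists]
    return centroid, scores


# A compact 3x3 cluster around the origin plus one far-away point D above it.
compact = [(x, y) for y in (-1, 0, 1) for x in (-1, 0, 1)]
D = (0, 10)

# Case (a): D is assigned to the compact cluster and drags the centroid upward.
centroid, scores = relative_distance_scores(compact + [D])
score_of = dict(zip(compact + [D], scores))
print("centroid:", centroid)
print("bottom point score:", score_of[(0, -1)])
print("top point score:   ", score_of[(0, 1)])
print("D score:           ", score_of[D])

# Case (b): with many clusters, D may end up alone in its own cluster.
_, solo_scores = relative_distance_scores([D])
print("D score as a singleton cluster:", solo_scores[0])
```

In this toy setup the centroid moves up toward D, so the bottom points score higher than the top points while D still has the largest score. Once D forms its own cluster its distance to the centroid is zero and it is no longer flagged.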
