The part of a data structure (abstract data type) that stores a collection of elements is usually an array, a(n) _____________, or a combination of both.

A. linked list
B. Node struct
C. NULL unit
D. set of operations, such as insert or delete

Computer Science & Information Technology

You might also like to view...

A relationship where the whole is responsible for the disposition of its parts is called

a) generalization/specialization. b) association. c) composition. d) aggregation.

View

Computer Science & Information Technology

For each of the following types of data or clusters, discuss briefly if (1) sam- pling will cause problems for this approach and (2) what those problems are. Assume that the sampling technique randomly chooses points from the to- tal set of m points and that any unmentioned characteristics of the data or clusters are as optimal as possible. In other words, focus only on problems caused by the particular characteristic mentioned. Finally, assume that K is very much less than m.

Hierarchical clustering algorithms require O(m2 log(m)) time, and conse- quently, are impractical to use directly on larger data sets. One possible technique for reducing the time required is to sample the data set. For ex- ample, if K clusters are desired and ?m points are sampled from the m points, then a hierarchical clustering algorithm will produce a hierarchical clustering in roughly O(m) time. K clusters can be extracted from this hier- archical clustering by taking the clusters on the Kth level of the dendrogram. The remaining points can then be assigned to a cluster in linear time, by using various strategies. To give a specific example, the centroids of the K clusters can be computed, and then each of the m ? ?m remaining points can be assigned to the cluster associated with the closest centroid. (a) Data with very different sized clusters. (b) High-dimensional data. (c) Data with outliers, i.e., atypical points. (d) Data with highly irregular regions. (e) Data with globular clusters. (f) Data with widely different densities. (g) Data with a small percentage of noise points. (h) Non-Euclidean data. (i) Euclidean data. (j) Data with many and mixed attribute types.

View

Computer Science & Information Technology