Consider the use of unclustered B + trees for external sorting. Let R denote the number of data records per disk block, and let F be the number of blocks in the data ?le. Estimate the cost of such a sorting procedure as a function of R and F. Compare this cost to merge-based external sorting. Consider the cases of R = 1, 10, and 100.
What will be an ideal response?
The cost of using unclustered B+ trees is 2R x F because we have to scan the leaves of the B+ tree and for each entry there to access the corresponding data ?le block twice (to read and then to write). When R increases, the cost increases.
Suppose we have M pages of main memory in which to do the sorting and that we use k-way merge. The cost of merge-based external sorting is 2F logk F /M . This cost does not depend on R.
When R = 1, B+ tree sorting is slightly better. For R = 10, merge-sort is likely to be better even for large ?les. For instance, if M = 10, 000 and R = 100, 000, 000 (i.e., 400G), then we can do 999-way merge and
log999(100000000/10000) < 2 < R = 10
You might also like to view...
If the same change needs to be made in several records, it is more efficient to use an update query to make the changes
Indicate whether the statement is true or false
You are assessing the value of insurance plans for a group of your clients. The data is contained in a table named "Insurance." The interest rates are contained in a field named "Interest." The numbers of payments per period are contained in a field named "Retain." The amount per payment, shown as a positive number, is contained in a field named "Amount." The field will be named "Value." What is
the proper entry for the field name? A) Value: FV([Insurance]![Interest], [Insurance]![Retain], -[Insurance]![Amount]) B) Value: FV(Insurance![Interest], Insurance![Retain], Insurance![Amount]) C) Value: FV([Interest], [Retain], -[Amount]) D) Value: FV([Insurance]![Interest], [Insurance]![Retain], [Insurance]![Amount])