For sparse data, discuss why considering only the presence of non-zero values might give a more accurate view of the objects than considering the actual magnitudes of values. When would such an approach not be desirable?

What will be an ideal response?

For sparse data, discuss why considering only the presence of non-zero values
might give a more accurate view of the objects than considering the actual
magnitudes of values. When would such an approach not be desirable?Consider document data. Intuitively, two documents are similar if they con-
tain many of the same words. Although we can also include the frequency

with which those words occur in the similarity computation, this can some-
times give a less reliable assessment of similarity. In particular, if one of

the words in a document occurs rather frequently compared to other words,
then this word can dominate the similarity comparison when magnitudes are
taken into account. In that case, the document will only be highly similar
to other documents that also contain the same word with a high frequency.
While this may be appropriate in many or even most cases, it may lead to the
wrong conclusion if the word can appear in different contexts, that can only
be distinguished by other words. For instance, the word, ‘game,’ appears
frequently in discussions of sports and video games.

Computer Science & Information Technology

You might also like to view...

The INDEX function ________ returns the reference of a cell at the intersection of a specified row and column

A) table B) array C) reference D) name

View

Computer Science & Information Technology

A workstation is not booting correctly. The technician wants to locate bad sectors and recover readable information. Which of the following commands would the technician use?

A. chkdsk c: /f B. chkdsk c: /i C. chkdsk c: /c D. chkdsk c: /r

View

Computer Science & Information Technology