What might be the purpose of this transformation?

Consider a document-term matrix, where tfij is the frequency of the i

th word

(term) in the jth document and m is the number of documents. Consider

the variable transformation that is defined by



Consider a document-term matrix, where tfij is the frequency of the i

th word

(term) in the jth document and m is the number of documents. Consider

the variable transformation that is defined by

This normalization reflects the observation that terms that occur in
every document do not have any power to distinguish one document
from another, while those that are relatively rare do.

Computer Science & Information Technology

You might also like to view...

________ show trends over time

Fill in the blank(s) with correct word

Computer Science & Information Technology

If you find a suspected bomb in a building, you should alert dispatch by

a. portable radio as soon as the item is located. b. telephone, do not transmit radio within 500 feet of bomb. c. portable radio only after inspecting the item to make sure that it is not electrically wired. d. the quickest and easiest means available whether it be radio or telephone.

Computer Science & Information Technology