Object that keeps track of the length of each document in the collection as a four-byte integer (int). Document lengths are measured in number of terms.
Document length data is stored in a serialized data file, in the following format:
Since the documents are numbered sequentially starting at d + 1, each int corresponds unambiguously to a particular document.
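The docno-to-length lookup described above can be illustrated with a minimal sketch. It is not the actual implementation: the class name DocLengthSketch and its methods are hypothetical, and the byte layout is an assumption (an int offset d, an int count n, then n ints, one length per document), not a confirmed description of the real file format.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical sketch: the name and the serialized layout are assumptions.
class DocLengthSketch {
  private final int offset;    // d: the first docno in the collection is d + 1
  private final int[] lengths; // lengths[i] is the length (in terms) of docno d + 1 + i

  DocLengthSketch(int offset, int[] lengths) {
    this.offset = offset;
    this.lengths = lengths.clone();
  }

  // Serializes using the ASSUMED layout: d, n, then n ints.
  byte[] toBytes() {
    ByteArrayOutputStream bytes = new ByteArrayOutputStream();
    try (DataOutputStream out = new DataOutputStream(bytes)) {
      out.writeInt(offset);
      out.writeInt(lengths.length);
      for (int len : lengths) {
        out.writeInt(len);
      }
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
    return bytes.toByteArray();
  }

  // Reads the same ASSUMED layout back into memory.
  static DocLengthSketch fromBytes(byte[] data) {
    try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(data))) {
      int d = in.readInt();
      int n = in.readInt();
      int[] lengths = new int[n];
      for (int i = 0; i < n; i++) {
        lengths[i] = in.readInt();
      }
      return new DocLengthSketch(d, lengths);
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  // Docnos start at d + 1, so docno k is stored at array index k - d - 1.
  int getDocLength(int docno) {
    return lengths[docno - offset - 1];
  }

  int getDocCount() {
    return lengths.length;
  }
}
```

Because docnos are sequential, the table needs no per-document keys: subtracting the offset turns a docno directly into an array index, so each lookup is a single array access.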
@author Jimmy Lin