Larger threshold value will produce a smaller index. See {@link TermPruningPolicy} for size vs performance considerations.
This implementation uses simple term frequency thresholds to remove all postings from documents where a given term occurs rarely (i.e. its TF in a document is smaller than the threshold).
Threshold values in this method are expressed as absolute term frequencies.
|
|
|
|
|
|