Object that maps between WARC-TREC-IDs (String identifiers) to docnos (sequentially-numbered ints). This object provides mappings for the Clue Web English collection; the docnos are numbered from part 1 all the way through part 10.
Note that this class needs the data file docno.mapping
, loaded via the {@link #loadMapping(Path,FileSystem)} method.
|
|
|
|