By default, the {@link WordReader} provided by this factoryis just a {@link FastBufferedReader}, but you can specify an alternative word reader using the property {@link it.unimi.dsi.mg4j.document.PropertyBasedDocumentFactory.MetadataKeys#WORDREADER}. For instance, if you need to index a list of identifiers to retrieve documents from the collection more easily, you can use a {@link LineWordReader}to index each line of a file as a whole.
|
|