This class defines the following configurable properties that may be set using either the System properties or using the {@link ReflectiveRandomIndexing#ReflectiveRandomIndexing(Properties)} constructor.
{@value #VECTOR_LENGTH_PROPERTY}
{@value #USE_SPARSE_SEMANTICS_PROPERTY}
This class implements {@link Filterable}, which allows for fine-grained control of which semantics are retained. The {@link #setSemanticFilter(Set)}method can be used to speficy which words should have their semantics retained. Note that the words that are filtered out will still be used in computing the semantics of other words. This behavior is intended for use with a large corpora where retaining the semantics of all words in memory is infeasible.
This class is thread-safe for concurrent calls of {@link #processDocument(BufferedReader) processDocument}. The {@link #getVector(String) getVector} method will only return valid reflectivevectors after the call to {@link #processSpace(Properties) processSpace}.
@author David Jurgens
|
|