Examples of VectorSpaceModel

kipedia.org/wiki/Vector_space_model">Vector Space Model (VSM). This model was first based on the paper

G. Salton, A. Wong, and C. S. Yang (1975), "A Vector Space Model for Automatic Indexing," Communications of the ACM, vol. 18, nr. 11, pages 613–620. Available here

The VSM first processes documents into a word-document matrix where each unique word is a assigned a row in the matrix, and each column represents a document. The values of ths matrix correspond to the number of times the row's word occurs in the column's document. Optionally, after the matrix has been completely, its values may be transformed. This is frequently done using the {@link edu.ucla.sspace.matrix.TfIdfTransform Tf-Idf Transform}.

This class offers one configurable parameter.

Property: {@value #MATRIX_TRANSFORM_PROPERTY} Default: none: This variable sets the preprocessing algorithm to use on the term-document matrix. The property value should be the fully qualified named of a class that implements {@link Transform}. The class should be public, not abstract, and should provide a public no-arg constructor.

This class is thread-safe for concurrent calls of {@link #processDocument(BufferedReader) processDocument}. Once {@link #processSpace(Properties) processSpace} has been called, no further calls to{@code processDocument} should be made. This implementation does not supportaccess to the semantic vectors until after {@code processSpace} has beencalled. @see Transform @author David Jurgens

Examples of VectorSpaceModel

Examples of edu.ucla.sspace.vsm.VectorSpaceModel