{@code DocumentVectorBuilder} generates {@code Vector} representations of adocument, based on semantic {@code Vector}s provided for a {@code SemanticSpace}. This can be consider as a projecting the document into the semantic space. Documents will be tokenized using the current tokenizing method, and the vector in the {@code SemanticSpace} corresponding to eachword found in the document will be combined together. Options for combining term {@code Vector}s include summation, average, and term frequency weighting.
@author Keith Stevens
All source code are property of their respective owners. Java is a trademark of Sun Microsystems, Inc and owned by ORACLE Inc. Contact coftware#gmail.com.