This class reads in string encoded training samples, parses them and outputs {@link DocumentSample} objects.
Format: Each line contains one sample document. The category is the first string in the line followed by a tab and whitespace separated document tokens. Sample line: category-string tab-char whitespace-separated-tokens line-break-char(s)
All source code are property of their respective owners. Java is a trademark of Sun Microsystems, Inc and owned by ORACLE Inc. Contact coftware#gmail.com.