Package org.jasen.core.parsers

Examples of org.jasen.core.parsers.StandardHTMLParser


  public String[] tokenize(URL url) throws JasenException {
    assertReaderAvailable();

    String html = urlReader.readURL(url);
    StandardHTMLParser htmlParser = new StandardHTMLParser();
    String text = htmlParser.extractText(html);
    String[] tokens = text.split(" ");
    return tokens;
  }
View Full Code Here

TOP

Related Classes of org.jasen.core.parsers.StandardHTMLParser

Copyright © 2018 www.massapicom. All rights reserved.
All source code are property of their respective owners. Java is a trademark of Sun Microsystems, Inc and owned by ORACLE Inc. Contact coftware#gmail.com.