Package de.jungblut.crawl.extraction

Examples of de.jungblut.crawl.extraction.OutlinkExtractor


  }

  public static void main(String[] args) throws InterruptedException,
      ExecutionException, IOException {
    String seedUrl = "http://news.google.de/";
    new MultithreadedCrawler<>(1000, new OutlinkExtractor(),
        new SequenceFileResultWriter<>()).process(seedUrl);
  }
View Full Code Here

TOP

Related Classes of de.jungblut.crawl.extraction.OutlinkExtractor

Copyright © 2018 www.massapicom. All rights reserved.
All source code are property of their respective owners. Java is a trademark of Sun Microsystems, Inc and owned by ORACLE Inc. Contact coftware#gmail.com.