Package net.bpiwowar.mg4j.extensions.warc

Examples of net.bpiwowar.mg4j.extensions.warc.WarcRecord$WarcHeader


     * Parses a document from a WARC collection
     *
     * @throws java.io.IOException
     */
    private void parseWAR018CDocument() throws IOException {
      WarcRecord warcRecord = null;
      DataInputStream dis = new DataInputStream(rawContent);

      // Regardless of what the stream gives us, we read and return
      // the first entry which is a response.
      WarcHTMLResponseRecord warcResponse = null;
      while ((warcRecord = WarcRecord.readNextWarcRecord(dis)) != null) {
        // ignore if no WARC response type, otherwise read and finish
        if (warcRecord.getHeaderRecordType().equals("response")) {
          warcResponse = new WarcHTMLResponseRecord(warcRecord);
          break;
        }
      }

View Full Code Here

TOP

Related Classes of net.bpiwowar.mg4j.extensions.warc.WarcRecord$WarcHeader

Copyright © 2018 www.massapicom. All rights reserved.
All source code are property of their respective owners. Java is a trademark of Sun Microsystems, Inc and owned by ORACLE Inc. Contact coftware#gmail.com.