Examples of org.apache.hadoop.io.compress.bzip2.CBZip2InputStream

Package org.apache.hadoop.io.compress.bzip2

Examples of org.apache.hadoop.io.compress.bzip2.CBZip2InputStream

org.apache.hadoop.io.compress.bzip2.BZip2DummyCompressor
An input stream that decompresses from the BZip2 format (without the file header chars) to be read as any other stream.
The decompression requires large amounts of memory. Thus you should call the {@link #close() close()} method as soon as possible, to forceCBZip2InputStream to release the allocated memory. See {@link CBZip2OutputStream CBZip2OutputStream} for information about memoryusage.

CBZip2InputStream reads bytes from the compressed source stream via the single byte {@link java.io.InputStream#read() read()} method exclusively.Thus you should consider to use a buffered source stream.

This Ant code was enhanced so that it can de-compress blocks of bzip2 data. Current position in the stream is an important statistic for Hadoop. For example in LineRecordReader, we solely depend on the current position in the stream to know about the progess. The notion of position becomes complicated for compressed files. The Hadoop splitting is done in terms of compressed file. But a compressed file deflates to a large amount of data. So we have handled this problem in the following way. On object creation time, we find the next block start delimiter. Once such a marker is found, the stream stops there (we discard any read compressed data in this process) and the position is updated (i.e. the caller of this class will find out the stream location). At this point we are ready for actual reading (i.e. decompression) of data. The subsequent read calls give out data. The position is updated when the caller of this class has read off the current block + 1 bytes. In between the block reading, position is not updated. (We can only update the postion on block boundaries).

Instances of this class are not threadsafe.

        // stream
        in.reset();
      }
    }


    in = new CBZip2InputStream(in);
    return in;
  }

View Full Code Here

        // stream
        in.reset();
      }
    }


    in = new CBZip2InputStream(new BufferedInputStream(in));
    return in;
  }

View Full Code Here

  *
  * @throws java.lang.UnsupportedOperationException
  *             Throws UnsupportedOperationException
  */
  public Compressor createCompressor() {
    return new BZip2DummyCompressor();
  }

View Full Code Here

  *
  * @return Compressor
  */
  @Override
  public Compressor createCompressor() {
    return new BZip2DummyCompressor();
  }

View Full Code Here

  * This functionality is currently not supported.
  *
  * @return Compressor
  */
  public Compressor createCompressor() {
    return new BZip2DummyCompressor();
  }

View Full Code Here

   * @param conf configuration
   * @return the appropriate implementation of the bzip2 compressor.
   */
  public static Compressor getBzip2Compressor(Configuration conf) {
    return isNativeBzip2Loaded(conf)? 
      new Bzip2Compressor(conf) : new BZip2DummyCompressor();
  }

View Full Code Here

  *
  * @throws java.lang.UnsupportedOperationException
  *             Throws UnsupportedOperationException
  */
  public Compressor createCompressor() {
    return new BZip2DummyCompressor();
  }

View Full Code Here

  * This functionality is currently not supported.
  *
  * @return Compressor
  */
  public Compressor createCompressor() {
    return new BZip2DummyCompressor();
  }

View Full Code Here

   * @param conf configuration
   * @return the appropriate implementation of the bzip2 compressor.
   */
  public static Compressor getBzip2Compressor(Configuration conf) {
    return isNativeBzip2Loaded(conf)? 
      new Bzip2Compressor(conf) : new BZip2DummyCompressor();
  }

View Full Code Here

  * This functionality is currently not supported.
  *
  * @return Compressor
  */
  public Compressor createCompressor() {
    return new BZip2DummyCompressor();
  }

View Full Code Here

0 1 2 3 4 5 6 7 8 9

TOP

Related Classes of org.apache.hadoop.io.compress.bzip2.CBZip2InputStream

at.molindo.utils.io.StreamUtils

cc.twittertools.index.IndexStatuses

com.endgame.binarypig.util.BuildSequenceFileFromArchive

com.google.gwt.ant.taskdefs.TarCat$UntarCompressionMethod

com.google.refine.importing.ImportingUtilities

com.packetloop.packetpig.loaders.pcap.PcapRecordReader

com.packetloop.packetpig.loaders.pcap.StreamingPcapRecordReader

edu.umd.cloud9.collection.wikipedia.WikipediaPagesBz2InputStream

er.woinstaller.archiver.XarFile

mwt.wow.mpq.BZip2Reader

All source code are property of their respective owners. Java is a trademark of Sun Microsystems, Inc and owned by ORACLE Inc. Contact coftware#gmail.com.