List of readTextFile() Examples

Examples of readTextFile()

com.cloudera.crunch.Pipeline.readTextFile()
com.cloudera.crunch.impl.mr.MRPipeline.readTextFile()
eu.stratosphere.api.java.ExecutionEnvironment.readTextFile()
ocal/file" or "hdfs://host:port/file/path"). @return A DataSet that represents the data read from the given file as text lines.
limelight.io.FileSystem.readTextFile()
org.apache.crunch.Pipeline.readTextFile()
A convenience method for reading a text file.
org.apache.crunch.impl.mr.MRPipeline.readTextFile()
org.apache.crunch.impl.spark.SparkPipeline.readTextFile()
org.apache.flink.api.java.ExecutionEnvironment.readTextFile()
ocal/file" or "hdfs://host:port/file/path"). @return A DataSet that represents the data read from the given file as text lines.
org.apache.flink.streaming.api.environment.LocalStreamEnvironment.readTextFile()
org.platformlayer.ops.OpsTarget.readTextFile()

Examples of com.cloudera.crunch.impl.mr.MRPipeline.readTextFile()

public class CogroupCrunchTest implements Serializable {
  
  @Test
  public void test() throws IOException {
    Pipeline pipeline = new MRPipeline(CogroupCrunchTest.class);
    PCollection<String> a = pipeline.readTextFile("join/A");
    PCollection<String> b = pipeline.readTextFile("join/B");
    
    PTable<String, String> aTable = a.parallelDo(new DoFn<String, Pair<String, String>>() {
    @Override
    public void process(String input, Emitter<Pair<String, String>> emitter) {

View Full Code Here

Examples of com.cloudera.crunch.impl.mr.MRPipeline.readTextFile()

  
  @Test
  public void test() throws IOException {
    Pipeline pipeline = new MRPipeline(CogroupCrunchTest.class);
    PCollection<String> a = pipeline.readTextFile("join/A");
    PCollection<String> b = pipeline.readTextFile("join/B");
    
    PTable<String, String> aTable = a.parallelDo(new DoFn<String, Pair<String, String>>() {
    @Override
    public void process(String input, Emitter<Pair<String, String>> emitter) {
      Iterator<String> split = Splitter.on('\t').split(input).iterator();

View Full Code Here

Examples of eu.stratosphere.api.java.ExecutionEnvironment.readTextFile()

    
    // construct the plan it will be multiple flat maps, all unioned
    // and the "unioned" dataSet will be grouped
    final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
    
    DataSet<String> source = env.readTextFile(IN_FILE);
    DataSet<Tuple2<String, Integer>> lastUnion = source.flatMap(new DummyFlatMap());
  
    for (int i = 1; i< NUM_INPUTS; i++){
      lastUnion = lastUnion.union(source.flatMap(new DummyFlatMap()));
    }

View Full Code Here

Examples of limelight.io.FileSystem.readTextFile()

  {
    final FileSystem fs = new FileSystem();
    if(!fs.exists(".testClasses"))
      throw new RuntimeException(".testClasses file is missing");


    final String classesContent = fs.readTextFile(".testClasses");
    return classesContent.split("(\r\n|\n)");
  }
}

View Full Code Here

Examples of org.apache.crunch.Pipeline.readTextFile()

      return 1;
    }
    // Create an object to coordinate pipeline creation and execution.
    Pipeline pipeline = new MRPipeline(AverageBytesByIP.class, getConf());
    // Reference a given text file as a collection of Strings.
    PCollection<String> lines = pipeline.readTextFile(args[0]);


    // Aggregator used for summing up response size and count
    Aggregator<Pair<Long, Long>> agg = pairAggregator(SUM_LONGS(), SUM_LONGS());


    // Table of (ip, sum(response size), count)

View Full Code Here

Examples of org.apache.crunch.Pipeline.readTextFile()

      return 1;
    }
    // Create an object to coordinate pipeline creation and execution.
    Pipeline pipeline = new MRPipeline(TotalWordCount.class, getConf());
    // Reference a given text file as a collection of Strings.
    PCollection<String> lines = pipeline.readTextFile(args[0]);


    // Define a function that splits each line in a PCollection of Strings into
    // a
    // PCollection made up of the individual words in the file.
    PCollection<Long> numberOfWords = lines.parallelDo(new DoFn<String, Long>() {

View Full Code Here

Examples of org.apache.crunch.Pipeline.readTextFile()

      return 1;
    }
    // Create an object to coordinate pipeline creation and execution.
    Pipeline pipeline = new MRPipeline(SecondarySortExample.class, getConf());
    // Reference a given text file as a collection of Strings.
    PCollection<String> lines = pipeline.readTextFile(args[0]);


    // Define a function that parses each line in a PCollection of Strings into
    // a pair of pairs, the first of which will be grouped by (first member) and
    // the sorted by (second memeber). The second pair is payload which can be
    // passed in an Iterable object.

View Full Code Here

Examples of org.apache.crunch.Pipeline.readTextFile()

      return 1;
    }
    // Create an object to coordinate pipeline creation and execution.
    Pipeline pipeline = new MRPipeline(AverageBytesByIP.class, getConf());
    // Reference a given text file as a collection of Strings.
    PCollection<String> lines = pipeline.readTextFile(args[0]);


    // Combiner used for summing up response size and count
    CombineFn<String, Pair<Long, Long>> stringPairOfLongsSumCombiner = CombineFn.pairAggregator(CombineFn.SUM_LONGS,
        CombineFn.SUM_LONGS);

View Full Code Here

Examples of org.apache.crunch.Pipeline.readTextFile()

      return 1;
    }
    // Create an object to coordinate pipeline creation and execution.
    Pipeline pipeline = new MRPipeline(TotalBytesByIP.class, getConf());
    // Reference a given text file as a collection of Strings.
    PCollection<String> lines = pipeline.readTextFile(args[0]);


    // Combiner used for summing up response size
    CombineFn<String, Long> longSumCombiner = CombineFn.SUM_LONGS();


    // Table of (ip, sum(response size))

View Full Code Here

Examples of org.apache.crunch.Pipeline.readTextFile()

  @Test
  public void testAvroReflectSortPair() throws IOException {
    Pipeline pipeline = new MRPipeline(SortIT.class, tmpDir.getDefaultConfiguration());
    pipeline.enableDebug();
    String rsrc = tmpDir.copyResourceFileName("set2.txt");
    PCollection<Pair<String, StringWrapper>> in = pipeline.readTextFile(rsrc)
        .parallelDo(new MapFn<String, Pair<String, StringWrapper>>() {


          @Override
          public Pair<String, StringWrapper> map(String input) {
            return Pair.of(input, wrap(input));

View Full Code Here

0 1 2 3 4 5

TOP

All source code are property of their respective owners. Java is a trademark of Sun Microsystems, Inc and owned by ORACLE Inc. Contact coftware#gmail.com.