Examples of cascading.flow.planner.rule.RuleExpression

cascading.flow.planner.rule.RuleExpression
A RuleExpression describes where a {@link cascading.flow.planner.rule.Rule} is applied in an element graph. Here, a Rule can be a {@link cascading.flow.planner.rule.RuleAssert}, {@link cascading.flow.planner.rule.RuleTransformer}, or {@link cascading.flow.planner.rule.partitioner.ExpressionRulePartitioner}.
To pin down how a Rule behaves and where, a RuleExpression relies on the {@link cascading.flow.planner.iso.expression.ExpressionGraph} class, where an ExpressionGraph is an actual graph of {@link cascading.flow.planner.iso.expression.ElementExpression} nodes and {@link cascading.flow.planner.iso.expression.ScopeExpression} edges.
This expression graph is analogous to a text Regular Expression. A regular expression is used to match a sub-set of text in a string. This enables efficient string replacement and parsing.
The expression graph is used to match a smaller graph inside a larger one so that the larger graph can be manipulated. As in a regular expression, elements of the captured graph can be addressed and used.
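The regular expression side of this analogy can be sketched with the standard java.util.regex classes. The textual pipeline format, the class name RegexCaptureAnalogy, and the group name "primary" are assumptions made for illustration only; Cascading does not match against strings.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class RegexCaptureAnalogy
  {
  // Finds the first "HashJoin(<name>)" in a textual pipeline (a made-up format)
  // and returns the captured name, or null when nothing matches.
  static String findHashJoinName( String pipeline )
    {
    // the named group plays the role of a captured element in an expression graph
    Pattern pattern = Pattern.compile( "HashJoin\\((?<primary>\\w+)\\)" );
    Matcher matcher = pattern.matcher( pipeline );

    return matcher.find() ? matcher.group( "primary" ) : null;
    }

  public static void main( String[] args )
    {
    System.out.println( findHashJoinName( "Tap(in) -> Each(parse) -> HashJoin(users) -> Tap(out)" ) );
    }
  }
```

The match locates a small pattern inside a larger string, and the named group makes the matched part addressable afterwards, much as a captured element is addressable after an ExpressionGraph matches.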
The simplest application is to use a single ExpressionGraph to match a portion of a larger graph. Once found, the calling rule can fire. Most commonly this is useful for the RuleAssert rule that looks for a given structure and, if found, throws an error pinpointing the element in the graph that violates the assertion rule.
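As a rough illustration of this first application, the sketch below models an assertion in plain Java. It does not use the Cascading classes: Element, check, and the flat list standing in for an element graph are all hypothetical, and the "{Primary}" substitution is only an assumption about how such a message might be filled in.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.Predicate;

final class AssertSketch
  {
  // Hypothetical stand-in for a flow element: a kind such as "HashJoin" plus a name.
  static final class Element
    {
    final String kind;
    final String name;

    Element( String kind, String name )
      {
      this.kind = kind;
      this.name = name;
      }

    @Override
    public String toString()
      {
      return kind + "(" + name + ")";
      }
    }

  // Returns null when the expression matches nothing, otherwise the message
  // with {Primary} replaced by the first matching element.
  static String check( List<Element> graph, Predicate<Element> expression, String message )
    {
    for( Element element : graph )
      {
      if( expression.test( element ) )
        return message.replace( "{Primary}", element.toString() );
      }

    return null;
    }

  public static void main( String[] args )
    {
    List<Element> graph = Arrays.asList(
      new Element( "Tap", "in" ),
      new Element( "HashJoin", "users" ),
      new Element( "Tap", "out" ) );

    System.out.println( check( graph, e -> e.kind.equals( "HashJoin" ), "HashJoins not supported, found {Primary}" ) );
    }
  }
```

A real RuleAssert matches against graph structure rather than a single predicate over a list; the sketch keeps only the idea of "match, then report the captured element".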
The second application is to have one ExpressionGraph identify distinguished elements in a larger graph, and remove them. This is called graph contraction. The structure of the original graph is retained where possible. Think of this as hiding elements in the larger graph so that a second ExpressionGraph can be applied to look for a sub-graph.
If the second sub-graph is found, the calling Rule can execute. This is most commonly used to partition a graph into smaller graphs. For example, any sub-graph with source Taps and a single sink Group (Pipe) is a process Node (Mapper).
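Graph contraction as described above can be sketched in plain Java. This is a simplified model rather than Cascading's implementation: the graph is a hypothetical adjacency map keyed by element label, and "distinguished" is an arbitrary predicate (here, Taps and GroupBys). Each kept element is connected to the nearest kept elements reachable through hidden ones, so the original structure is retained where possible.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Collections;
import java.util.Deque;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.Predicate;

final class ContractionSketch
  {
  // Builds a small linear assembly: Tap(in) -> Each(parse) -> GroupBy(g) -> Every(count) -> Tap(out)
  static Map<String, List<String>> sampleGraph()
    {
    Map<String, List<String>> graph = new LinkedHashMap<>();
    graph.put( "Tap(in)", Arrays.asList( "Each(parse)" ) );
    graph.put( "Each(parse)", Arrays.asList( "GroupBy(g)" ) );
    graph.put( "GroupBy(g)", Arrays.asList( "Every(count)" ) );
    graph.put( "Every(count)", Arrays.asList( "Tap(out)" ) );
    graph.put( "Tap(out)", Collections.emptyList() );
    return graph;
    }

  // Hides every element the predicate rejects, linking each kept element
  // to the first kept elements found along each outgoing path.
  static Map<String, Set<String>> contract( Map<String, List<String>> graph, Predicate<String> keep )
    {
    Map<String, Set<String>> contracted = new LinkedHashMap<>();

    for( String node : graph.keySet() )
      {
      if( !keep.test( node ) )
        continue;

      Set<String> successors = new LinkedHashSet<>();
      Set<String> visited = new HashSet<>();
      Deque<String> pending = new ArrayDeque<>( graph.getOrDefault( node, Collections.emptyList() ) );

      while( !pending.isEmpty() )
        {
        String next = pending.pop();

        if( !visited.add( next ) )
          continue;

        if( keep.test( next ) )
          successors.add( next ); // stop at the first kept element on this path
        else
          pending.addAll( graph.getOrDefault( next, Collections.emptyList() ) ); // walk through a hidden element
        }

      contracted.put( node, successors );
      }

    return contracted;
    }

  public static void main( String[] args )
    {
    Predicate<String> tapsAndGroups = label -> label.startsWith( "Tap" ) || label.startsWith( "GroupBy" );
    System.out.println( contract( sampleGraph(), tapsAndGroups ) );
    }
  }
```

On the sample graph the Each and Every elements are hidden, leaving Tap(in), GroupBy(g), and Tap(out) in sequence. A second expression could then look for a pattern such as "source Tap followed by a Group" in the contracted graph without having to account for the operations in between.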
The third application is similar to the second. An ExpressionGraph is used to create a contracted graph of only distinguished elements. The second ExpressionGraph finds a sub-graph in the contracted graph. This contracted sub-graph is then isolated and all hidden elements are restored within the bounds of the sub-graph.
Finally, a third ExpressionGraph is used to identify a location within the new sub-graph so the Rule can execute. This is most commonly used to perform transformations within a graph. For example, a temporary Tap can be inserted into the full assembly element graph to force boundaries between MapReduce jobs.
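The three steps of this third application (contract, match and restore, then locate) can be walked through on a linear pipeline. The sketch below is a loose illustration under several assumptions: the pipeline is a flat list of hypothetical labels, Taps and GroupBys are the distinguished elements, the matched pattern is two consecutive GroupBys, and the insertion point is directly in front of the second GroupBy. None of this is Cascading's actual planner code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class TransformSketch
  {
  static boolean isDistinguished( String element )
    {
    return element.startsWith( "Tap" ) || element.startsWith( "GroupBy" );
    }

  // Returns a copy of the pipeline with "Tap(temp)" inserted at the first
  // GroupBy -> GroupBy boundary, or the pipeline unchanged if there is none.
  static List<String> insertTempTap( List<String> pipeline )
    {
    // step 1: contract, remembering the original positions of distinguished elements
    List<Integer> kept = new ArrayList<>();

    for( int i = 0; i < pipeline.size(); i++ )
      {
      if( isDistinguished( pipeline.get( i ) ) )
        kept.add( i );
      }

    // step 2: look for GroupBy -> GroupBy in the contracted sequence
    for( int k = 0; k + 1 < kept.size(); k++ )
      {
      int from = kept.get( k );
      int to = kept.get( k + 1 );

      if( !pipeline.get( from ).startsWith( "GroupBy" ) || !pipeline.get( to ).startsWith( "GroupBy" ) )
        continue;

      // restore the hidden elements within the bounds of the match
      List<String> subGraph = pipeline.subList( from, to + 1 );

      // step 3: locate the second GroupBy inside the restored sub-graph
      int location = -1;

      for( int i = 1; i < subGraph.size(); i++ ) // index 0 is the leading GroupBy
        {
        if( subGraph.get( i ).startsWith( "GroupBy" ) )
          {
          location = i;
          break;
          }
        }

      List<String> result = new ArrayList<>( pipeline );
      result.add( from + location, "Tap(temp)" ); // insert in front of the located element
      return result;
      }

    return pipeline;
    }

  public static void main( String[] args )
    {
    List<String> pipeline = Arrays.asList( "Tap(in)", "Each(parse)", "GroupBy(a)", "Every(count)",
      "Each(filter)", "GroupBy(b)", "Every(sum)", "Tap(out)" );

    System.out.println( insertTempTap( pipeline ) );
    }
  }
```

Only the first match is transformed here; a planner would typically re-apply the rule until no match remains.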

  {
  public RemoveMalformedHashJoinNodeTransformer()
    {
    super(
      PostNodes,
      new RuleExpression(
        new MalformedJoinExpressionGraph()
      )
    );
    }


  {
  public RemoveMalformedHashJoinPipelineTransformer()
    {
    super(
      PostPipelines,
      new RuleExpression( new MalformedJoinExpressionGraph() )
    );
    }


  {
  public MultiTapGroupNodePartitioner()
    {
    super(
      PartitionNodes,
      new RuleExpression(
        new NoGroupTapExpressionGraph(),
        new MultiTapGroupExpressionGraph()
      )
    );
    }



    ruleRegistry.addRule( new WholeGraphStepPartitioner() );

    ruleRegistry.addRule(
      new ExpressionRulePartitioner( PlanPhase.PartitionNodes,
        new RuleExpression(
          new TestNoGroupTapExpressionGraph(),
          new MultiTapGroupExpressionGraph()
        )
      )
    );

    ruleRegistry.addRule(
      new UniquePathRulePartitioner( PlanPhase.PartitionNodes,
        RulePartitioner.PartitionSource.PartitionCurrent,
        new RuleExpression(
          new TestNoGroupTapExpressionGraph(),
          new TestBoundarySelJoinCoGroupExpressionGraph()
        )
      )
    );


  public NoHashJoinAssert()
    {
    super(
      PreBalanceAssembly,
      new RuleExpression( new ExpressionGraph()
        .arcs(
          new FlowElementExpression( ElementCapture.Primary, HashJoin.class )
        ) ),
      "HashJoins not supported by this rule registry, found {Primary}"


  public AccumulatedPostNodeAnnotator()
    {
    super(
      PlanPhase.PostNodes,
      new RuleExpression(
        new NoGroupJoinMergeBoundaryTapExpressionGraph(),
        new AccumulatedSourceExpressionGraph()
      ),
      // not marked accumulated unless all paths are blocking


    {
    super(
      PartitionNodes,
      PartitionSource.PartitionCurrent, // force repartitioning
      new RuleExpression(
        new NoGroupMergeBoundaryTapExpressionGraph(),
        new StreamedAccumulatedBoundariesExpressionGraph()
      )
    );
    }


  public TopDownBoundariesNodePartitioner()
    {
    super(
      PartitionNodes,
      new RuleExpression(
        new NoGroupJoinMergeBoundaryTapExpressionGraph(),
        new TopDownConsecutiveBoundariesExpressionGraph()
      ),
      new ElementAnnotation( ElementCapture.Include, IORole.sink )


  public BottomUpJoinedBoundariesNodePartitioner()
    {
    super(
      PartitionNodes,
      new RuleExpression(
        new NoGroupJoinMergeBoundaryTapExpressionGraph(),
        new BottomUpConsecutiveBoundariesExpressionGraph()
      ),
      new ElementAnnotation( ElementCapture.Primary, IORole.sink )


    {
    super(
      PartitionNodes,
      PartitionSource.PartitionCurrent, // force repartitioning
      new RuleExpression(
        new NoGroupJoinMergeBoundaryTapExpressionGraph(),
        new StreamedOnlySourcesExpressionGraph()
      )
    );
    }



Related Classes of cascading.flow.planner.rule.RuleExpression

cascading.flow.hadoop.planner.rule.partitioner.MultiTapGroupNodePartitioner

cascading.flow.hadoop.planner.rule.transformer.RemoveMalformedHashJoinPipelineTransformer

cascading.flow.iso.IsomorphismTest

cascading.flow.tez.planner.rule.annotator.AccumulatedPostNodeAnnotator

cascading.flow.tez.planner.rule.assertion.NoHashJoinAssert

cascading.flow.tez.planner.rule.partitioner.BottomUpBoundariesNodePartitioner

cascading.flow.tez.planner.rule.partitioner.BottomUpJoinedBoundariesNodePartitioner

cascading.flow.tez.planner.rule.partitioner.ConsecutiveGroupOrMergesNodePartitioner

cascading.flow.tez.planner.rule.partitioner.SplitJoinBoundariesNodeRePartitioner

cascading.flow.tez.planner.rule.partitioner.StreamedAccumulatedBoundariesNodeRePartitioner
