The gradient computations are the same as those of CRFOptimizableByLabelLikelihood.
*Note*: Expectations corresponding to each batch of data can be computed in parallel. During gradient computation, the prior and the constraints are incorporated into the expectations of the last batch (see getBatchValue, getBatchValueGradient).

*Note*: This implementation ignores instances with infinite weights (see getExpectationValue).

@author Gaurav Chandalia
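The batching scheme described in the notes can be illustrated with a minimal sketch. It is not the MALLET implementation: the class `BatchValueSketch`, its methods, the one-parameter toy likelihood, and the Gaussian prior are all hypothetical stand-ins chosen for illustration. It only shows the shape of the idea: each batch's value is computed independently (here on a thread pool), instances with infinite weight are skipped, and the prior term is added to the last batch only so that it is counted exactly once when the batch values are summed.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical illustration; none of these names come from MALLET.
final class BatchValueSketch {

    // Toy per-instance log-likelihood standing in for the CRF label likelihood.
    static double instanceLogLikelihood(double w, double x) {
        double d = w - x;
        return -(d * d);
    }

    // Toy Gaussian prior on the single parameter w (an assumption of this sketch).
    static double gaussianPrior(double w, double variance) {
        return -(w * w) / (2.0 * variance);
    }

    // Value of one batch. The prior is folded into the last batch only.
    static double batchValue(double w, double[] xs, double[] weights,
                             boolean isLastBatch, double variance) {
        double value = 0.0;
        for (int i = 0; i < xs.length; i++) {
            if (Double.isInfinite(weights[i])) {
                continue; // instances with infinite weight are ignored
            }
            value += weights[i] * instanceLogLikelihood(w, xs[i]);
        }
        if (isLastBatch) {
            value += gaussianPrior(w, variance);
        }
        return value;
    }

    // Computes every batch value in parallel, then sums them.
    static double totalValue(double w, double[][] batches, double[][] weights,
                             double variance) {
        ExecutorService pool = Executors.newFixedThreadPool(batches.length);
        try {
            List<Future<Double>> futures = new ArrayList<>();
            for (int b = 0; b < batches.length; b++) {
                final int idx = b;
                final boolean last = (b == batches.length - 1);
                Callable<Double> task =
                        () -> batchValue(w, batches[idx], weights[idx], last, variance);
                futures.add(pool.submit(task));
            }
            double total = 0.0;
            for (Future<Double> f : futures) {
                total += f.get();
            }
            return total;
        } catch (InterruptedException | ExecutionException e) {
            throw new RuntimeException(e);
        } finally {
            pool.shutdown();
        }
    }
}
```

Because the prior appears in exactly one batch, the sum over batches matches what a single-batch computation over all the data would give, regardless of how the instances are partitioned.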