For example, if you have a data file containing one line per token, and the label also appears on that line, you can first get a TokenSequence in which the text of each line is the Token.getText() of each token, then run this pipe, and separate the target information from the data information. For example to process the following,
BACKGROUND Then PERSON Mr. PERSON Smith BACKGROUND said ...use
new TokenSequenceMatchDataAndTarget (Pattern.compile ("([A-Z]+) (.*)"), 2, 1)
.
@author Andrew McCallum mccallum@cs.umass.edu
|
|