Examples of com.google.streamhtmlparser.util.EntityResolver

com.google.streamhtmlparser.util.EntityResolver

Decodes (unescapes) HTML entities with the complication that these are received one character at a time hence must be stored temporarily. Also, we may receive some "junk" characters before the actual entity which we will discard.

This class is designed to be 100% compatible with the corresponding logic in the C-version of the {@link com.google.security.streamhtmlparser.HtmlParser}, found in htmlparser.c. There are however a few intentional differences outlines below:

We accept lower and upper-case hex NCRs, the C-version accepts only lower-case ones.
The output on some invalid inputs may be different. This is currently in the process of consolidation with Filipe.
The API is a bit different, I find this one better suited for Java. In particular, the C method processChar returns the output {@code String} whereas in Java, we returna status code and then provide the {@code String} in a separatemethod getEntity. It is cleaner as it avoids the need to return empty {@code String}s during incomplete processing.

Valid HTML entities have one of the following three forms:

&dd; where dd is a number in decimal (base 10) form.
&x|Xyy; where yy is a hex-number (base 16).
&<html-entity>; where <html-entity> is one of lt, gt, amp, quot or apos.

A reset method is provided to facilitate object re-use.

    super(STATE_TABLE, STATE_MAPPING, TEXT);
    tag = new CharacterRecorder();
    attr = new CharacterRecorder();
    value = new CharacterRecorder();
    cdataCloseTag = new CharacterRecorder();
    entityResolver = new EntityResolver();
    jsParser = new JavascriptParserImpl();
    insideJavascript = false;
    valueIndex = 0;
    textInsideUrlValue = false;
  }

    super(aHtmlParserImpl);
    tag = new CharacterRecorder(aHtmlParserImpl.tag);
    attr = new CharacterRecorder(aHtmlParserImpl.attr);
    value = new CharacterRecorder(aHtmlParserImpl.value);
    cdataCloseTag = new CharacterRecorder(aHtmlParserImpl.cdataCloseTag);
    entityResolver = new EntityResolver(aHtmlParserImpl.entityResolver);
    jsParser = new JavascriptParserImpl(aHtmlParserImpl.jsParser);
    insideJavascript = aHtmlParserImpl.insideJavascript;
    valueIndex = aHtmlParserImpl.valueIndex;
    textInsideUrlValue = aHtmlParserImpl.textInsideUrlValue;
  }

Examples of com.google.streamhtmlparser.util.EntityResolver

Related Classes of com.google.streamhtmlparser.util.EntityResolver