This performance test compares the performance of the three HTML parsers. It downloads a large real-world HTML file (676KB at the time of writing, and growing; the file is generated using SiteMesh) and parses it with each parser 1000 times. Between parses it calls System.gc(), so that garbage left over from the previous parser does not have to be collected during the next parser's run. Each parser is run 3 times. By the third run, most major JIT compilation should have been done, so the system will be as close as possible to a running production server.
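The procedure described above can be sketched roughly as follows. This is a minimal illustration, not the actual test code: the class name ParserBenchmarkSketch, the methods timeRun and benchmark, and the two stand-in "parsers" in main are all hypothetical, and the HTML is generated in memory instead of being downloaded. It also assumes that System.gc() is called once before each parser's batch of parses, which is one possible reading of the description.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Consumer;

public class ParserBenchmarkSketch {

    /** Runs the parser over the same document `iterations` times; returns elapsed nanoseconds. */
    static long timeRun(Consumer<String> parser, String html, int iterations) {
        long start = System.nanoTime();
        for (int i = 0; i < iterations; i++) {
            parser.accept(html);
        }
        return System.nanoTime() - start;
    }

    /**
     * Times every parser `rounds` times. Earlier rounds double as JIT warm-up,
     * so the last entry of each array is the one closest to steady state.
     */
    static Map<String, long[]> benchmark(Map<String, Consumer<String>> parsers,
                                         String html, int rounds, int iterations) {
        Map<String, long[]> results = new LinkedHashMap<>();
        for (String name : parsers.keySet()) {
            results.put(name, new long[rounds]);
        }
        for (int round = 0; round < rounds; round++) {
            for (Map.Entry<String, Consumer<String>> entry : parsers.entrySet()) {
                // Assumption: request a collection before each parser's run so that
                // garbage from the previous parser is not charged to this one.
                // System.gc() is only a hint to the JVM.
                System.gc();
                results.get(entry.getKey())[round] = timeRun(entry.getValue(), html, iterations);
            }
        }
        return results;
    }

    public static void main(String[] args) {
        // Stand-in document; the real test downloads a large HTML file instead.
        StringBuilder sb = new StringBuilder("<html><body>");
        for (int i = 0; i < 10_000; i++) {
            sb.append("<p>row ").append(i).append("</p>");
        }
        String html = sb.append("</body></html>").toString();

        // Hypothetical stand-ins; replace with calls into the real parsers.
        Map<String, Consumer<String>> parsers = new LinkedHashMap<>();
        parsers.put("splitOnTagOpen", doc -> doc.split("<"));
        parsers.put("stripTags", doc -> doc.replaceAll("<[^>]*>", ""));

        Map<String, long[]> results = benchmark(parsers, html, 3, 1000);
        for (Map.Entry<String, long[]> entry : results.entrySet()) {
            long[] nanos = entry.getValue();
            System.out.println(entry.getKey() + ": third run took "
                    + nanos[nanos.length - 1] / 1_000_000 + " ms");
        }
    }
}
```

Only the third run's figure is reported here, on the assumption that the first two runs mainly serve as warm-up. Comparing all three values per parser would show how much the timings shift as the JIT settles.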