[451] in java-interest
Java HTML parser
daemon@ATHENA.MIT.EDU (Arthur van Hoff)
Thu Jun 22 19:46:17 1995
Date: Thu, 22 Jun 1995 16:06:54 -0800 (PDT)
From: Arthur van Hoff <Arthur.Vanhoff@Eng.Sun.COM>
Reply-To: Arthur van Hoff <Arthur.Vanhoff@Eng.Sun.COM>
To: hotjava-interest@java.Eng.Sun.COM, java-interest@java.Eng.Sun.COM
Hi,
Check out:
ftp://java.sun.com/pub/java/avh/html.tar.Z
for a prototype of a DTD-driven HTML parser written entirely in Java.
This parser is the prototype of the HTML parser used in the HotJava beta
release. It is a prototype: it requires the alpha3 release, it has had very
little testing, and it has only been tested on Solaris. I'd like to encourage
you to parse your favorite html pages, see if they contain any html errors,
and send me feedback (avh@eng.sun.com).
The parser reads a DTD and uses it to parse an html file. Errors are reported
when they are found. Programmers can subclass the parser and add their own
functionality to the parser. The parser is an SGML parser with some fine
tuning to make it work for html. Not all SGML features are supported yet.
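To illustrate the idea of a parser that is driven by a content model and
extended by subclassing, here is a minimal sketch. It is not the code in
html.tar.Z: the class names (TinyDtdParser, ErrorCollector), the hook
methods, and the Map used as a stand-in for a real DTD are all hypothetical,
and it assumes a current Java release rather than alpha3.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch; none of these names come from the actual prototype.
class TinyDtdParser {
    // Toy stand-in for a DTD: element name -> child elements it may contain.
    private final Map<String, Set<String>> contentModel;
    // Matches start and end tags only; attributes and text are ignored.
    private static final Pattern TAG =
        Pattern.compile("<(/?)([A-Za-z][A-Za-z0-9]*)[^>]*>");

    TinyDtdParser(Map<String, Set<String>> contentModel) {
        this.contentModel = contentModel;
    }

    // Hooks that a subclass can override to add its own functionality.
    protected void handleStartTag(String name) {}
    protected void handleEndTag(String name) {}
    protected void handleError(String message) {
        System.err.println("error: " + message);
    }

    void parse(String html) {
        Deque<String> open = new ArrayDeque<>();   // stack of open elements
        Matcher m = TAG.matcher(html);
        while (m.find()) {
            boolean closing = !m.group(1).isEmpty();
            String name = m.group(2).toLowerCase(Locale.ROOT);
            if (!contentModel.containsKey(name)) {
                handleError("unknown element <" + name + ">");
                continue;
            }
            if (closing) {
                if (name.equals(open.peek())) {
                    open.pop();
                    handleEndTag(name);
                } else {
                    handleError("unexpected </" + name + ">");
                }
            } else {
                String parent = open.peek();
                if (parent != null && !contentModel.get(parent).contains(name)) {
                    handleError("<" + name + "> not allowed inside <" + parent + ">");
                }
                open.push(name);
                handleStartTag(name);
            }
        }
        // Anything still on the stack was never closed.
        for (String name : open) {
            handleError("unclosed <" + name + ">");
        }
    }

    public static void main(String[] args) {
        Map<String, Set<String>> dtd = Map.of(
            "html", Set.of("body"),
            "body", Set.of("p", "b"),
            "p", Set.of("b"),
            "b", Set.<String>of());
        ErrorCollector parser = new ErrorCollector(dtd);
        parser.parse("<html><body><b><p>x</p></b></body></html>");
        parser.errors.forEach(System.out::println);
    }
}

// A subclass that collects errors instead of printing them.
class ErrorCollector extends TinyDtdParser {
    final List<String> errors = new ArrayList<>();

    ErrorCollector(Map<String, Set<String>> contentModel) {
        super(contentModel);
    }

    @Override
    protected void handleError(String message) {
        errors.add(message);
    }
}
```

A real DTD carries much more than this (attribute lists, optional end tags,
entities), so the sketch only shows the general shape: the grammar is data
handed to the parser, and behaviour is customised by overriding hooks.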
Have fun,
Arthur van Hoff
-
Note to Sun employees: this is an EXTERNAL mailing list!
Info: send 'help' to java-interest-request@java.sun.com