Yeesh. I have some fairly icky HTML that would be a trivial parsing job with an HTML parser that provides callbacks (a la SAX) and some lenient error handling. Too bad I can't plug an error handler into REXML.
Nasty. Maybe I will do this in JRuby and a Java SAX implementation.
Or use Rubyful Soup . . .comments powered by Disqus