On Mon, 2013-02-25 at 23:37 -0500, Anil Jangam wrote:
Team,
I observed that the HTML parser (hubbub-0.1.2) breaks when it finds a
semicolon in a text field. An example of the text string is given
below.
[...]
<meta http-equiv="content-type" content="text/html; charset=UTF-8" />
When it finds the ';', it stops working. When I remove this ';' from
the string, it works fine. Can you please check whether this is an
issue with the parser, or whether I am missing something?
Can you explain what you mean by "stops working"? The output below is
exactly what I would expect to see, given the input, above.
ELEMENT meta
  ATTRIBUTE http-equiv
    TEXT
      content=content-type
  ATTRIBUTE content
    TEXT
      content=text/html; charset=UTF-8
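
For comparison, here is a minimal sketch using Python's standard
html.parser module. This is not hubbub; it is swapped in purely for
illustration, and the AttrCollector and parse_attrs names are
hypothetical. It shows the same behaviour: the ';' is ordinary data
inside a quoted attribute value, so the whole string should come back
as a single value for the content attribute.

```python
from html.parser import HTMLParser


class AttrCollector(HTMLParser):
    """Hypothetical helper: records each start tag with its attributes."""

    def __init__(self):
        super().__init__()
        self.elements = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; names arrive lowercased.
        # Self-closing tags such as <meta ... /> also reach this method,
        # via the default handle_startendtag implementation.
        self.elements.append((tag, dict(attrs)))


def parse_attrs(markup):
    """Parse markup and return a list of (tag, attribute dict) tuples."""
    parser = AttrCollector()
    parser.feed(markup)
    parser.close()
    return parser.elements


META = '<meta http-equiv="content-type" content="text/html; charset=UTF-8" />'
elements = parse_attrs(META)

for tag, attrs in elements:
    print("ELEMENT", tag)
    for name, value in attrs.items():
        print("  ATTRIBUTE", name, "=", value)
```

If a parser really did stop at the ';', the content value would be cut
off after "text/html", which is not what the dump above shows.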
J.