On 30 Sep 2008 Rob Kendrick wrote:
On Tue, 30 Sep 2008 09:26:14 +0100
Paul Vigay <lists-nospam(a)vigay.com> wrote:
> When you go to http://www.rfidglobal.org/ NetSurf (build r5462)
> downloads the page but then immediately comes up with a "Parsing the
> document failed" error and all you get is a blank page.
> I grabbed it using wget and although the HTML looks reasonable,
> NetSurf is still unable to view the local copy either.
> I'll submit a bug report, but wonder if this is a problem with the new
> parser, as NetSurf 1.2 views the site ok.
The page says in an HTML header (rather than an HTTP one) that its
encoding is "gb-2312", which I believe is meant to be "gb2312",
a Simplified Chinese character encoding. That may be the first hurdle.
Additionally, the page contains bytes that do not exist in this
encoding in the first place. However, it does parse as iso-8859-1.
I am not familiar enough with the new parser to say which of these
(if either) is causing the issue, or how to repair it - this posting
is informational only :)
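
The two observations above (bytes that are invalid in GB2312, yet a clean
parse as ISO-8859-1) can be checked with a few lines of Python. This is only
a rough sketch: the helper name decodes_as and the sample bytes are made up
for illustration and are not taken from the page in question or from NetSurf.

```python
def decodes_as(data: bytes, encoding: str) -> bool:
    """Return True if data decodes without error in the named encoding."""
    try:
        data.decode(encoding)
    except (UnicodeDecodeError, LookupError):
        # Invalid byte sequence, or a charset label Python does not know.
        return False
    return True


# Hypothetical sample: two valid GB2312 characters followed by a
# 0xFF byte, which is not a legal lead byte in GB2312.
sample = b"\xd6\xd0\xce\xc4" + b"\xff\x20"

print(decodes_as(sample, "gb2312"))      # strict GB2312 decoding
print(decodes_as(sample, "iso-8859-1"))  # ISO-8859-1 maps every byte value
```

Running the same check over the copy fetched with wget would show whether
the stray bytes alone are enough to make a strict decoder give up.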
NetSurf seems to throw up its hands and surrender very easily instead
of trying to put something out, even if that means defaulting to
ISO-8859-1 or US-ASCII.
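
To make the suggestion concrete, here is a minimal sketch in Python of the
kind of fallback I mean. It is not how NetSurf's parser works; the
LABEL_FIXES table and the lenient_decode function are hypothetical names
invented for this example. The idea is to correct a known-bad charset label,
try the declared encoding, and drop back to ISO-8859-1 if that fails.

```python
# Hypothetical table of mislabelled charsets and their likely intent.
LABEL_FIXES = {"gb-2312": "gb2312"}


def lenient_decode(data: bytes, declared: str) -> tuple[str, str]:
    """Decode data and return (text, name of the encoding actually used)."""
    label = declared.strip().lower()
    label = LABEL_FIXES.get(label, label)
    try:
        return data.decode(label), label
    except (LookupError, UnicodeDecodeError):
        # Unknown label or invalid bytes: ISO-8859-1 assigns a character
        # to every byte value, so this decode cannot fail.
        return data.decode("iso-8859-1"), "iso-8859-1"


text, used = lenient_decode(b"\xd6\xd0\xce\xc4", "gb-2312")
print(used)
```

The output may contain the wrong characters when the fallback kicks in, but
the reader at least gets a page rather than a blank window.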
The other problem I noticed was that when you get this error you can't
shift-drag or ctrl-drag the URL into another browser or an editor. Why
Richard Porter http://www.minijem.plus.com/
Disclaimer: I disclaim everything.