[html4all] Interview: HTML 5 Editor Ian Hickson discusses features, pain points, adoption rate, and more
Robert J Burns
rob at robburns.com
Sat Aug 30 12:43:41 PDT 2008
Hi Leif,
On Aug 30, 2008, at 9:05 PM, Leif Halvard Silli wrote:
> Robert J Burns 2008-08-30 17.34:
> [...]
>
>> You think Ian is saying there should have been XML-style error
>> handling from the beginning?
>
>
> HTML 5 is being defined taking into account history. But in the
> interview he speculated about the past. I imagine that he imagines
> that error handeling, if defined back then, could have differed
> from both XML and HTML 5.
True, it could differ. But the only error handling that would reduce
'tag soup' as I'm using the term would be XML style draconian error
handling. There are other benefits of defined error handling, but
reducing 'tag soup' is not one of them unless it is draconian error
handling (again I think you and Ian are using the phrase differently,
but I haven't figured out how you're using it yet).
>> I think that would have been great, but that's not the Ian I know.
>
> In danger of repeating: XML and Ian agree that HTML should have
> had error handeling from the start. They also agree that this
> would have created less (with XML, theoretically close to zero)
> tag soup. Both also try to answer the challenge from the past. But
> they solve this challenge from the past differently.
Only XML style error handling reduces tag soup. Again, whatever other
benefits it may have, Ian-style error handling does not reduce 'tag
soup'.
>>>>> However, I think that even a more "friendly" error handeling would
>>>>> have meant less tag soup. The point, in my view, is that the
>>>>> effect of errors should have be predictable.
>
>>>>
>
>>>> Certainly having predicable error handling results is a good thing.
>>>> However, if by tag soup we mean syntax errors (including ill-
>>>> formedness and invalid which is how I understand it), then I can't
>>>> see how providing predictable results for errors would reduce
>>>> errors.
>>>> If anything, it reduces the incentive to produce valid and error-
>>>> free
>>>> code, because at least one incentive now is consistent handling by
>>>> browsers.
>
>>>
>
>>> Had it been taken care of from the start, then we could have had a
>>> much more logical error handeling than we have to define now, when
>>> one has to take into account what the browsers already accept.
>>> Defining error handeling is a kind of negative defining of what is
>>> correct. Hence, I agree with Ian on this.
>>
>> You (and Ian) must be using 'tag soup' to mean something completely
>> different than I defined it. Could you tell me how you're using the
>> phrase 'tag soup'?
>
> XML style errorhandling is draconian, and hence predicable.
>
> Ian's error handeling, as he dreamed it from the start, would not
> be draconian - I think.
So how could it cause a reduction in 'tag soup'?
> Still, if all browsers were required to handle errors and
> exceptions "mildly", but in the same way, then they would not have
> had to invent this handeling by themselves. This would have lead
> to fewer errors being accepted. Unlike today, when some browser
> does it that way, another that way, a third that way - and as a
> result, each browser has to support all the different ways of
> handeling errors, in order to be compatible.
So are you and Ian using the phrase 'tag soup' to describe the various
ways browser handle errant HTML? The reason XML error handling reduces
tag soup is that the author is made immediately aware of the errors
(the soupiness of their tags) even without using a validator or
conformance checker. Upon seeing their soup they fix it so that they
can publish their documents. Ian-style error handling, masks the
errors of the authors (and their authoring tools) from the authors.
They do not know they are creating tag soup because they test it in
their browser and it works. With standardized Ian-style error handling
authors still can't see their errors upon testing in a browser. In
fact they may test it in a dozen HTML5 browsers and still not see the
soupiness of their tags. Perhaps you (and Ian) are saying that the
errors are no longer tag soup because they are handled in a consistent
manner? So then we have a new distinction (at least new to me):
conforming documents and errant documents where errant documents can
be broken down into those containing tag soup and those that don't.
For example <object>fallback</object data='file.mpeg' > would be a tag
soup errant document, but <b><i>some bold italics</b></i> would be an
errant document, with no tag soup. I'm just honestly trying to
understand how you're using the term tag soup here.
>>> I hope at least those elements you mention /are/ included in the
>>> <p> model, when HTML 5 is ready ...
>>
>> No, Leif. There won't be any tables, lists or block quotes in the
>> HTML
>> paragraph element. They were in the original WG draft lats year (only
>> for the XML serialization), but they've now been removed (presumably
>> because of Ian's impeccable research and logic).
>
>
> This is of course steps in the wrong direction.
Like so many steps over the last 18 months.
>>> Well, is microformats a way to extend or use HTML?
>>
>> Yes, but not the only way. That's why I was asking you to clarify.
>
> I would call it "using HTML".
Yes, I would too. However, its also using HTML in a way that extends
the expressiveness and semantics of HTML. If HTML (meaning here
specifically text/html HTML) supported XML namespaces-like
extensibility, then using those namespaces for extending HTML would
also be using HTML. Again, I think the microformats approach is a much
clumsier way to do extensibility because it stomps over other ways
authors commonly use HTML.
>>> On the general level, OK. But the editor must be targetted at the
>>> spesific document. Only then can WYSIWYM work, I think.
>>
>> Again, I don't know what you're saying here. Editors do target
>> specific documents.
>
> Specific /types/ of documents.
I'm still not understanding you. Could you list some examples of the
types of documents you are referring to. I'm thinking of an editor
that targets specifically an HTML type document, or an XHTML type
document. However, I can tell here you must be using type in a
different way.
Take care,
Rob
More information about the List_HTML4all.org
mailing list