![]() | |
![]() |
| | Thread Tools | Display Modes |
#11
| |||
| |||
|
|
"For any language x, that set of characters which are escape characters in x should themselvesd be escaped if they are to take their normal values in some expression." I thought that was obviously my meaning, and it seems to require some perverse gymnastics to get my original utterance to mean something different. It was obvious that that's what you should have meant, but it wasn't obvious, if you did mean that, why you would have thought a remark about the escaping of escape characters to be relevant, given that backslashes *aren't* escape characters in the context being discussed (URL syntax). |
#12
| |||
| |||
|
|
Alan J. Flavell In http://lists.w3.org/Archives/Public/www-style/2005May/0004.html , I found a top-posted "answer" which is resolutely ignoring the bottom-quoted question - | > Shouldn't backslash itself be included in the must-be-escaped | > list? Shouldn't it? I suppose you could say backslashes *are* included in the must-be-escaped list, if you recognise the list as implied. The explicit list seems to have been silently dropped: |
|
the word 'excluded' appears in RFC3986 only in unrelated contexts, and I can't find mention of this removal anywhere in the changeover notes: http://www.gbiv.com/protocols/uri/rev-2002/issues.html |
|
Anyway, backslashes still can't occur in URLs, since no production allows them. |
#13
| |||
| |||
|
|
On Sun, 10 Jun 2006, John Dunlop wrote: Alan J. Flavell In http://lists.w3.org/Archives/Public/www-style/2005May/0004.html , I found a top-posted "answer" which is resolutely ignoring the bottom-quoted question - | > Shouldn't backslash itself be included in the must-be-escaped | > list? Shouldn't it? I suppose you could say backslashes *are* included in the must-be-escaped list, if you recognise the list as implied. The explicit list seems to have been silently dropped: ah, thanks for explaining that... the word 'excluded' appears in RFC3986 only in unrelated contexts, and I can't find mention of this removal anywhere in the changeover notes: http://www.gbiv.com/protocols/uri/rev-2002/issues.html Well, appendix D does say: Section 2, on characters, has been rewritten to explain what characters are reserved, when they are reserved, and why they are reserved, even when they are not used as delimiters by the generic syntax. So that, at least, does still define "reserved" characters. But it also says ... URI normalizers are now given license to decode any percent-encoded octets corresponding to unreserved characters. Oh, hang on! "Reserved Characters" are listed in 2.2, while "Unreserved characters" are listed in 2.3. And the backslash is neither "reserved" nor "unreserved". So, "unreserved" does not mean "any characters which are not reserved". There are characters which are neither. Is that confusing? Indeed, if I take an ASCII table, and mark off the (non-control) characters which they designate as reserved, and as unreserved, I'm left with quite a few which are neither, as follows. "%" is obviously special ... Leaving (by my reckoning) space, ", <, >, \, ^, `, {, |, and } Is there a name for this category of characters - that are neither reserved nor unreserved? Anyway, backslashes still can't occur in URLs, since no production allows them. I must admit I hadn't tried approaching the problem from that direction, but, now that you put it that way, it does make sense. thanks |
#14
| |||
| |||
|
|
David E. Ross: The real questions are: I think these are different points of discussion, no more real or imaginary than the original, but more off-topic. The original was about the status of backslashes wrt URLs, which has a direct bearing on whether or not a doc violates or conforms to the spec. |
#15
| |||
| |||
|
|
Oh, hang on! "Reserved Characters" are listed in 2.2, while "Unreserved characters" are listed in 2.3. And the backslash is neither "reserved" nor "unreserved". So, "unreserved" does not mean "any characters which are not reserved". There are characters which are neither. Is that confusing? |
|
Is there a name for this category of characters - that are neither reserved nor unreserved? |
#16
| |||
| |||
|
|
The story so far: on somewhat unrelated newsgroup, my attention fell upon the URL: http://www.speedtouchdsl.com/prod706.htm which contains a link to the purported URL: http://www.speedtouchdsl.com/pdf\datasheet706WL-780WL.pdf Comparing the latter with other URLs in that area, it appeared that the "\" was a probable blunder for "/". However, since their web server is IIS, it appears that their server silently fixes-up this blunder[1], and delivers the intended content. |
![]() |
| Thread Tools | |
| Display Modes | |
| |