![]() | |
![]() |
| | Thread Tools | Display Modes |
#11
| |||
| |||
|
|
On Thu, 13 Mar 2008, Ben C wrote: Better to use a Content-Language header and/or set the lang attribute on the html element to tell the browser the language so it can use that as a hint to pick a font. But that does not work in Internet Explorer. |
|
It works in Mozilla & Co. http://www.unics.uni-hannover.de/nht...-attribute.htm How about others like Opera? |
#12
| |||
| |||
|
|
On Thu, 13 Mar 2008, Ben C wrote: Better to use a Content-Language header and/or set the lang attribute on the html element to tell the browser the language so it can use that as a hint to pick a font. But that does not work in Internet Explorer. It works in Mozilla & Co. http://www.unics.uni-hannover.de/nht...-attribute.htm How about others like Opera? |
#13
| |||
| |||
|
|
For example, in Safari, in the title element of a commercial site http://mecu.com.au I see a question mark on a black background between "- " and " home". |
#14
| |||
| |||
|
|
In article NOwebmasterSPAM-9366B5.0644051403200...) iinet.net.au>, Eric Lindsay <NOwebmasterSPAM (AT) ericlindsay (DOT) com> wrote: For example, in Safari, in the title element of a commercial site http://mecu.com.au I see a question mark on a black background between "- " and " home". The "title element" is what in this case? The only "home" link I am seeing in my Safari is in the footer. |
#15
| |||
| |||
|
|
In article doraymeRidThis-989660.09164914032008...ptusnet.com.au>, dorayme <doraymeRidThis (AT) optusnet (DOT) com.au> wrote: In article NOwebmasterSPAM-9366B5.0644051403200...) iinet.net.au>, Eric Lindsay <NOwebmasterSPAM (AT) ericlindsay (DOT) com> wrote: For example, in Safari, in the title element of a commercial site http://mecu.com.au I see a question mark on a black background between "- " and " home". The "title element" is what in this case? The only "home" link I am seeing in my Safari is in the footer. This is the text within the required <title> element within the <head element of the page. This is not part of the page viewpoint, therefore not part of the viewable page content. It normally shows up at the very top of the browser, outside the viewpoint. At least, it did for me in Safari, Opera and Firefox. It is the text that often appears as the content when you bookmark a page. The text in full is as it appears in the lines in my post starting with echo. I can find equivalent strange question marks elsewhere in the site, within the regular content of a page. But they are in pages that are often changed, so I couldn't be sure they would remain available. Whereas the title has been the same for some time. I think they may have intended to insert a non-breaking space, but since this entire UTF-8 thing has made me retreat to printable ASCII characters without the high bit set, I hesitate to even guess what was originally intended. I would like to know why so many browsers treat it different however, when you try to inspect the source html. |
#16
| |||||
| |||||
|
|
In article Pine.GSO.4.63.0803131832170.14921 (A...ni-hannover.de>, Andreas Prilop <aprilop2008 (AT) trashmail (DOT) net> wrote: On Thu, 13 Mar 2008, Ben C wrote: Better to use a Content-Language header and/or set the lang attribute on the html element to tell the browser the language so it can use that as a hint to pick a font. But that does not work in Internet Explorer. It works in Mozilla & Co. http://www.unics.uni-hannover.de/nht...-attribute.htm How about others like Opera? How do you even tell what characters are in the source of a page? |
|
For example, in Safari, in the title element of a commercial site http://mecu.com.au I see a question mark on a black background between "- " and " home". I do not see that in Opera or Firefox. The page in question includes a meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1", however the server does not seem to provide a charset. curl -I http://mecu.com.au HTTP/1.1 200 OK Date: Thu, 13 Mar 2008 20:25:59 GMT Server: Microsoft-IIS/6.0 X-Powered-By: ASP.NET Content-Length: 26406 Content-Type: text/html; Charset=, Set-Cookie: ASPSESSIONIDASAQCRTB=NGJFGNAAPHDIMMHKOEJHEJDP; path=/ Cache-control: private curl http://mecu.com.au echo mecu - intelligent banking - ?home | hexdump 0000000 6d 65 63 75 20 2d 20 69 6e 74 65 6c 6c 69 67 65 0000010 6e 74 20 62 61 6e 6b 69 6e 67 20 2d 20 3f 68 6f 0000020 6d 65 0a |
|
Source from Safari 3.0.4 echo mecu - intelligent banking - ??home | hexdump 0000000 6d 65 63 75 20 2d 20 69 6e 74 65 6c 6c 69 67 65 0000010 6e 74 20 62 61 6e 6b 69 6e 67 20 2d 20 ef bf bd 0000020 68 6f 6d 65 0a |
|
Source from Firefox 2.0.0.10 echo mecu - intelligent banking - home | hexdump 0000000 6d 65 63 75 20 2d 20 69 6e 74 65 6c 6c 69 67 65 0000010 6e 74 20 62 61 6e 6b 69 6e 67 20 2d 20 68 6f 6d 0000020 65 0a |
|
Source from Opera 9.5 beta echo mecu - intelligent banking - ? 0000000 6d 65 63 75 20 2d 20 69 6e 74 65 6c 6c 69 67 65 0000010 6e 74 20 62 61 6e 6b 69 6e 67 20 2d 20 c2 a0 68 0000020 6f 6d 65 0a |
#17
| |||
| |||
|
| http://www.unics.uni-hannover.de/nht...-attribute.htm In that test everything gets the same font. I think what Opera does, but this is just a guess, is choose a font based on the actual characters. |
|
Although I don't know how they tell the difference between zh-tw and zh-cn (languages and codepoints very similar but you need different fonts-- simplified characters for zh-cn and traditional ones for zh-tw). |
#18
| |||
| |||
|
|
On Thu, 13 Mar 2008, Ben C wrote: http://www.unics.uni-hannover.de/nht...-attribute.htm In that test everything gets the same font. I think what Opera does, but this is just a guess, is choose a font based on the actual characters. If that is true, you should be able to see different fonts for Latin letters and Greek letters on http://www.unics.uni-hannover.de/nhtcapri/greek.html7 and different fonts for Latin letters and Cyrillic letters on http://www.unics.uni-hannover.de/nht...cyrillic.html5 But I doubt. I believe Opera uses only one font for each of these two test pages. |
|
Although I don't know how they tell the difference between zh-tw and zh-cn (languages and codepoints very similar but you need different fonts-- simplified characters for zh-cn and traditional ones for zh-tw). But how to do this with "charset=utf-8"? The codepoints in Unicode are the same for CN and TW and JP. |
#19
| |||
| |||
|
|
Please tell me there is an easier way... I need to a) strip leading whitespace from the content of my html files and b) save these files as UTF-8 and have them STAY UTF-8. Thanks |
#20
| |||
| |||
|
|
There is one reason *not* to use UTF-8. Browsers usually take the typeface depending on the page's charset. So with charset=iso-8859-5, the page is displayed in the reader's preferred Cyrillic typeface. But with charset=utf-8, the page is displayed in the reader's preferred *Latin* typeface, which might be less suitable. |
|
This reason is even more important for Chinese text. |
![]() |
| Thread Tools | |
| Display Modes | |
| |