www.fgks.org   »   [go: up one dir, main page]

ISO/IEC 8859-2: Difference between revisions

Content deleted Content added
No edit summary
No edit summary
Line 17:
'''ISO/IEC 8859-2:1999''', ''Information technology — 8-bit single-byte coded graphic character sets — Part 2: Latin alphabet No. 2'', is part of the [[ISO/IEC 8859]] series of ASCII-based standard [[character encoding]]s, first edition published in 1987. It is informally referred to as "Latin-2". It is generally intended for Central<ref>{{cite web|title=Microsoft Outlook Message Encodings|url=https://technet.microsoft.com/en-us/library/cc179149(v=office.12).aspx}}</ref> or "Eastern European" languages that are written in the Latin script. Note that ISO/IEC 8859-2 is very different from [[code page 852]] (MS-DOS Latin 2, PC Latin 2) which is also referred to as "Latin-2" in Czech and Slovak regions.<ref>{{Cite web |title=The Czech and Slovak Character Encoding Mess Explained |url=http://luki.sdf-eu.org/txt/cs-encodings-faq.html#pcl2 |access-date=2022-02-27 |website=luki.sdf-eu.org}}</ref> Almost half the use of the encoding is for Polish, and it's the main legacy encoding for Polish, while virtually all use of it has been replaced by UTF-8 (on the web).
 
'''ISO-8859-2''' is the [[Internet Assigned Numbers Authority|IANA]] preferred charset name for this standard when supplemented with the [[C0 and C1 control codes]] from [[ISO/IEC 6429]]. Less than 0.04% of all web pages use ISO-8859-2 as of October 2022.<ref>{{Cite web |title=Usage Statistics and Market Share of ISO-8859-2 for Websites, October 2022 |url=https://w3techs.com/technologies/details/en-iso885902 |access-date=2022-10-23 |website=w3techs.com}}</ref><ref>{{Cite web|url=https://w3techs.com/technologies/history_overview/character_encoding|title = Historical trends in the usage statistics of character encodings for websites, February 2022}}</ref> Microsoft has assigned '''code page 28592''' a.k.a. '''Windows-28592''' to ISO-8859-2 in Windows. IBM assigned [[Code page 912]] to ISO 8859-2<ref>{{cite web |url=https://github.com/unicode-org/icu-data/blob/main/charset/data/xml/ibm-912_P100-1995.xml}}</ref>, until itthat code page was extended in 1999.<ref>{{cite web |url=https://github.com/unicode-org/icu-data/blob/main/charset/data/ucm/ibm-912_P100-1999.ucm}}</ref> '''Code page 1111''' is similar, but replaces byte B0 ° (degree sign) with U+02DA ˚ (ring above).
 
[[Windows-1250]] is similar to ISO-8859-2 and has all the printable characters it has and more. However a few of them are rearranged (unlike [[Windows-1252]], which keeps all printable characters from [[ISO-8859-1]] in the same place).