ISO/IEC 8859-14: Difference between revisions
→External links: NSAI/AGITS/WG6 would be a standardisation working group (WG) under the NSAI, not a standard itself. |
Drmccreedy (talk | contribs) Add ICU ref |
||
Line 13: | Line 13: | ||
{|{{chset-tableformat}} |
{|{{chset-tableformat}} |
||
{{chset-table-header|ISO/IEC 8859-14<ref>{{cite web |url=https://www.unicode.org/Public/MAPPINGS/ISO8859/DatedVersions/8859-14-1998.TXT |title=ISO/IEC 8859-14:1998 to Unicode |last1=Kuhn |first1=Markus |last2=Whistler |first2=Ken |date=1999-07-27 |work=8859 to Unicode mapping tables |publisher=[[Unicode Consortium|Unicode, Inc]]}}</ref>}} |
{{chset-table-header|ISO/IEC 8859-14<ref>{{cite web |url=https://www.unicode.org/Public/MAPPINGS/ISO8859/DatedVersions/8859-14-1998.TXT |title=ISO/IEC 8859-14:1998 to Unicode |last1=Kuhn |first1=Markus |last2=Whistler |first2=Ken |date=1999-07-27 |work=8859 to Unicode mapping tables |publisher=[[Unicode Consortium|Unicode, Inc]]}}</ref><ref>{{Citation|title=International Components for Unicode (ICU), iso-8859_14-1998.ucm|url=https://github.com/unicode-org/icu/blob/master/icu4c/source/data/mappings/iso-8859_14-1998.ucm|date=1999-07-27}}</ref>}} |
||
|- |
|- |
||
!{{chset-left2|0_<br/>0}} |
!{{chset-left2|0_<br/>0}} |
Revision as of 21:20, 14 June 2020
ISO/IEC 8859-14:1998, Information technology — 8-bit single-byte coded graphic character sets — Part 14: Latin alphabet No. 8 (Celtic), is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1998. It is informally referred to as Latin-8 or Celtic. It was designed to cover the Celtic languages, such as Irish, Manx, Scottish Gaelic, Welsh, Cornish, and Breton.
ISO-8859-14 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. CeltScript made an extension for Windows called Extended Latin-8. Microsoft has assigned code page 28604 a.k.a. Windows-28604 to ISO-8859-14.[1]
History
ISO-8859-14 was originally proposed for the Sami languages.[2] ISO 8859-12 was proposed for Celtic.[3] Later, ISO 8859-12 was proposed for Devanagari, so the Celtic proposal was changed to ISO 8859-14. The Sami proposal was changed to ISO 8859-15,[4] but it got rejected as an ISO/IEC 8859 part, although it was registered as ISO-IR-197.[5]
The original proposal used a different arrangement of points 0xA1–BF.[3] At the committee draft stage of the specification, a dotless i was included at 0xAE,[6] which was changed to a registered trademark sign (matching ISO-8859-1) in the final publication.
ISO-IR-182, an earlier (registered in 1994) modification of ISO-8859-1, had added the letters Ẁ, Ẃ, Ẅ, Ỳ, Ÿ, Ŵ, Ŷ and their lowercase forms (except for ÿ, which was already included) for Welsh language use.[7] The final published version of ISO-8859-14 includes these letters in the same positions which they appear at in ISO-IR-182.
Codepage layout
Letter Number Punctuation Symbol Other Undefined Differences from ISO-8859-1
Draft
The first draft had positions A0-BF different. It did not include the pilcrow sign, but included the cent sign instead at its Latin-1 position. Later, it was ruled that the pilcrow sign was more common, so the pilcrow sign remains at its Latin-1 position, and the cent sign was removed instead.
Draft layout
Differences from the final, published version of ISO/IEC 8859-14 are boxed. Only A0-BF is shown, the rest corresponding to the current ISO 8859-14.
References
- ^ "SheetJS/js-codepage". GitHub.
- ^ Everson, Michael. "Proposed ISO 8859-14 (later 15)".
- ^ a b c Everson, Michael. "Proposed ISO 8859-12 (later 14)".
- ^ Everson, Michael (1996-06-19). Proposal for a new part of ISO/IEC 8859: Latin alphabet No. 9 (Sámi).
- ^ Swedish Institute for Standards (1997-01-24). ISO-IR-197: Sami supplementary Latin set (PDF). ITSCJ/IPSJ.
- ^ Everson, Michael (1997-05-05). "ISO/IEC CD 8859-14:1997 — Latin alphabet No. 8 (Celtic)" (Committee Draft).
- ^ British Standards Institution (1994-03-16). Welsh variant of Latin Alphabet No. 1 (right-hand part) (PDF). ITSCJ/IPSJ. ISO-IR-182.
- ^ Kuhn, Markus; Whistler, Ken (1999-07-27). "ISO/IEC 8859-14:1998 to Unicode". 8859 to Unicode mapping tables. Unicode, Inc.
- ^ International Components for Unicode (ICU), iso-8859_14-1998.ucm, 1999-07-27
External links
- ISO/IEC 8859-14:1998
- ISO-IR 199 Celtic Supplementary Latin Set (May 1, 1998, submitted by Irish body NSAI/AGITS/WG6)