Technical Notes |
Version | 1 |
Author | Robin Leroy |
Date | 2024-05-08 |
This Version | https://www.unicode.org/notes/tn54/tn54-1.html |
Previous Version | n/a |
Latest Version | https://www.unicode.org/notes/tn54/ |
The Unicode Line Breaking Algorithm, defined in UAX #14, Unicode Line Breaking Algorithm, has changed with each subsequent version of the Unicode Standard. The maintenance of the algorithm frequently requires understanding interactions between the rules and property assignments and historical background behind them that would be too obscure to document in the standard. The purpose of this Unicode Technical Note is to provide a detailed history the UAX, as well as annotations useful to the maintainers.
This document is a Unicode Technical Note. Sole responsibility for its contents rests with the author. Publication does not imply any endorsement by the Unicode Consortium. This document is not subject to the Unicode Patent Policy. For information on Unicode Technical Notes, including criteria for acceptance, see Unicode Technical Notes.
The body of this Unicode Technical Note is contained in the HTML file “alba-1.html.”
The attached HTML file is not normative, and it is not the actual Unicode Standard Annex; implementers should refer to UAX #14, which is the normative document.
The HTML file contains the entire text of Unicode Standard Annex #14, Unicode Line Breaking Algorithm, Version 15.1, plus certain annotations. The annotations give a more in-depth analysis of the algorithm. They describe the reason for each nonobvious rule, and point out interesting ramifications of the rules and interactions among the rules (interesting to Unicode maintainers, that is).
The changes in each successive published version between the original Unicode Line Breaking Algorithm (Unicode Version 3.0.0) and Version 15.1 are indicated with highlighting and strikethroughs; a sidebar makes it possible to select the range of versions of interest. UTC dispositions, documents, and public review feedback relevant to the changes are listed in curly brackets.
The following summarizes modifications from the previous version of this document.
1 | First version. |
Copyright © 2024 Robin Leroy and Unicode, Inc. All Rights Reserved. The Unicode Consortium and Robin Leroy make no expressed or implied warranty of any kind, and assume no liability for errors or omissions. No liability is assumed for incidental and consequential damages in connection with or arising out of the use of the information or programs contained or accompanying this technical note. The Unicode Terms of Use apply.
Unicode and the Unicode logo are trademarks of Unicode, Inc., and are registered in some jurisdictions.