(todo: graph at right needs updating)
Unicode CLDR 30 provides an update to the key building blocks for software
supporting the world’s languages. This data is used by major software
systems for internationalization and localization, adapting software to
the conventions of different languages for common software tasks. The
following summarizes the main improvements in the release.
- Unicode support is updated to 9.0, including updated Unihan readings for the pinyin collation and Han-Latin transforms, and support for new script codes and number systems.
- The set of language codes for translation has been updated, with a significant increase in the total number of translated language names.
- Substantial new data has been added for likely subtags (e.g., to get the main script for each language).
- New data items have been added to support relative times such as “3 Fridays ago” or “this hour”.
- A new draft format and a preference structure have been added to support week designations such as “the week of August 10” or “week 3 of March”.
- New <characterLabels> data can be used to generate labels for groups of related characters in character pickers.
- The structure for emoji annotations has been revised, and the data has been significantly updated. The emoji collation has been updated, and data has been added for improved segmentation behavior. A specification for synthesizing ZWJ sequence names has also been added.
- The CLDR 30 Survey Tool data collection resulted in a net increase in data items of about 9.2%, with an additional 5.9% of items changed.
For further details and links to documentation, see the CLDR Release Notes.
MARK: My recommendation is:
- Just pour the https://sites.google.com/site/cldr/index/downloads/cldr-30?pli=1#TOC-Overview language in above, after finalized. Keeping bullets for clarity.
- For the image, it should be a current snapshot of https://sites.google.com/site/cldr/index/downloads/cldr-30?pli=1#TOC-Growth
- For the tweet, I suggest:
- New CLDR v30! Unicode 9.0, full Emoji 4.0 names, extended relative times, language names, … http://cldr.unicode.org/index/downloads/cldr-30
(Not generally part of the announcement posting, but here for reference:)
About the Unicode Consortium
The Unicode Consortium is a non-profit organization founded to
develop, extend and promote use of the Unicode Standard and related
globalization standards.
The membership of the consortium represents a broad spectrum of
corporations and organizations, many in the computer and information
processing industry. Members include: Adobe, Apple, Emoji One,
EmojiXpress, Facebook, Google, Government of Bangladesh, Government of
India, Huawei, IBM, Microsoft, Monotype Imaging, Sultanate of Oman MARA,
Oracle, SAP, Tamil Virtual University, The University of California
(Berkeley), Yahoo!, plus well over a hundred Associate, Liaison, and
Individual members. For a complete member list go to
For more information, please contact the Unicode Consortium