ULI Project

The Unicode Localization Interoperability Technical Committee (ULI) works to ensure interoperable data interchange of critical localization-related assets, including:
  • Translation memory: A translation memory system stores words or phrases that have been tanslated previously. The use of translation memory ensures the consistency of translated content, accelerates the speed of translation, and also reduces the cost of repeated translation requests.
  • Segmentation rules: Segmentation rules define the way to segment text for translation or other text processing. The rules are used in conjunction with translation memory to create memory segments or identify matches within the source content of existing translation memories.
  • Translation source strings and their translations: Translation source is natural language text, typically with markup, that will be translated into another language. The translated strings are the results of translating the source strings while preserving the markup.
  • Word Count: new! - Defining best practices around how to best count words in the context of translation interchange.

Whether a translation request is completed by human or machine, these assets play a vital role in the overall translation process. Interoperable interchange of these assets reduces errors, lowers costs, and improves throughput.

Charter and Scope

Please see ULI Charter page for more details.

Profiles of Use

The primary focus of the ULI Technical Committee will be to establish profiles of use for XLIFF, TMX, and SRX. The committee will develop and publish specifications that document specific usage conventions that can be shared for interoperability. This will improve data exchange through more consistent implementations and enhance the usefulness of these three standards.

Extensions to Established Standards

The secondary focus of the ULI Technical Committee will be to gather requirements for future extensions to XLIFF, TMX, and SRX. The ULI committee will develop reference implementations, as necessary, to demonstrate the feasibility of any proposals for future standardization.

Word Count

One of the challenges of translation interoperability is objectively measuring the difficulty of a particular translation workload. A common metric used is the word count. However, methods for counting words vary across different systems and languages. Some examples: Thai is written without space characters between words, as is Japanese and Chinese. Should numbers be included or not included? Are Mongolian suffixes considered a separate word or not?

The ULI-TC is hosting the development of a future Unicode technical note, you may follow and contribute to the discussion on this Github page.

Publicly Available Specifications

These documents are archived for historical purposes and do not specify a Unicode standard. These documents are already publicly available online elsewhere, are are only hosted on the Unicode ULI site as a convenience.

Participation

For information on how to join the ULI and get involved in its work, contact the Unicode Consortium with the contact form and ask about the ULI.

To become a voting participant in the work of the ULI committee, join Unicode in one of the three voting categories of membership: Full, Institutional, or Supporting. Learn about the benefits of joining.

The officers of the ULI will establish the meeting schedule. Meetings are to be conducted by conference call to enable broad participation by members of the industry.

Email Discussions

Outside of formal meetings, much of the technical work of the Unicode Localization Interoperability Technical Committee is conducted in email discussions held on the distribution list of ULI members (uli). Informal discussions of technical issues are also held on public Unicode email distribution lists.

Officers

The current Technical Committee Officers are:

  • Chair: Steven R Loomis (IBM)
  • Vice Chair (Interim): Yoshito Umaoka (IBM)
 Hot News!
  • February 2017 ULI-TC meeting After a great January kickoff, our second meeting of 2017 will commence next Monday. Time in your local time zone: https://time.is/1000_20_Feb_2017_in_PT?ULI ...
    Posted Feb 14, 2017, 5:58 PM by Steven R. Loomis
  • Word Count effort begun One of the challenges of translation interoperability is objectively measuring the difficulty of a particular translation workload. A common metric used is the word count. However, methods for counting words ...
    Posted Feb 14, 2017, 5:47 PM by Steven R. Loomis
  • Publicly Available Specifications The ULI project now hosts documents which are archived for historical purposes and do not specify a Unicode standard. They are located at http://www.unicode.org/uli/pas/
    Posted Oct 27, 2015, 11:33 PM by Steven R. Loomis
  • ULI Segment Exceptions Posted in SVN and Demo Updated The latest ULI segmentation exception has been posted in SVN: http://unicode.org/uli/trac/browser/trunk/abbrs including: Reference to CLDR date/month and other necessary symbolsAvailable in ...
    Posted Jan 18, 2013, 2:26 PM by Steven R. Loomis
  • New Mailing List: uli-users A public mailing list has been created for discussion of unicode, localization, and interoperability as on Unicode.org. This mailing list is intended for broad-based conversations on the topics ...
    Posted Mar 27, 2012, 7:54 AM by Kevin Lenzo
Showing posts 1 - 5 of 9. View more »