Language:
    • Available Formats
    • Options
    • Availability
    • Priced From ( in USD )
    • Self Extracting File Format
    • 👥
    • Immediate download
    • $366.00
    • Add to Cart
    • Printed Edition
    • Ships in 1-2 business days
    • $421.00
    • Add to Cart

Customers Who Bought This Also Bought

 

About This Item

 

Full Description

ISO/IEC 10646:2003 specifies the Universal Multiple-Octet Coded Character Set (UCS). It is applicable to the representation, transmission, interchange, processing, storage, input and presentation of the written form of the languages of the world as well as additional symbols.

ISO/IEC 10646:2003

  • specifies the architecture of ISO/IEC 10646:2003;
  • defines terms used in ISO/IEC 10646:2003;
  • describes the general structure of the coded character set;
  • specifies the Basic Multilingual Plane (BMP) of the UCS;
  • specifies supplementary planes of the UCS: the Supplementary Multilingual Plane (SMP), the Supplementary Ideographic Plane (SIP) and the Supplementary Special-purpose Plane (SSP);
  • defines a set of graphic characters used in scripts and the written form of languages on a world-wide scale;
  • specifies the names for the graphic characters of the BMP, SMP, SIP, SSP and their coded representations;
  • specifies the four-octet (32-bit) canonical form of the UCS: UCS-4;
  • specifies a two-octet (16-bit) BMP form of the UCS: UCS-2;
  • specifies a multiple byte (one to four) byte transformation UTF-8 for use with ISO 646 (ASCII) byte-oriented environments;
  • specifies a two 16-bit form and associated transformation UTF-16 for supplementary characters;
  • specifies collection identifiers for selected set of character subsets;
  • specifies the coded representations for control functions;
  • specifies the management of future additions to this coded character set;
  • incorporates the Unicode bi-directional algorithm and normalization forms by reference.

The UCS is a coding system different from that specified in ISO/IEC 2022. A graphic character will be assigned only one code position in ISO/IEC 10646:2003, located either in the BMP or in one of the supplementary planes.

NOTE - The Unicode Standard, Version 4.0 includes a set of characters, names, and coded representations that are identical with those in ISO/IEC 10646:2003. It additionally provides details of character properties, processing algorithms, and definitions that are useful to implementers. Version 4.0 strengthens Unicode support for worldwide communication, software availability, and publishing.

By defining a consistent way of encoding multilingual text ISO/IEC 10646:2003 enables the exchange of data internationally. The information technology industry gains data stability, greater global interoperability and data interchange. ISO/IEC 10646:2003 has been widely adopted in new Internet and W3C protocols and mark up languages such as XML and HTML, and implemented in modern operating systems and computer programming languages. This edition covers over 96 000 characters from the world's scripts.

 

Document History

  1. ISO/IEC 10646:2020


    Information technology - Universal coded character set (UCS)

    • Most Recent
  2. ISO/IEC 10646:2017


    Information technology - Universal Coded Character Set (UCS)

    • Historical Version
  3. ISO/IEC 10646:2014


    Information technology - Universal Coded Character Set (UCS)

    • Historical Version
  4. ISO/IEC 10646:2012


    Information technology - Universal Coded Character Set (UCS)

    • Historical Version
  5. ISO/IEC 10646:2011


    Information technology - Universal Coded Character Set (UCS)

    • Historical Version
  6. ISO/IEC 10646:2003

    👀 currently
    viewing


    Information technology - Universal Multiple-Octet Coded Character Set (UCS)

    • Historical Version
 

Amendments, rulings, supplements, and errata

  1. ISO/IEC 10646/Amd7:2010


    Mandaic, Batak, Brahmi, and other characters

  2. ISO/IEC 10646/Amd6:2009


    Amendment 6: Bamum, Javanese, Lisu, Meetei Mayek, Samaritan, and other characters

  3. ISO/IEC 10646/Amd5:2008


    Amendment 5: Tai Tham, Tai Viet, Avestan, Egyptian Hieroglyphs, CJK Unified Ideographs Extension C, and other characters

  4. ISO/IEC 10646/Amd4:2008


    Amendment 4: Cham, Game Tiles, and other characters

  5. ISO/IEC 10646/Amd3:2008


    Amendment 3: Lepcha, Ol Chiki, Saurashtra, Vai and other characters

  6. ISO/IEC 10646/Amd2:2006


    Amendment 2: NKo, Phags-pa, Phoenician and other characters

  7. ISO/IEC 10646/Amd1:2005


    Amendment 1: Glagolitic, Coptic, Georgian and other characters