#define U_CHARSET_FAMILY   0

These definitions make it possible to specify the encoding of text in the char data type as defined by the platform and the compiler. It is enough to determine the code point values of "invariant characters", which are the ones shared by all encodings that are in use on a given platform.

Those "invariant characters" should be all the uppercase and lowercase Latin letters, the digits, the space, and "basic punctuation". Also, '\n', '\r', '\t' should be available.
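
As an illustration of how code might branch on this macro, here is a minimal sketch. It assumes the companion constants U_ASCII_FAMILY (0) and U_EBCDIC_FAMILY (1) from the ICU headers; the fallback definitions exist only so the sketch compiles on its own. The helper names charset_family_name() and expected_code_of_A() are hypothetical and not part of ICU.

```c
/* In ICU these macros come from the ICU headers. The fallbacks below are an
   assumption made only so this sketch compiles without ICU installed. */
#ifndef U_ASCII_FAMILY
#define U_ASCII_FAMILY 0
#endif
#ifndef U_EBCDIC_FAMILY
#define U_EBCDIC_FAMILY 1
#endif
#ifndef U_CHARSET_FAMILY
#define U_CHARSET_FAMILY U_ASCII_FAMILY
#endif

/* Hypothetical helper: human-readable name of the configured family. */
static const char *charset_family_name(void) {
#if U_CHARSET_FAMILY == U_ASCII_FAMILY
    return "ASCII";
#elif U_CHARSET_FAMILY == U_EBCDIC_FAMILY
    return "EBCDIC";
#else
    return "unknown";
#endif
}

/* Hypothetical helper: the code value that the invariant character 'A' is
   expected to have in the configured family (0x41 in ASCII-based encodings,
   0xC1 in EBCDIC-based ones), or -1 if the family is not recognized. */
static int expected_code_of_A(void) {
#if U_CHARSET_FAMILY == U_ASCII_FAMILY
    return 0x41;
#elif U_CHARSET_FAMILY == U_EBCDIC_FAMILY
    return 0xC1;
#else
    return -1;
#endif
}
```

On a correctly configured build, comparing expected_code_of_A() with the character literal 'A' is one cheap way to confirm that the macro agrees with the compiler's actual char encoding.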

The list of "invariant characters" is:

    A-Z  a-z  0-9  SPACE  "  %  &  '  (  )  *  +  ,  -  .  /  :  ;  <  =  >  ?  _

(52 letters + 10 numbers + 20 punc/sym/space = 82 total)

This matches the IBM Syntactic Character Set (CS 640).
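
The list above can be turned into a simple membership test. The following is a minimal sketch, not ICU's own implementation. The names INVARIANT_CHARS, is_invariant_char() and is_invariant_string() are hypothetical. Because the table is written with character literals, it is compiled into whatever encoding the compiler uses for char, which is exactly why restricting it to invariant characters keeps it portable.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical table (not part of ICU): the 82 invariant characters,
   i.e. 52 letters + 10 digits + 20 punctuation/symbol/space characters. */
static const char INVARIANT_CHARS[] =
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    "abcdefghijklmnopqrstuvwxyz"
    "0123456789"
    " \"%&'()*+,-./:;<=>?_";

/* Returns true if c is one of the 82 invariant characters. */
static bool is_invariant_char(char c) {
    /* strchr() also matches the terminating NUL, so exclude it explicitly. */
    return c != '\0' && strchr(INVARIANT_CHARS, c) != NULL;
}

/* Returns true if every character of the NUL-terminated string s is
   invariant. The empty string is trivially invariant. */
static bool is_invariant_string(const char *s) {
    for (; *s != '\0'; ++s) {
        if (!is_invariant_char(*s)) {
            return false;
        }
    }
    return true;
}
```

For example, is_invariant_string("Hello, world.") holds, while is_invariant_string("a[0]") does not, because the brackets are not in the list. The sketch deliberately leaves out '\n', '\r' and '\t', which the text describes as available but does not count among the 82.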

In other words, all the graphic characters in 7-bit ASCII should be safely accessible except the following:

    '\' <backslash>
    '[' <left bracket>
    ']' <right bracket>
    '{' <left brace>
    '}' <right brace>
    '^' <circumflex>
    '~' <tilde>
    '!' <exclamation mark>
    '#' <number sign>
    '|' <vertical line>
    '$' <dollar sign>
    '@' <commercial at>
    '`' <grave accent>
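
The two lists can be cross-checked with a little arithmetic: 7-bit ASCII has 95 graphic characters including the space (0x20 through 0x7E), and removing the 13 characters above leaves 82. The sketch below performs that count. It uses numeric ASCII code points rather than character literals so the check does not depend on the compiler's own charset. The names EXCLUDED_ASCII, is_excluded_ascii() and count_invariant_ascii() are hypothetical.

```c
#include <stddef.h>

/* ASCII code points of the 13 excluded characters, in the order listed. */
static const unsigned char EXCLUDED_ASCII[] = {
    0x5C, /* backslash        */
    0x5B, /* left bracket     */
    0x5D, /* right bracket    */
    0x7B, /* left brace       */
    0x7D, /* right brace      */
    0x5E, /* circumflex       */
    0x7E, /* tilde            */
    0x21, /* exclamation mark */
    0x23, /* number sign      */
    0x7C, /* vertical line    */
    0x24, /* dollar sign      */
    0x40, /* commercial at    */
    0x60  /* grave accent     */
};

/* Returns 1 if the ASCII code point cp is in the excluded list, else 0. */
static int is_excluded_ascii(unsigned char cp) {
    size_t i;
    for (i = 0; i < sizeof(EXCLUDED_ASCII); ++i) {
        if (EXCLUDED_ASCII[i] == cp) {
            return 1;
        }
    }
    return 0;
}

/* Counts the ASCII graphic characters (space through tilde) that are not
   excluded, which should equal the size of the invariant set. */
static int count_invariant_ascii(void) {
    int count = 0;
    unsigned int cp;
    for (cp = 0x20; cp <= 0x7E; ++cp) {
        if (!is_excluded_ascii((unsigned char)cp)) {
            ++count;
        }
    }
    return count;
}
```

The count agrees with the total of 82 given for the invariant character list.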
Stable: ICU 2.0

Definition at line 161 of file utypes.h.

Referenced by ucnvsel_openFromSerialized() and ucol_cloneBinary().

