Logo Search packages:      
Sourcecode: icu version File versions  Download package

Constants for Regular Expression Match Modes. ICU 2.4

Enumerator:
UREGEX_CANON_EQ  Forces normalization of pattern and strings. Not implemented yet, just a placeholder, hence draft. ICU 2.4
UREGEX_CASE_INSENSITIVE  Enable case insensitive matching. ICU 2.4
UREGEX_COMMENTS  Allow white space and comments within patterns ICU 2.4
UREGEX_DOTALL  If set, '.' matches line terminators, otherwise '.' matching stops at line end. ICU 2.4
UREGEX_LITERAL  If set, treat the entire pattern as a literal string. Metacharacters or escape sequences in the input sequence will be given no special meaning. Not implemented yet as of ICU 4.4.

The flags CASE_INSENSITIVE and UNICODE_CASE retain their impact on matching when used in conjunction with this flag. The other flags become superfluous. TODO: say which escapes are still handled; anything Java does early (\u) we should still do. ICU 4.0

UREGEX_MULTILINE  Control behavior of "$" and "^" If set, recognize line terminators within string, otherwise, match only at start and end of input string. ICU 2.4
UREGEX_UNIX_LINES  Unix-only line endings. When this mode is enabled, only \u000a is recognized as a line ending in the behavior of ., ^, and $. ICU 4.0
UREGEX_UWORD  Unicode word boundaries. If set, uses the Unicode TR 29 definition of word boundaries. Warning: Unicode word boundaries are quite different from traditional regular expression word boundaries. See http://unicode.org/reports/tr29/#Word_Boundaries ICU 2.8
UREGEX_ERROR_ON_UNKNOWN_ESCAPES  Error on Unrecognized backslash escapes. If set, fail with an error on patterns that contain backslash-escaped ASCII letters without a known specail meaning. If this flag is not set, these escaped letters represent themselves. ICU 4.0

Definition at line 47 of file uregex.h.

                        {

#ifndef U_HIDE_DRAFT_API 
    /** Forces normalization of pattern and strings. 
    Not implemented yet, just a placeholder, hence draft. 
    @draft ICU 2.4 */
    UREGEX_CANON_EQ         = 128,
#endif
    /**  Enable case insensitive matching.  @stable ICU 2.4 */
    UREGEX_CASE_INSENSITIVE = 2,

    /**  Allow white space and comments within patterns  @stable ICU 2.4 */
    UREGEX_COMMENTS         = 4,

    /**  If set, '.' matches line terminators,  otherwise '.' matching stops at line end.
      *  @stable ICU 2.4 */
    UREGEX_DOTALL           = 32,
    
    /**  If set, treat the entire pattern as a literal string.  
      *  Metacharacters or escape sequences in the input sequence will be given 
      *  no special meaning. Not implemented yet as of ICU 4.4.
      *
      *  The flags CASE_INSENSITIVE and UNICODE_CASE retain their impact
      *  on matching when used in conjunction with this flag.
      *  The other flags become superfluous.
      *  TODO:  say which escapes are still handled; anything Java does
      *         early (\\u) we should still do.
      * @stable ICU 4.0
      */
    UREGEX_LITERAL = 16,

    /**   Control behavior of "$" and "^"
      *    If set, recognize line terminators within string,
      *    otherwise, match only at start and end of input string.
      *   @stable ICU 2.4 */
    UREGEX_MULTILINE        = 8,
    
    /**   Unix-only line endings.
      *   When this mode is enabled, only \\u000a is recognized as a line ending
      *    in the behavior of ., ^, and $.
      *   @stable ICU 4.0
      */
    UREGEX_UNIX_LINES = 1,

    /**  Unicode word boundaries.
      *     If set, \b uses the Unicode TR 29 definition of word boundaries.
      *     Warning: Unicode word boundaries are quite different from
      *     traditional regular expression word boundaries.  See
      *     http://unicode.org/reports/tr29/#Word_Boundaries
      *     @stable ICU 2.8
      */
    UREGEX_UWORD            = 256,

     /**  Error on Unrecognized backslash escapes.
       *     If set, fail with an error on patterns that contain
       *     backslash-escaped ASCII letters without a known specail
       *     meaning.  If this flag is not set, these
       *     escaped letters represent themselves.
       *     @stable ICU 4.0
       */
     UREGEX_ERROR_ON_UNKNOWN_ESCAPES = 512

}  URegexpFlag;


Generated by  Doxygen 1.6.0   Back to index