Logo Search packages:      
Sourcecode: icu version File versions  Download package

SelectFormat Class Reference

#include <selfmt.h>

Inheritance diagram for SelectFormat:

Format UObject UMemory

List of all members.


Detailed Description

SelectFormat supports the creation of internationalized messages by selecting phrases based on keywords. The pattern specifies how to map keywords to phrases and provides a default phrase. The object provided to the format method is a string that's matched against the keywords. If there is a match, the corresponding phrase is selected; otherwise, the default phrase is used.

Using SelectFormat for Gender Agreement

The main use case for the select format is gender based inflection. When names or nouns are inserted into sentences, their gender can affect pronouns, verb forms, articles, and adjectives. Special care needs to be taken for the case where the gender cannot be determined. The impact varies between languages:

Some other languages have noun classes that are not related to gender, but similar in grammatical use. Some African languages have around 20 noun classes.

To enable localizers to create sentence patterns that take their language's gender dependencies into consideration, software has to provide information about the gender associated with a noun or name to MessageFormat. Two main cases can be distinguished:

The resulting keyword is provided to MessageFormat as a parameter separate from the name or noun it's associated with. For example, to generate a message such as "Jean went to Paris", three separate arguments would be provided: The name of the person as argument 0, the gender of the person as argument 1, and the name of the city as argument 2. The sentence pattern for English, where the gender of the person has no impact on this simple sentence, would not refer to argument 1 at all:

{0} went to {2}.

The sentence pattern for French, where the gender of the person affects the form of the participle, uses a select format based on argument 1:

{0} est {1, select, female {allée} other {allé}} à {2}.

Patterns can be nested, so that it's possible to handle interactions of number and gender where necessary. For example, if the above sentence should allow for the names of several people to be inserted, the following sentence pattern can be used (with argument 0 the list of people's names, argument 1 the number of people, argument 2 their combined gender, and argument 3 the city name):

{0} {1, plural,
                 one {est {2, select, female {allée} other  {allé}}}
                 other {sont {2, select, female {allées} other {allés}}}
          }à {3}.

Patterns and Their Interpretation

The SelectFormat pattern text defines the phrase output for each user-defined keyword. The pattern is a sequence of keyword{phrase} clauses. Each clause assigns the phrase phrase to the user-defined keyword.

Keywords must match the pattern [a-zA-Z][a-zA-Z0-9_-]*; keywords that don't match this pattern result in the error code U_ILLEGAL_CHARACTER. You always have to define a phrase for the default keyword other; this phrase is returned when the keyword provided to the format method matches no other keyword. If a pattern does not provide a phrase for other, the method it's provided to returns the error U_DEFAULT_KEYWORD_MISSING. If a pattern provides more than one phrase for the same keyword, the error U_DUPLICATE_KEYWORD is returned.
Spaces between keyword and {phrase} will be ignored; spaces within {phrase} will be preserved.

The phrase for a particular select case may contain other message format patterns. SelectFormat preserves these so that you can use the strings produced by SelectFormat with other formatters. If you are using SelectFormat inside a MessageFormat pattern, MessageFormat will automatically evaluate the resulting format pattern. Thus, curly braces ({, }) are only allowed in phrases to define a nested format pattern.

Example: UErrorCode status = U_ZERO_ERROR; MessageFormat *msgFmt = new MessageFormat(UnicodeString("{0} est {1, select, female {allée} other {allé}} à Paris."), Locale("fr"), status); if (U_FAILURE(status)) { return; } FieldPosition ignore(FieldPosition::DONT_CARE); UnicodeString result; char* str1= "Kirti,female"; Formattable args1[] = {"Kirti","female"}; msgFmt->format(args1, 2, result, ignore, status); cout << "Input is " << str1 << " and result is: " << result << endl; delete msgFmt;

Produces the output:
Kirti est allée à Paris.

ICU 4.4

Definition at line 184 of file selfmt.h.


Public Member Functions

void applyPattern (const UnicodeString &pattern, UErrorCode &status)
virtual Formatclone (void) const
virtual UnicodeStringformat (const Formattable &obj, UnicodeString &appendTo, FieldPositionIterator *posIter, UErrorCode &status) const
UnicodeStringformat (const Formattable &obj, UnicodeString &appendTo, UErrorCode &status) const
UnicodeStringformat (const Formattable &obj, UnicodeString &appendTo, FieldPosition &pos, UErrorCode &status) const
UnicodeStringformat (const UnicodeString &keyword, UnicodeString &appendTo, FieldPosition &pos, UErrorCode &status) const
virtual UClassID getDynamicClassID () const
Locale getLocale (ULocDataLocaleType type, UErrorCode &status) const
const char * getLocaleID (ULocDataLocaleType type, UErrorCode &status) const
virtual UBool operator!= (const Format &other) const
SelectFormatoperator= (const SelectFormat &other)
virtual UBool operator== (const Format &other) const
void parseObject (const UnicodeString &source, Formattable &result, UErrorCode &status) const
virtual void parseObject (const UnicodeString &source, Formattable &result, ParsePosition &parse_pos) const
 SelectFormat (const SelectFormat &other)
 SelectFormat (const UnicodeString &pattern, UErrorCode &status)
UnicodeStringtoPattern (UnicodeString &appendTo)
virtual ~SelectFormat ()

Static Public Member Functions

static UClassID U_EXPORT2 getStaticClassID (void)
static void U_EXPORT2 operator delete (void *, void *) U_NO_THROW
static void U_EXPORT2 operator delete (void *p) U_NO_THROW
static void U_EXPORT2 operator delete[] (void *p) U_NO_THROW
static void *U_EXPORT2 operator new (size_t, void *ptr) U_NO_THROW
static void *U_EXPORT2 operator new (size_t size) U_NO_THROW
static void *U_EXPORT2 operator new[] (size_t size) U_NO_THROW

Protected Member Functions

void setLocaleIDs (const char *valid, const char *actual)

Static Protected Member Functions

static void syntaxError (const UnicodeString &pattern, int32_t pos, UParseError &parseError)

Private Types

typedef enum
SelectFormat::classesForSelectFormat 
CharacterClass
enum  classesForSelectFormat {
  tStartKeyword, tContinueKeyword, tLeftBrace, tRightBrace,
  tSpace, tOther
}

Private Member Functions

UBool checkSufficientDefinition ()
UBool checkValidKeyword (const UnicodeString &argKeyword) const
CharacterClass classifyCharacter (UChar ch) const
void copyHashtable (Hashtable *other, UErrorCode &status)
void init (UErrorCode &status)
void parsingFailure ()

Private Attributes

HashtableparsedValuesHash
UnicodeString pattern

The documentation for this class was generated from the following files:

Generated by  Doxygen 1.6.0   Back to index