About the Character Data Type

The Character class provides several methods for determining a character's category (lowercase letter, digit, and so on) and for converting a character from uppercase to lowercase and vice versa.

The char data type represents 16-bit Unicode characters. These are a superset of the ASCII character set which allow non-English language characters. Any Unicode character can be written as a literal using the Escape character (backslash \) and the “u” character followed by its hexadecimal representation. For example, \u0065 represents the letter e.

The methods and data of class Character are defined by the information in the UnicodeData file standard that is part of the Unicode Character Database maintained by the Unicode Consortium. This file and its description are available from the Unicode Consortium at:

http://www.unicode.org

For examples of character code, see Example Character Code and Character tab Syntax Buttons.