|Non-Confidential||ARM 100073_0607_00_en|
Shift-JIS and UTF-8 implementation
The Shift-JIS and UTF-8 locales let you use Japanese and Unicode characters.
The following table shows the locale categories for Shift-JIS (Japanese characters) and UTF-8 (Unicode characters).
Table 1-8 Default Shift-JIS and UTF-8 locales
||Sets the character set to the Shift-JIS multibyte encoding of Japanese characters|
||Sets the character set to the UTF-8 multibyte encoding of all Unicode characters|
The following list describes the effects of Shift-JIS and UTF-8 encoding:
The character classification functions of the C library behave correctly on any byte value that is a self-contained character in Shift-JIS. For example, half-width katakana characters that Shift-JIS encodes as single bytes are treated as alphabetic. UTF-8 encoding uses the same set of self-contained characters as the ASCII character set.
The multibyte conversion functions, such as wcrtomb(), all convert between Unicode wide strings and multibyte character strings in Shift-JIS or UTF-8.
printf("%ls") converts a Unicode wide string into Shift-JIS or UTF-8 output, and the corresponding input conversion converts Shift-JIS or UTF-8 input into a Unicode wide string.
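
To make the conversions described in the list above more concrete, the following is a minimal, hand-written sketch of what converting one Unicode wide character to UTF-8 involves. It is not the ARM C library implementation, and the function name encode_utf8() is hypothetical. The byte layout follows the standard UTF-8 definition, which is an assumption brought in from outside this page.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical helper: encode one Unicode code point as UTF-8.
 * Writes up to 4 bytes to out and returns the number of bytes written,
 * or 0 if cp is not a valid Unicode scalar value.
 */
size_t encode_utf8(uint32_t cp, unsigned char out[4])
{
    if (cp < 0x80) {
        /* Self-contained character: one byte, identical to ASCII. */
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (cp < 0x800) {
        /* Two bytes: 110xxxxx 10xxxxxx */
        out[0] = (unsigned char)(0xC0 | (cp >> 6));
        out[1] = (unsigned char)(0x80 | (cp & 0x3F));
        return 2;
    }
    if (cp < 0x10000) {
        /* Surrogate code points are not valid scalar values. */
        if (cp >= 0xD800 && cp <= 0xDFFF)
            return 0;
        /* Three bytes: 1110xxxx 10xxxxxx 10xxxxxx */
        out[0] = (unsigned char)(0xE0 | (cp >> 12));
        out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    if (cp <= 0x10FFFF) {
        /* Four bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx */
        out[0] = (unsigned char)(0xF0 | (cp >> 18));
        out[1] = (unsigned char)(0x80 | ((cp >> 12) & 0x3F));
        out[2] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[3] = (unsigned char)(0x80 | (cp & 0x3F));
        return 4;
    }
    return 0; /* Beyond the Unicode range. */
}
```

In this sketch, a code point below 0x80 comes out as a single byte equal to its ASCII value, which matches the statement that UTF-8 uses the same self-contained characters as ASCII. Any other code point becomes a multibyte sequence, for example U+3042 (hiragana "a") becomes the three bytes 0xE3 0x81 0x82. In application code you would normally call the library conversion functions rather than a helper like this.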