The CONSTANT_Utf8_info structure is used to represent constant string values.
The bytes of multibyte characters are stored in the class file in big-endian
(high byte first) order. There are two differences between this format and the
"standard" UTF-8 format. First, the null byte (byte)0 is encoded using the
two-byte format (the bytes 0xC0 0x80) rather than the one-byte format, so that Java Virtual Machine
UTF-8 strings never have embedded nulls. Second, only the one-byte, two-byte,
and three-byte formats are used. The Java Virtual Machine does not recognize
the longer UTF-8 formats.
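
The two differences described above can be illustrated with a minimal encoder sketch. The class name ModifiedUtf8 and the method encode are hypothetical names chosen for this example; they are not part of the class file format or of any library. The sketch assumes its input is a Java String and encodes each 16-bit char independently, so a surrogate pair would come out as two three-byte sequences rather than one four-byte sequence.

```java
import java.io.ByteArrayOutputStream;

// Hypothetical helper for illustration only; not part of any specification.
final class ModifiedUtf8 {

    // Encodes each 16-bit char of s using only the one-, two-, and
    // three-byte formats. The null character takes the two-byte format.
    static byte[] encode(String s) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (int i = 0; i < s.length(); i++) {
            int c = s.charAt(i);
            if (c >= 0x0001 && c <= 0x007F) {
                // One-byte format: 0xxxxxxx
                out.write(c);
            } else if (c <= 0x07FF) {
                // Two-byte format: 110xxxxx 10xxxxxx
                // (also reached for c == 0, which yields 0xC0 0x80)
                out.write(0xC0 | (c >> 6));
                out.write(0x80 | (c & 0x3F));
            } else {
                // Three-byte format: 1110xxxx 10xxxxxx 10xxxxxx
                out.write(0xE0 | (c >> 12));
                out.write(0x80 | ((c >> 6) & 0x3F));
                out.write(0x80 | (c & 0x3F));
            }
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // 'A', the null character, and the euro sign U+20AC
        for (byte b : encode("A\0\u20AC")) {
            System.out.printf("%02X ", b);
        }
        System.out.println();
    }
}
```

Running main on the sample string should print the bytes 41 C0 80 E2 82 AC: one byte for 'A', two bytes for the null character, and three bytes for U+20AC, with the high-order bits of each character written first. In the standard library, java.io.DataOutputStream.writeUTF produces the same kind of byte sequence, preceded by a two-byte length.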