Why UTF-16 have a reserved range in UCS Database?
UTF-16 is just a way to represent character scalar value using one or two unsigned 16-bits
, the layout of these values shouldn't be related to character scalar value because we should apply some algorithm to get the actual character scalar value from such representation.
Let's assume that the reserved range D800-DBFF
and DC00-DFFF
are not reserved in UCS Database, and there is another representation of UTF-16 that can represent all characters in range 0-7FFF
in single unsigned 16-bits
and when the high order bit is set then another
16-bit is followed with the remaining bits, and for the byte order mark we will reserve the two possible values and that's it.
If I'm wrong then could you explain it to me.
Thanks