Penguin
Note: You are viewing an old revision of this page. View the current version.

UTF8 is one serialisation of unicode, designed to be used where most people are expecting to be dealing mostly with 7bit ascii text with the occasional unicode charactor. The first 127 codes in UTF8 map exactly onto the 7bit ASCII range, with higher codes being reachable with escapes creating multibyte charactors.