Unicode
Unicode is a character encoding standard developed by the Unicode Consortium. It can represent the characters of almost all of the world's written languages.
However, the individual characters can no longer all be represented by a single byte, as is the case for the ASCII/ANSI character set. There are different methods of representing the characters by several bytes.
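As a rough illustration of these different methods, the following sketch uses Python (which is not part of TextTransformer) to encode the same character with three common Unicode encodings: UTF-8, UTF-16 and UTF-32. The helper name `encoded_lengths` is made up for this example.

```python
def encoded_lengths(ch):
    """Return the number of bytes needed for ch in three Unicode encodings."""
    # The "-le" (little-endian) variants are used so that Python does not
    # prepend a byte order mark, which would inflate the byte counts.
    return {
        "utf-8": len(ch.encode("utf-8")),
        "utf-16": len(ch.encode("utf-16-le")),
        "utf-32": len(ch.encode("utf-32-le")),
    }


# An ASCII letter, a German umlaut and the euro sign.
for ch in ("a", "ü", "€"):
    print(repr(ch), encoded_lengths(ch))
```

In UTF-8 the number of bytes varies from character to character, whereas UTF-32 always uses four bytes. UTF-16 uses two bytes for the characters shown here.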
If a UTF-8 encoded file is opened in ANSI mode, the German word "für" appears as "fÃ¼r" (with the Western European ANSI code page, Windows-1252).
The German umlaut 'ü' takes two bytes in the UTF-8 encoding, and ANSI mode shows each of these bytes as a separate character. If the file is opened in UTF-8 mode instead, the word is shown correctly.
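The effect can be reproduced outside TextTransformer with a few lines of Python. This is only a sketch: it assumes that "ANSI mode" corresponds to the Windows-1252 code page (`cp1252` in Python), which is the usual ANSI code page on Western European systems; other code pages would show different characters.

```python
word = "für"

# Encoding to UTF-8 yields four bytes for three characters,
# because 'ü' becomes the two bytes 0xC3 0xBC.
utf8_bytes = word.encode("utf-8")

# Reading the bytes in "ANSI mode" (assumed here: Windows-1252)
# turns every byte into one character of its own.
wrong = utf8_bytes.decode("cp1252")

# Reading the same bytes as UTF-8 combines 0xC3 0xBC into 'ü' again.
right = utf8_bytes.decode("utf-8")

print(utf8_bytes.hex(" "))  # 66 c3 bc 72
print(wrong)                # fÃ¼r
print(right)                # für
```

The bytes in the file are identical in both cases; only their interpretation differs.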
This page belongs to the TextTransformer Documentation