For a short introduction to the ISO 8859 character sets, see the section on ISO 8859 in my character code tutorial and Roman Czyborra's famous ISO 8859 Alphabet Soup.
I have composed the following more detailed documents:
standard | name of alphabet | characterization |
---|---|---|
ISO 8859-1 | Latin alphabet No. 1 |
"Western", "West European" |
ISO 8859-2 | Latin alphabet No. 2 | "Central European", "East European" |
ISO 8859-3 | Latin alphabet No. 3 | "South European"; "Maltese & Esperanto" |
ISO 8859-4 | Latin alphabet No. 4 | "North European" |
ISO 8859-9 | Latin alphabet No. 5 | "Turkish" |
ISO 8859-10 | Latin alphabet No. 6 | "Nordic" (Sámi, Inuit, Icelandic) |
ISO 8859-13 | Latin alphabet No. 7 | Baltic Rim |
ISO 8859-14 | Latin alphabet No. 8 | Celtic |
ISO 8859-15 | Latin alphabet No. 9 | "euro" |
ISO 8859-16 | Latin alphabet No. 10 | Romanian and some other languages |
Notice that only the following characters in the "upper halves" are invariant in the ISO Latin alphabets, in the sense of occurring in all of them in the same code position: no-break space, soft hyphen, and the characters § Ä ä Ö ö Ü ü É é ß.
There is an excellent online character database by Indrek Hein at the Institute of the Estonian Language. You can e.g. search for Unicode characters by name or code position, get lists of differences or conversion between some character sets (such as the ISO 8859 family and many others), and get lists of characters needed for different languages.
Dates of last update: 2005-12-02, 2018-10-16.
Jukka Korpela.