Character frequency analyzer

Analyze the presence and frequencies of non-ASCII characters in a file. The input data must be encoded as UTF-16LE. The results are returned as a table in an HTML document.
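
The kind of analysis described above can be sketched in a few lines of Python. This is only an illustration, not the analyzer's actual implementation: the function names `count_non_ascii` and `to_html_table` and the table columns are assumptions, and "non-ASCII" is taken to mean any code point above U+007F.

```python
from collections import Counter
from html import escape


def count_non_ascii(data: bytes) -> Counter:
    """Count each non-ASCII character in UTF-16LE encoded bytes."""
    text = data.decode("utf-16-le")
    # The "utf-16-le" codec keeps a leading byte order mark, so drop it
    # here to avoid counting it as a character (an assumption about what
    # a user would want to see in the results).
    if text.startswith("\ufeff"):
        text = text[1:]
    return Counter(ch for ch in text if ord(ch) > 0x7F)


def to_html_table(counts: Counter) -> str:
    """Render the counts as an HTML table, most frequent first."""
    rows = "\n".join(
        f"<tr><td>U+{ord(ch):04X}</td><td>{escape(ch)}</td><td>{n}</td></tr>"
        for ch, n in counts.most_common()
    )
    header = "<tr><th>Code point</th><th>Character</th><th>Count</th></tr>"
    return f"<table>\n{header}\n{rows}\n</table>"


if __name__ == "__main__":
    sample = "\ufeffnaïve café ñ ï".encode("utf-16-le")
    print(to_html_table(count_non_ascii(sample)))
```

Python decodes surrogate pairs into single code points, so characters outside the Basic Multilingual Plane are counted once each in this sketch.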

If you have a file (plain text or HTML) in another encoding, you can convert it by opening it in an editor and saving it with UTF-16LE encoding. On Windows, some programs (such as Notepad) call UTF-16LE “Unicode” (as opposed to “Unicode, big endian”).
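
If no suitable editor is at hand, the same conversion can be done with a short script. The following is a minimal sketch under stated assumptions: the function name `convert_to_utf16le` is hypothetical, the source file is assumed to be UTF-8 unless you say otherwise, and a byte order mark is written by default. Whether the analyzer expects a byte order mark is not stated here, so pass `add_bom=False` if it does not.

```python
from pathlib import Path


def convert_to_utf16le(src, dst, src_encoding="utf-8-sig", add_bom=True):
    """Re-encode the file at src as UTF-16LE and write it to dst.

    src_encoding is an assumption about the input; "utf-8-sig" reads
    UTF-8 with or without a leading byte order mark.
    """
    # Read raw bytes and decode explicitly so line endings are preserved.
    text = Path(src).read_bytes().decode(src_encoding)
    data = text.encode("utf-16-le")
    if add_bom:
        # FF FE is the UTF-16 little-endian byte order mark.
        data = b"\xff\xfe" + data
    Path(dst).write_bytes(data)


if __name__ == "__main__":
    import sys

    # Usage: python convert.py input.txt output.txt
    convert_to_utf16le(sys.argv[1], sys.argv[2])
```

If the source file uses a legacy encoding, pass its name explicitly, for example `src_encoding="cp1252"`.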