Hex Editor - Binary File Editing Software for Windows

Encodings

There are many character encodings, each describing how the characters of a particular alphabet are encoded as single-byte or multi-byte codes. The text pane of the editor window displays a textual representation of the document, and the same bytes can be interpreted according to one encoding or another.

Hex Editor Neo allows you to choose the editor window's encoding from a wide set of supported encodings. Because the encoding is a property of an individual editor window, you can set a different encoding for each editor window, even if they display the same document.
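As an illustration of how one byte sequence reads differently under different encodings, here is a minimal Python sketch. It uses Python's built-in codecs (`cp1252`, `cp437`, `cp037`) as stand-ins for an ANSI, an OEM and an EBCDIC encoding; the sample bytes are chosen for the example and are not taken from Hex Editor Neo.

```python
# Five sample bytes (hypothetical document contents).
data = bytes([0xC8, 0x85, 0x93, 0x93, 0x96])

# Decode the same bytes under three different encodings.
decoded = {name: data.decode(name) for name in ("cp1252", "cp437", "cp037")}

for name, text in decoded.items():
    print(f"{name:>7}: {text!r}")
```

Only the EBCDIC interpretation (`cp037`) yields the word "Hello"; the other two encodings show unrelated symbols for the same bytes.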

Below is a full list of supported encodings.

NOTE: Support for a specific encoding depends on the installed Windows code pages and fonts. If the required components cannot be found for the selected encoding, the “Encoding not supported” text is displayed instead of the document's data. Typing in the text pane is also disabled until you select another, supported encoding.

Encoding | Encoding | Encoding
Default ANSI | Default OEM | UTF-8
ANSI - Arabic | ANSI - Baltic | ANSI - Central European
ANSI - Cyrillic | ANSI - Greek | ANSI - Hebrew
ANSI - Latin I | ANSI - Turkish | Arabic - ASMO 449+, BCON V4
Arabic - ASMO 708 | Arabic - Transparent Arabic | Arabic - Transparent ASMO
ISO 2022 Japanese JIS X 0201-1989 | ISO 2022 Japanese with halfwidth Katakana | ISO 2022 Japanese with no halfwidth Katakana
ISO 2022 Korean | ISO 2022 Simplified Chinese | ISO 2022 Traditional Chinese
ISO 6937 Non-Spacing Accent | ISO 8859-1 Latin I | ISO 8859-15 Latin 9
ISO 8859-2 Central Europe | ISO 8859-3 Latin 3 | ISO 8859-4 Baltic
ISO 8859-5 Cyrillic | ISO 8859-6 Arabic | ISO 8859-7 Greek
ISO 8859-8 Hebrew | ISO 8859-8 Hebrew | ISO 8859-9 Latin 5
IBM EBCDIC - Arabic | IBM EBCDIC - Cyrillic (Russian) | IBM EBCDIC - Cyrillic (Serbian, Bulgarian)
IBM EBCDIC - Denmark/Norway | IBM EBCDIC - Denmark/Norway (20277 + Euro symbol) | IBM EBCDIC - Finland/Sweden
IBM EBCDIC - Finland/Sweden (20278 + Euro symbol) | IBM EBCDIC - France | IBM EBCDIC - France (20297 + Euro symbol)
IBM EBCDIC - Germany | IBM EBCDIC - Germany (20273 + Euro symbol) | IBM EBCDIC - Greek
IBM EBCDIC - Hebrew | IBM EBCDIC - Icelandic | IBM EBCDIC - Icelandic (20871 + Euro symbol)
IBM EBCDIC - International | IBM EBCDIC - International (500 + Euro symbol) | IBM EBCDIC - Italy
IBM EBCDIC - Italy (20280 + Euro symbol) | IBM EBCDIC - Japanese Katakana Extended | IBM EBCDIC - Korean Extended
IBM EBCDIC - Latin 1/Open System | IBM EBCDIC - Latin America/Spain | IBM EBCDIC - Latin America/Spain (20284 + Euro symbol)
IBM EBCDIC - Latin-1/Open System (1047 + Euro symbol) | IBM EBCDIC - Modern Greek | IBM EBCDIC - Multilingual/ROECE (Latin-2)
IBM EBCDIC - Thai | IBM EBCDIC - Turkish | IBM EBCDIC - Turkish (Latin-5)
IBM EBCDIC - U.S./Canada | IBM EBCDIC - U.S./Canada (037 + Euro symbol) | IBM EBCDIC - United Kingdom
IBM EBCDIC - United Kingdom (20285 + Euro symbol) | ISCII Assamese | ISCII Bengali
ISCII Devanagari | ISCII Gujarati | ISCII Kannada
ISCII Malayalam | ISCII Oriya | ISCII Punjabi
ISCII Tamil | ISCII Telugu | MAC - Arabic
MAC - Croatia | MAC - Cyrillic | MAC - Greek I
MAC - Hebrew | MAC - Icelandic | MAC - Japanese
MAC - Korean | MAC - Latin II | MAC - Roman
MAC - Romania | MAC - Simplified Chinese (GB 2312) | MAC - Thai
MAC - Traditional Chinese (Big5) | MAC - Turkish | MAC - Ukraine
OEM - Arabic | OEM - Baltic | OEM - Canadian-French
OEM - Cyrillic (primarily Russian) | OEM - Greek (formerly 437G) | OEM - Hebrew
OEM - Icelandic | OEM - Latin II | OEM - Modern Greek
OEM - Multilingual Latin I | OEM - Multilingual Latin I + Euro symbol | OEM - Nordic
OEM - Portuguese | OEM - Russian | OEM - Turkish
OEM - United States | Japanese (Katakana) Extended | Japanese (Latin) Extended and Japanese
JIS X 0208-1990 & 0121-1990 | Korean (Johab) | Korean Extended and Korean
Simplified Chinese | Simplified Chinese (GB2312) | Simplified Chinese Extended and Simplified Chinese
Russian - KOI8-R | T.61 | TCA - Taiwan
TeleText - Taiwan | Ukrainian (KOI8-U) | US/Canada and Japanese
US/Canada and Traditional Chinese | US-ASCII (7-bit) | Wang - Taiwan
CNS - Taiwan | Eten - Taiwan | EUC - Japanese
EUC - Korean | EUC - Simplified Chinese | EUC - Traditional Chinese
Europa 3 | HZ-GB2312 Simplified Chinese | IA5 German (7-bit)
IA5 IRV International Alphabet No. 5 (7-bit) | IA5 Norwegian (7-bit) | IA5 Swedish (7-bit)
IBM5550 - Taiwan | |

Working with Encodings

The current editor window's encoding is displayed on the status bar:

[Image: Encodings]

The text pane displays text data according to the selected encoding. When you type new data on the keyboard (with the text pane active), the typed characters are processed according to the selected encoding.

When the editor window displays data in the Hex Words or Decimal Words view type, the UNICODE (UTF-16) encoding is selected automatically, because the text pane displays UNICODE data in these modes.

To change the current window's encoding, open the shortcut menu, select the “Encoding” item and choose an encoding from the list. The list consists of “Default ANSI”, “Default OEM”, the 5 most recently used encodings and the “Other” item. Selecting the “Other” item opens the full list of supported encodings.
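The pairing of a word view with UTF-16 can be sketched in Python: each 16-bit word corresponds to one UTF-16 code unit, so the text pane can show one character per word for most text. The sketch assumes little-endian byte order, which the text above does not state, and the sample string is invented for the example.

```python
import struct

# Sample document bytes: "Hi!" stored as UTF-16, little-endian (assumed byte order).
data = "Hi!".encode("utf-16-le")

# View the bytes as unsigned 16-bit words, as a Hex Words view might.
words = struct.unpack(f"<{len(data) // 2}H", data)
hex_words = " ".join(f"{w:04X}" for w in words)

# Interpret the same bytes as UTF-16 text, one code unit per word.
text = data.decode("utf-16-le")

print(hex_words)
print(text)
```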

UTF-8 Support

UTF-8 is the first (and only, for now) multi-byte encoding supported by the editor.

The editor provides full support for the UTF-8 encoding. It not only displays text encoded in UTF-8, but also allows you to type new data in this encoding. As you type, the entered characters are converted on the fly, and a single entered character may occupy up to 4 bytes.
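To make the "up to 4 bytes" figure concrete, the following Python sketch encodes four sample characters and reports how many bytes each one occupies in UTF-8. The helper name `utf8_bytes` and the sample characters are chosen for this example only.

```python
def utf8_bytes(ch: str) -> bytes:
    """Return the UTF-8 byte sequence for a single character."""
    return ch.encode("utf-8")

# One character from each UTF-8 length class.
samples = ["A", "é", "€", "😀"]
lengths = [len(utf8_bytes(ch)) for ch in samples]

for ch, n in zip(samples, lengths):
    print(f"U+{ord(ch):04X} -> {utf8_bytes(ch).hex(' ').upper()} ({n} byte(s))")
```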

When a character occupies several bytes, a space character is displayed in all but the last cell (in Text View). The last cell displays the character itself.

The UTF-8 encoding defines strict rules for encoding UNICODE characters into one, two, three or four bytes. If these rules are broken and Hex Editor Neo cannot decode a character, it displays the ‘?’ character in each cell that contains invalid data.

All editor features, such as Find and Fill, are compatible with the current editor window's encoding and are therefore capable of working with UTF-8 as well.
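The two display rules above (blank cells before the last byte of a multi-byte character, and ‘?’ for undecodable bytes) can be sketched in Python as follows. This is an illustration only, not Hex Editor Neo's actual algorithm: the function names are hypothetical, and treating each undecodable byte as its own ‘?’ cell is an assumption about how invalid runs are split.

```python
from typing import Optional

def decode_at(data: bytes, i: int) -> Optional[str]:
    """Return the single character that starts at offset i, or None if invalid."""
    for length in range(1, 5):          # UTF-8 sequences are 1 to 4 bytes long
        chunk = data[i:i + length]
        if len(chunk) < length:         # ran past the end of the data
            return None
        try:
            return chunk.decode("utf-8")  # strict decoding rejects malformed input
        except UnicodeDecodeError:
            continue
    return None

def text_cells(data: bytes) -> list:
    """Return one display string per byte, following the rules described above."""
    cells = []
    i = 0
    while i < len(data):
        ch = decode_at(data, i)
        if ch is None:
            cells.append("?")           # assumption: one '?' per undecodable byte
            i += 1
        else:
            n = len(ch.encode("utf-8"))
            cells.extend([" "] * (n - 1) + [ch])  # character goes in the last cell
            i += n
    return cells

print(text_cells("A€".encode("utf-8") + b"\xff"))
```

For the sample input, the three-byte euro sign occupies two blank cells followed by the character, and the stray `0xFF` byte is shown as ‘?’.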
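One way to picture an encoding-aware Find is that the search text is first converted to bytes in the window's encoding and then located in the document. The Python sketch below illustrates that idea; the function `find_text` is hypothetical and is not the editor's API.

```python
def find_text(data: bytes, text: str, encoding: str) -> int:
    """Return the byte offset of text in data, or -1 if it does not occur."""
    needle = text.encode(encoding)  # convert the search text to the window's encoding
    return data.find(needle)

# Sample document stored as UTF-8.
document = "naïve café".encode("utf-8")

utf8_offset = find_text(document, "café", "utf-8")     # matches the UTF-8 bytes
ansi_offset = find_text(document, "café", "cp1252")    # single-byte 'é' is not present

print(utf8_offset, ansi_offset)
```

The same search text is found or missed depending on the encoding used to build the byte pattern, which is why a search has to follow the window's current encoding.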