This section lists the legacy encodings supported by Rosette. The encodings are listed by language and include alternative names that Rosette recognizes as equal in code points to the encoding.
Supported platforms for Rosette Core Library for Unicode include Windows, Linux, Solaris, AIX, HPUX, and MacOS.
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10004 | Macintosh Arabic | Microsoft & IBM | CP10004 |
| CP1256 | Microsoft & IBM | CP1256 | |
| CP20420 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20420 |
| CP28596 | Arabic Alphabet (ISO) | Microsoft & IBM | CP28596 |
| CP708 | ASMO708 | Microsoft & IBM | CP708 |
| CP720 | Transparent ASMO | Microsoft & IBM | CP720 |
| CP864 | Microsoft & IBM | CP864 | |
| ISO 8859-6 | ISOLatinArabic | International or National Standard | ISO_8859-6, Arabic, iso-ir-127, ECMA-114, ASMO-708 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP28594 | Baltic Alphabet (ISO) | Microsoft & IBM | CP28594 |
| CP775 | Microsoft & IBM | CP775 | |
| ISO 8859-4 | Latin4 | International or National Standard | ISO-8859-4, Latin4, iso-ir-110 |
| ISO 8859-13 | Latin7 | International or National Standard | ISO-8859-13, Latin7 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| ISO 8859-14 | Latin8 | International or National Standard | ISO-8859-14, Latin8, iso-ir-199 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| ChineseAutoDetect | For encodings, see ChineseAutodetect | Rosette Autodetect | ChineseAutoDetect |
| HKSCS | International or National Standard | HKSCS | |
| ISO 2022-CN | International or National Standard | ISO-2022-CN | |
| GB 18030 | International or National Standard | GB18030 | |
| Chinese, Simplified | |||
| CCSID 935 | IBM | CCSID-935, CCSID935 | |
| EUC-CN | GB2312, EUC-SC | Unix | GB2312 |
| GB2312 | EUC-CN, EUC-SC | International or National Standard | GB2312 |
| HZ-GB-2312 | HZ-GB-2312 | International or National Standard | HZ, HZ-GB-2312 |
| CP936 | GBK | Microsoft & IBM | CP936, GBK |
| MacChineseSimplified | Macintosh | MacChineseSimplified | |
| Chinese, Traditional | |||
| CCSID 937 | IBM | CCSID-937, CCSID937 | |
| CNS-11643-1986 | EUC-TW | International or National Standard | CNS-11643-1986 |
| CNS-11643-1992 | EUC-TW | International or National Standard | CNS-11643, CNS-11643-1992 |
| EUC-TW | CNS-11643-1986, CNS-11643-1992 | Unix | CNS-11643, CNS-11643-1992 |
| GB12345 | International or National Standard | GB12345 | |
| Big5 | International or National Standard | Big5 | |
| Big5+ | International or National Standard | Big5+, Big5Plus | |
| CP10002 | Macintosh Traditional Chinese | Microsoft & IBM | CP10002 |
| CP950 | Microsoft & IBM | CP950 | |
| MacChineseTraditional | Macintosh | MacChineseTraditional |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| MacCroatian | Macintosh | MacCroatian |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10007 | Macintosh Cyrillic | Microsoft & IBM | CP10007 |
| CP1251 | MS Windows Cyrillic (Slavic) | Microsoft & IBM | CP1251 |
| CP20866 | Cyrillic Alphabet, KOI8-R | Microsoft & IBM | CP20866 |
| CP20880 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20880 |
| CP21025 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP21025 |
| CP21866 | Ukrainian KOI8-RU | Microsoft & IBM | CP21866 |
| CP28595 | Cyrillic Alphabet (ISO) | Microsoft & IBM | CP28595 |
| CP855 | IBM Cyrillic | Microsoft & IBM | CP855 |
| CP866 | MS DOS Russian | Microsoft & IBM | CP866 |
| ISO 8859-5 | ISOLatinCyrillic | International or National Standard | ISOLatinCyrillic |
| MacCyrillic | Macintosh | MacCyrillic |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| MacDevanagari | Macintosh | MacDevanagari | |
| ISCII-Devanagari | Indian Standards | x-iscii-de, windows-57002 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10006 | Macintosh Greek 1 | Microsoft & IBM | CP10006 |
| CP1253 | Microsoft & IBM | CP1253 | |
| CP20423 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20423 |
| CP28597 | Greek Alphabet (ISO) | Microsoft & IBM | CP28597 |
| CP737 | Microsoft & IBM | CP737 | |
| CP869 | IBM Modern Greek | Microsoft & IBM | CP869 |
| ISO 8859-7 | ISOLatinGreek | International or National Standard | ISO-8859-7, Greek |
| MacGreek | Macintosh | MacGreek |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| MacGujarati | Macintosh | MacGujarati | |
| ISCII-Gujarati | Indian Standards | x-iscii-gu, windows-57010 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10010 | Macintosh Gurmukhi | Microsoft & IBM | CP10010 |
| MacGurmukhi | Macintosh | MacGurmukhi |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10005 | Macintosh Hebrew | Microsoft & IBM | CP10005 |
| CP1255 | Microsoft & IBM | CP1255 | |
| CP28598 | Hebrew Alphabet (ISO) | Microsoft & IBM | CP28598 |
| CP38598 | ASCII + Hebrew and private use characters | Microsoft & IBM | CP38598 |
| CP862 | Microsoft & IBM | CP862 | |
| ISO 8859-8 | ISOLatinHebrew | International or National Standard | Hebrew |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10079 | Macintosh Icelandic | Microsoft & IBM | CP10079 |
| CP861 | MS DOS Icelandic | Microsoft & IBM | CP861 |
| MacIcelandic | Macintosh | MacIcelandic |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CCSID 1027 | EBCDIK | Microsoft & IBM | CCSID-1027, CCSID1027 |
| CCSID 290 | EBCDIK | Microsoft & IBM | CCSID-290, CCSID290 |
| CCSID 930 | IBM | CCSID-930, CCSID930 | |
| CCSID 939 | IBM | CCSID-939, CCSID939 | |
| CCSID 942 | Microsoft & IBM | CCSID-942, CCSID942 | |
| CP10001 | Macintosh Japanese | Microsoft & IBM | CP10001 |
| CP20290 | (full/half width Latin & halfwidth katakana) | Microsoft & IBM | CP20290 |
| CP21027 | (halfwidth Latin, halfwidth katakana & private use) | Microsoft & IBM | CP21027 |
| EUC-JP | Unix | EUC-JP, EUC-J | |
| EUC-JP-JISROMAN | Unix | EUC-JP-JISROMAN | |
| ISO 2022-JP | International or National Standard | ISO-2022-JP | |
| JapaneseAutoDetect | For encodings, see JapaneseAutodetect | Rosette Autodetect | JapaneseAutoDetect |
| JIS_X_0201 | HalfWidthKatakana | International or National Standard | JIS_X_0201, IBM897 |
| JIS_X_0208 | International or National Standard | JIS_X_0208 | |
| MacJapanese | Macintosh | MacJapanese | |
| Shift-JISMS | MS_Kanji, CP932 | Microsoft & IBM | Shift-JIS, SJIS |
| Shift_JIS-2004 | ShiftJISX0213 | Microsoft & IBM | Shift_JISX0213, Shift-X |
| Shift-JIS78 | Shift-JIS without MS/IBM extensions | Unix/Macintosh | Shift-JIS78, SJIS78 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10003 | Macintosh Korean | Microsoft & IBM | CP10003 |
| CP1361 | Korean Johab (based on KSC 5861-1992) | Microsoft & IBM | CP1361 |
| CP949 | Microsoft & IBM | CP949 | |
| EUC-KR | KS_C_5861-1992 | Unix | EUC-KR, EUC-K |
| ISO 2022-KR | KS_C_5601-1987 | International or National Standard | ISO-2022-KR |
| Johab | International or National Standard | Johab | |
| KoreanAutoDetect | See KoreanAutodetect | Rosette Autodetect | KoreanAutoDetect |
| KoreanAutoDetect | See KoreanAutodetect | Rosette Autodetect | KoreanAutoDetect |
| KS_C_5601-1987 | ISO-2022-KR | International or National Standard | ISO-2022-KR |
| KS_C_5861-1992 | EUC-KR | International or National Standard | KS_C_5861-1992 |
| MacKorean | Macintosh | MacKorean |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10000 | Macintosh Roman | Microsoft & IBM | CP10000 |
| CP10029 | Macintosh Latin2 | Microsoft & IBM | CP10029 |
| CP10082 | (with mathematical symbols) | Microsoft & IBM | CP10082 |
| CCSID 1047 | EBCDIC (for IBM Open Systems platform) | Microsoft & IBM | CCSID1047 |
| CP20261 | (with private use characters) | Microsoft & IBM | CP20261 |
| CP20269 | Microsoft & IBM | CP20269 | |
| CP20273 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20273 |
| CP20277 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20277 |
| CP20278 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20278 |
| CP20280 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20280 |
| CP20284 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20284 |
| CP20285 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20285 |
| CP20297 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20297 |
| CP20833 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20833 |
| CP20871 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20871 |
| CP28591 | ASCII + Latin accented vowels | Microsoft & IBM | CP28591 |
| CP28593 | Latin 3 Alphabet (ISO) | Microsoft & IBM | CP28593 |
| CP850 | MS DOS Multilingual, MS-DOS Latin1 | Microsoft & IBM | CP850 |
| CP870 | (with fullwidth punctuation) | Microsoft & IBM | CP870 |
| ISO 8859-1 | Latin1 | International or National Standard | ISO-8859-1, Latin1, IBM819, iso-ir-100 |
| ISO 8859-15 | Latin1 + Euro symbol & accented characters | International or National Standard | ISO-8859-15, Latin9 |
| ISO 8859-2 | ISO_8859-2, Latin2, iso-ir-101 | International or National Standard | Latin2, ISO-8859-2 |
| MacRoman | Macintosh | MacRoman | |
| NextStep | Apple/Next | NextStep | |
| Adobe-Standard-Encoding | (used in PS printers) | Other Corporate | Adobe-Standard-Encoding |
| Adobe-Standard-Encoding | (used in PS printers) | Other Corporate | Adobe-Standard-Encoding |
| Latin, Canadian French | |||
| CP863 | MS DOS Canadian French | Microsoft & IBM | CP863 |
| Latin, Central European | |||
| CP28592 | Central European Alphabet (ISO) | Microsoft & IBM | CP28592 |
| MacCentralEuropean | Macintosh | MacCentralEuropean | |
| Latin, Eastern European | |||
| CP1250 | Microsoft & IBM | CP1250 | |
| Latin, Esperanto | |||
| CP20905 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20905 |
| Latin, Portugese | |||
| CP860 | MS DOS Portugese | Microsoft & IBM | CP860 |
| Latin, Southeast European | |||
| ISO 8859-3 | Latin3 | International or National Standard | Latin3, ISO-8859-3 |
| Latin, US English | |||
| ASCII | US-ASCII, CP367 | International or National Standard | ASCII |
| CP037 | EBCDIC | Microsoft & IBM | CP037 |
| CP1026 | EBCDIC | Microsoft & IBM | CP1026 |
| CP1252 | MS Windows Latin1 (ANSI) | Microsoft & IBM | CP1252 |
| CP20105 | US ASCII | Microsoft & IBM | CP20105 |
| CP437 | MS-DOS Latin US | Microsoft & IBM | CP437 |
| CP500 | EBCDIC | Microsoft & IBM | CP500 |
| CP875 | EBCDIC | Microsoft & IBM | CP875 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10017 | Macintosh Malayalam | Microsoft & IBM | CP10017 |
| ISCII-Malayalam | Indian Standards | x-iscii-ma, windows-57009 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP865 | MS DOS Nordic | Microsoft & IBM | CP865 |
| ISO 8859-10 | Latin6 | International or National Standard | Latin6, ISO-8859-10, iso-ir-157 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| MacRomanian | Macintosh | MacRomanian |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP852 | MS DOS Slavic | Microsoft & IBM | CP852 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| Adobe-Symbol-Encoding | (used in PS printers) | Adobe | Adobe-Symbol-Encoding |
| Adobe-Zapf-Dingbats-Encoding | (used in PS printers) | Adobe | Adobe-Zapf-Dingbats-Encoding |
| CP10008 | Macintosh RSymbol (Right-left symbol) | Microsoft & IBM | CP10008 |
| MacDingbats | Macintosh | MacDingbats | |
| MacSymbol | Macintosh | MacSymbol |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP20838 | (with fullwidth Latin & punctuation) | Microsoft & IBM | CP20838 |
| CP874 | IBMThai | Microsoft & IBM | CP874 |
| ISO 8859-11 (draft) | ISOLatinThai | International or National Standard | Thai |
| MacThai | Macintosh | MacThai |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP10081 | Macintosh Turkish | Microsoft & IBM | CP10081 |
| CP1254 | Microsoft & IBM | CP1254 | |
| CP28599 | Turkish (ISO) | Microsoft & IBM | CP28599 |
| CP857 | IBM Turkish | Microsoft & IBM | CP857 |
| ISO 8859-9 | Latin5 | International or National Standard | ISO-8859-9, Latin5, iso-ir-148 |
| MacTurkish | Macintosh | MacTurkish |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| MacUkrainian | Macintosh | MacUkrainian |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| CP1258 | Microsoft & IBM | CP1258 |
| Encoding | Other Names | Vendor/Standard Body | Other Rosette Names |
|---|---|---|---|
| BMP | Unicode | BMP, Unicode20:big-endian | |
| Java | (way of representing Unicode chars in ASCII) | Sun | Java, Unicode20:BOM:Java, Unicode11:Java, Unicode11:BOM:Java |
| UCS2 | ISO-10646-UCS2, UTF16 | Unicode | Unicode |
| Unicode Big-endian | Unicode | big-endian, Unicode20:big-endian, Unicode11:big-endian, Unicode11:BOM:big-endian | |
| Unicode Little-endian | Unicode | little-endian, Unicode20:little-endian, Unicode11:little-endian, Unicode11:BOM:little-endian | |
| Unicode11-UCS2 | Unicode | Unicode11-UCS2, Unicode11:UCS2, Unicode11:BOM:UCS2 | |
| Unicode11-UTF7 | Unicode | Unicode11-UTF7, Unicode11:UTF7, Unicode11:BOM:UTF7 | |
| Unicode11-UTF8 | Unicode | Unicode11-UTF8, Unicode11:UTF8, Unicode11:BOM:UTF8 | |
| UTF7 | Unicode | UTF7, Unicode20:BOM:UTF7 | |
| UTF8 | Unicode | UTF8, Unicode20:BOM:UTF8 | |
| UTF32 | Unicode | UTF32 | |
| UTF8 | Unicode | UTF8, Unicode20:BOM:UTF8 | |
| UTF-EBCDIC | Unicode | UTF8-EBCDIC, UTF-8-EBCDIC |