RSS

Understanding Code Pages and Full List of Code Pages

Understanding Code Pages and Full List of Code Pages

What is CODEPAGE

A code page is a character encoding standard used to represent text in computers. It defines a specific set of characters and their corresponding byte values. Each code page allows for the representation of a unique subset of characters from a given character set, enabling the proper display and manipulation of text data in different languages and scripts.

Key points about code pages:

  1. Character Representation: Each code page maps specific byte values to characters. For example, a code page may include characters for the English alphabet, numbers, punctuation, and symbols.

  2. Localization: Different code pages are used to accommodate different languages and regional character sets. For instance, code pages for Western European languages differ from those used for Asian languages.

  3. Examples: Common code pages include:

    • ASCII (Code Page 437): Represents basic English characters.
    • ISO 8859-1: Extends ASCII to include characters for many Western European languages.
    • UTF-8: A variable-length encoding that can represent all Unicode characters and is widely used on the web.
  4. Database Context: In database management systems, specifying a code page is important for ensuring that text data is stored and retrieved correctly, particularly when dealing with multiple languages or special characters.

Understanding code pages is essential for text processing, data interchange, and ensuring compatibility across different systems and applications.

Common codepages

Here are some of the most common code pages:

  1. ASCII (Code Page 437)

    • Description: Represents basic English characters and control codes.
    • Range: 0-127.
  2. ISO 8859 Series

    • ISO 8859-1 (Latin-1): Western European languages (e.g., English, French, German).
    • ISO 8859-2 (Latin-2): Central and Eastern European languages (e.g., Polish, Czech).
    • ISO 8859-5: Cyrillic script (e.g., Russian).
    • ISO 8859-6: Arabic script.
    • ISO 8859-7: Greek script.
    • ISO 8859-9 (Latin-5): Turkish language.
  3. Windows Code Pages

    • Windows-1250: Central and Eastern European languages.
    • Windows-1251: Cyrillic script (e.g., Russian).
    • Windows-1252: Western European languages (similar to ISO 8859-1).
    • Windows-1254: Turkish language.
    • Windows-1255: Hebrew script.
  4. UTF Encodings

    • UTF-8: Variable-length encoding for all Unicode characters, widely used on the web.
    • UTF-16: Fixed-length encoding for most common characters, used in many Windows applications.
  5. EBCDIC

    • Description: Used mainly in IBM mainframes. Common variants include:
      • EBCDIC US-Canada
      • EBCDIC International
  6. GBK(Code page 936) Code Page 936 is a character encoding used primarily for simplified Chinese characters. It is also known as GBK (Guo Biao Kuozhan) and is an extension of the GB2312 character set. Here are some key points about Code Page 936:

  • Character Set:
    • Supports simplified Chinese characters, including a wide range of characters used in modern Chinese texts.
    • Includes additional characters beyond those found in GB2312, making it suitable for a broader range of applications.
  • Encoding:
    • It is a double-byte encoding system, meaning that most characters are represented by two bytes, allowing for a larger character set.
    • This encoding can represent more than 21,000 characters.
  • Usage:
    • Commonly used in Windows environments for applications and documents that require simplified Chinese.
    • Widely used in software, databases, and web pages targeting Chinese-speaking users.
  • Compatibility:
    • GBK is backward compatible with GB2312, meaning any character in GB2312 can be represented in GBK.
    • It is often used in conjunction with other encodings, such as UTF-8, to support mixed-language content.
  • Code Page 936 (GBK) is an important encoding standard for simplified Chinese, providing extensive character coverage and compatibility with earlier standards. It plays a crucial role in text processing, software development, and data storage in contexts involving Chinese language content.

These code pages are essential for text representation in various languages and are widely used in different systems. UTF-8 has become the most popular encoding for web content due to its ability to handle a wide variety of characters.

All Codepage list

37 (=x0025) EBCDIC US English
273 (=x0111) EBCDIC German
277 (=x0115) EBCDIC Danish/Norwegian
278 (=x0116) EBCDIC Finnish/Swedish
280 (=x0118) EBCDIC Italian
284 (=x011C) EBCDIC Spanish
285 (=x011D) EBCDIC UK English
297 (=x0129) EBCDIC French
300 (=x012C) EBCDIC Japanese DBCS
301 (=x012D) Japanese PC DBCS
420 (=x01A4) EBCDIC Arabic
424 (=x01A8) EBCDIC Arabic
437 (=x01B5) PC-ASCII US
500 (=x01F4) EBCDIC International
803 (=x0323) Hebrew Set A
813 (=x032D) ISO8859-7 Greek
819 (=x0333) ISO8859-1 Western European
833 (=x0341) IBM-833: Korean
834 (=x0342) IBM-834: Korean Host DBCS
835 (=x0343) EBCDIC Traditional Chinese DBCS
836 (=x0344) EBCDIC Simplified Chinese SBCS
838 (=x0346) EBCDIC Thai SBCS
850 (=x0352) ISO8859-1 Western European
852 (=x0354) PC-ASCII Eastern European
855 (=x0357) PC-ASCII Cyrillic
856 (=x0358) PC-ASCII Hebrew
857 (=x0359) PC-ASCII Turkish
858 (=x035A) PC-ASCII Western European with Euro
860 (=x035C) PC-ASCII Portuguese
861 (=x035D) PC-ASCII Icelandic
862 (=x035E) PC-ASCII Hebrew
863 (=x035F) PC-ASCII Canadian French
864 (=x0360) PC-ASCII Arabic
865 (=x0361) PC-ASCII Scandinavian
866 (=x0362) PC-ASCII Cyrillic #2
868 (=x0364) PC-ASCII Urdu
869 (=x0365) PC-ASCII Greek
870 (=x0366) EBCDIC Eastern Europe
871 (=x0367) EBCDIC Icelandic
872 (=x0368) PC-ASCII Cyrillic with Euro
874 (=x036A) PC-ASCII Thai SBCS
875 (=x036B) EBCDIC Greek
880 (=x0370) EBCDIC Cyrillic
891 (=x037B) IBM-891: Korean
897 (=x0381) PC-ASCII Japan Data SBCS
903 (=x0387) PC Simplified Chinese SBCS
904 (=x0388) PC Traditional Chinese Data - SBCS
912 (=x0390) ISO8859-2 Eastern European
915 (=x0393) ISO8859-5 Cyrillic
916 (=x0394) ISO8859-8 Hebrew
918 (=x0396) EBCDIC Urdu
920 (=x0398) ISO8859-9 Turkish
921 (=x0399) ISO Baltic
922 (=x039A) ISO Estonian
923 (=x039B) ISO8859-15 Western Europe with euro (Latin 9)
924 (=x039C) EBCDIC Western Europe with euro
927 (=x039F) PC Traditional Chinese DBCS
928 (=x03A0) PC Simplified Chinese DBCS
930 (=x03A2) EBCDIC Japanese Katakana/Kanji mixed
932 (=x03A4) Japanese OS/2
933 (=x03A5) EBCDIC Korean Mixed
935 (=x03A7) EBCDIC Simplified Chinese Mixed
936 (=0x3A8) GBK (Guo Biao Kuozhan) and is an extension of the GB2312 character set
937 (=x03A9) EBCDIC Traditional Chinese Mixed
939 (=x03AB) EBCDIC Japanese Latin/Kanji mixed
941 (=x03AD) Japanese PC DBCS - for open systems
942 (=x03AE) Japanese PC Data Mixed - extended SBCS
943 (=x03AF) Japanese PC Mixed - for open systems
944 (=x03BO) Korean PC data Mixed - extended SBCS
946 (=x03B2) Simplified Chinese PC data Mixed - extended SBCS
947 (=x03B3) PC Traditional Chinese DBCS
948 (=x03B4) PC Traditional Chinese Mixed - extended SBCS
949 (=x03B5) PC Korean Mixed - KS code
950 (=x03B6) PC Traditional Chinese Mixed - big5
951 (=x03B7) PC Korean DBCS - KS code
970 (=x03CA) euc Korean
1004 (=x03EC) PC Data Latin1
1006 (=x03EE) ISO Urdu
1008 (=x03F0) ASCII Arabic 8-bit ISO
1025 (=x0401) EBCDIC Cyrillic
1026 (=x0402) EBCDIC Turkish
1027 (=x0403) EBCDIC Japanese Latin
1040 (=x0410) IBM-1040: Korean
1041 (=x0411) Japanese PC - extended SBCS
1042 (=x0412) PC Simplified Chinese - extended SBCS
1043 (=x0413) PC Traditional Chinese - extended SBCS
1046 (=x0416) PC-ASCII Arabic
1047 (=x0417) IBM-1047: Western European
1051 (=x041B) ASCII roman8 for HP Western European
1088 (=x0440) PC Korean SBCS - KS code
1089 (=x0441) ISO8859-6 Arabic
1097 (=x0449) EBCDIC Farsi
1098 (=x044A) PC-ASCII Farsi
1112 (=x0458) EBCDIC Baltic (Latvian/Lithuanian)
1114 (=x045A) PC Traditional Chinese - big 5 SBCS
1115 (=x045B) PC Simplified Chinese SBCS
1122 (=x0462) EBCDIC Estonian
1123 (=x0463) EBCDIC Ukrainian
1124 (=x0464) UNIX-ASCII Ukrainian
1131 (=x046B) PC-ASCII Belarus
1140 (=x0474) EBCDIC USA, with euro (like 037)
1141 (=x0475) EBCDIC Austria, Germany, with euro (like 273)
1142 (=x0476) EBCDIC Denmark, Norway, with euro (like 277)
1143 (=x0477) EBCDIC Finland, Sweden, with euro (like 278)
1144 (=x0478) EBCDIC Italy, with euro (like 280)
1145 (=x0479) EBCDIC Spain, with euro (like 284)
1146 (=x047A) EBCDIC UK, with euro (like 285)
1147 (=x047B) EBCDIC France, with euro (like 297)
1148 (=x047C) EBCDIC International, with euro (like 500)
1149 (=x047D) EBCDIC Iceland, with euro (like 871)
1200 (=x04B0) Unicode - UCS-2
1208 (=x04B8) Unicode - UTF-8
1250 (=x04E2) Windows - Eastern European
1251 (=x04E3) Windows - Cyrillic
1252 (=x04E4) Windows - Western European
1253 (=x04E5) Windows - Greek
1254 (=x04E6) Windows - Turkish
1255 (=x04E7) Windows - Hebrew
1256 (=x04E8) Windows - Arabic
1257 (=x04E9) Windows - Baltic Rim
1275 (=x04FB) Apple - Western European
1280 (=x0500) Apple - Greek
1281 (=x0501) Apple - Turkish
1282 (=x0502) Apple - Eastern European
1283 (=x0503) Apple - Cyrillic
1284 (=x0504) IBM-504: Eastern European
1285 (=x0505) IBM-505: Eastern European
1363 (=x0553) Windows Korean PC Mixed including 11,172 full hangul
1364 (=x0554) Korean Host Mixed extended including 11,172 full hangul
1380 (=x0564) PC Simplified Chinese DBCS
1381 (=x0565) PC Simplified Chinese Mixed
1383 (=x0567) euc Simplified Chinese Mixed
1386 (=x056A) PC Simplified Chinese Data GBK Mixed
1388 (=x056C) DBCS Host Simplified Chinese Data GBK Mixed
5346 (=x14E2) Windows-Eastern European with Euro (like 1250)
5347 (=x14E3) Windows - Cyrillic with Euro (like 1251)
5348 (=x14E4) Windows-Western European with Euro (like 1252)
5349 (=x14E5) Windows-Windows - Greek with Euro (like 1253)
5350 (=x14E6) Windows - Turkish with Euro (like 1254)
5351 (=x14E7) Windows - Hebrew with Euro (like 1255)
5352 (=x14E8) Windows - Arabic with Euro (like 1256)
5353 (=x14E9) Windows - Baltic Rim with Euro (like 1257)
5354 (=x14EA) 'Windows - Vietnamese with Euro (like 1258)

OmniLock - Block / Hide App on iOS

Block distractive apps from appearing on the Home Screen and App Library, enhance your focus and reduce screen time.

DNS Firewall for iOS and Mac OS

Encrypted your DNS to protect your privacy and firewall to block phishing, malicious domains, block ads in all browsers and apps

Ad