Tuesday, January 17, 2012

Character Set HTML

Character set digunakan untuk memberitahukan kepada browser bahwa character encoding yang mana perlu digunakan untuk meng-encode karakter-karakter.

Untuk menetapkan character set didalam dokumen HTML, gunakan elemen META. Untuk menetapkan character set bagi script block atau dokumen yang dilink, gunakan atribut CHARSET.

Berikut daftar sebagian kecil character set:

Deskripsi Nama Charset Nama Lain
Arabic (ISO) iso-8859-6 iso-ir-127, ISO_8859-6, ISO-8859-6:1987, ECMA-114, ASMO-708, arabic, csISOLatinArabic
Baltic (ISO) iso-8859-4 iso-ir-110, ISO_8859-4, ISO-8859-4:1988, latin4, l4, csISOLatin4
Central European (ISO) iso-8859-2 iso-ir-101, ISO_8859-2, ISO-8859-2:1987, latin2, l2, csISOLatin2
Chinese Traditional (Big5) big5 csBig5, cn-big5, x-x-big5
Cyrillic (ISO | Windows) iso-8859-5 windows-1251, iso-ir-144, ISO_8859-5, ISO-8859-5:1988, cyrillic, csISOLatinCyrillic, KOI8-U
Japanese (Katakana) JIS_C6220-1969-jp JIS_C6220-1969, iso-ir-13, katakana, x0201-7, csISO13JISC6220jp
Korean (ISO) iso-2022-kr csISO2022KR
Latin 3 (ISO) iso-8859-3 iso-ir-109, ISO_8859-3, ISO-8859-3:1988, latin3, l3, csISOLatin3
Latin 9 (ISO) iso-8859-15 ISO_8859-15, Latin-9, l9, csISOLatin9
Turkish (ISO | Windows) iso-8859-9 windows-1254, iso-ir-148, ISO_8859-9, ISO-8859-9:1989, latin5, l5, csISOLatin5
Unicode utf-16 unicode
Unicode (UTF-16 Big-Endian) unicodeFFFE UTF-16BE
Unicode (UTF-32 Big-Endian) utf-32BE -
Unicode (UTF-32) utf-32 -
Unicode (UTF-7) utf-7 UNICODE-1-1-UTF-7, csUnicode11UTF7, x-unicode-2-0-utf-7
Unicode (UTF-8) utf-8 unicode-1-1-utf-8, unicode-2-0-utf-8, x-unicode-2-0-utf-8
US-ASCII us-ascii iso-ir-6, ANSI_X3.4-1986, ISO_646.irv:1991, ASCII, ISO646-US, us, IBM367, cp367, csASCII
Western European (ISO) iso-8859-1 iso-ir-100, ISO_8859-1, ISO_8859-1:1987, latin1, l1, IBM819, CP819, csISOLatin1