ANSI Character Set
and Equivalent Unicode and HTML Characters


The ANSI set of 217 characters, also known as Windows-1252, was the standard for the core fonts supplied with US versions of Microsoft Windows up to and including Windows 95 and Windows NT 4. During the lifetime of those two products, Microsoft added the euro currency symbol, bringing the number of characters to 218, and introduced a new core set of Pan-European fonts containing the WGL4 (Windows Glyph List 4) character set, which has 652 characters.

If you use a version of Windows that is designed for a non-Latin alphabet such as Arabic, Cyrillic, Greek, Hebrew or Thai to view a document that has been typed using the ANSI character set, then characters from these languages may replace some of those in the 128-255 range; this problem will be resolved when Unicode becomes more widely used, because it provides a unique numeric identifier for each character. There are similar problems when transferring ANSI documents to DOS or Macintosh computers, because DOS and MacRoman arrange characters differently in the 128-255 range.
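The effect described above can be reproduced with a short Python sketch. It decodes one byte value under several code pages using Python's built-in codecs. The codec names (`cp1251`, `cp874`, `mac_roman` and so on) are Python's names for these encodings, and the choice of byte 0xE9 and the helper name `view_byte` are assumptions made for illustration only.

```python
# Byte 0xE9 is "é" in Windows-1252 (ANSI); most other code pages
# assign a different character to the same value.
CODE_PAGES = {
    "cp1252": "Windows Western (ANSI)",
    "cp1251": "Windows Cyrillic",
    "cp1253": "Windows Greek",
    "cp1255": "Windows Hebrew",
    "cp1256": "Windows Arabic",
    "cp874": "Windows Thai",
    "cp437": "DOS (US)",
    "mac_roman": "Macintosh Roman",
}


def view_byte(value):
    """Decode one byte under each code page.

    Positions that a code page leaves unassigned come back as U+FFFD.
    """
    raw = bytes([value])
    return {name: raw.decode(name, errors="replace") for name in CODE_PAGES}


if __name__ == "__main__":
    for name, char in view_byte(0xE9).items():
        print(f"{CODE_PAGES[name]:24} {char}  U+{ord(char):04X}")
```

Each line of output shows a different Unicode code point for the same stored byte, which is why a document typed with one code page can look wrong under another.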

ANSI characters 32 to 127 correspond to those in the 7-bit ASCII character set, which forms the Basic Latin Unicode character range. Characters 160-255 correspond to those in the Latin-1 Supplement Unicode character range. Positions 128-159 in Latin-1 Supplement are reserved for controls, but most of them are used for printable characters in ANSI; the Unicode equivalents are noted in the table below. Entries in the "Entity" column are character entity references that can be used in HTML and should be interpreted correctly by Web browsers that support HTML 4.0.
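The three ranges described above can be checked with a small Python sketch. It assumes that Python's `cp1252` codec is an acceptable stand-in for the ANSI set; the function name `classify_ansi_bytes` is hypothetical.

```python
def classify_ansi_bytes():
    """Sort byte values 32-255 by how Windows-1252 maps them to Unicode.

    Returns (same, remapped, unused):
      same     - byte values whose Unicode number equals the ANSI number
      remapped - {byte value: character} where the Unicode number differs
      unused   - byte values that the codec leaves unassigned
    """
    same, remapped, unused = [], {}, []
    for value in range(32, 256):
        try:
            char = bytes([value]).decode("cp1252")
        except UnicodeDecodeError:
            unused.append(value)
            continue
        if ord(char) == value:
            same.append(value)
        else:
            remapped[value] = char
    return same, remapped, unused


if __name__ == "__main__":
    same, remapped, unused = classify_ansi_bytes()
    print("unused:", [hex(v) for v in unused])
    print("remapped:", "".join(remapped.values()))
```

With this codec, every remapped or unused position falls in the 128-159 range. Positions 32-126 and 160-255 (191 in total) plus the 27 remapped positions give the 218 characters mentioned earlier.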

The characters that appear in the first column of the following table are generated from Unicode numeric character references, and so they should appear correctly in any Web browser that supports Unicode, that has suitable fonts available, that is set to view Western European encoding and that has its Unicode options set correctly, regardless of the operating system.
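As an illustration of the two kinds of reference mentioned here, the following Python sketch builds the numeric character reference for a character and looks up its HTML 4 entity name, if it has one. It relies on `html.entities.codepoint2name` from the standard library; the helper name `html_references` is an assumption for this example.

```python
from html.entities import codepoint2name
from typing import Optional, Tuple


def html_references(char: str) -> Tuple[str, Optional[str]]:
    """Return (numeric reference, named entity or None) for one character."""
    code_point = ord(char)
    numeric = f"&#{code_point};"            # decimal Unicode number
    name = codepoint2name.get(code_point)   # HTML 4 entity name, if any
    named = f"&{name};" if name else None
    return numeric, named


if __name__ == "__main__":
    for char in "€Ž<":
        print(char, html_references(char))
```

The numeric form depends only on the Unicode number, so it is available for every character, whereas a named entity exists only for some of them.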

Abbreviations used in the tables:

  • Ch - Character
  • AN - ANSI number
  • UN - Unicode number
  • AH - ANSI Hex
  • U. Hex - Unicode Hex
  • HTML - HTML 4.0 entity
  • U. Range - Unicode Range
  • L.c.l. - Latin capital letter
  • L.s.l. - Latin small letter
  • G. - General
  • l.-p. - left-pointing
  • q. m. - quotation mark
  • r.-p. - right-pointing
  • m.l. - modifier letter(s)

Character set: 32 - 126
Unicode U+0020 - U+007E

Ch AN UN AH U. Hex HTML Unicode Name U. Range
32 32 0x20 U+0020   space Basic Latin
! 33 33 0x21 U+0021   exclamation mark - " -
" 34 34 0x22 U+0022 &quot; quotation mark - " -
# 35 35 0x23 U+0023   number sign - " -
$ 36 36 0x24 U+0024   dollar sign - " -
% 37 37 0x25 U+0025   percent sign - " -
& 38 38 0x26 U+0026 &amp; ampersand - " -
' 39 39 0x27 U+0027   apostrophe - " -
( 40 40 0x28 U+0028   left parenthesis - " -
) 41 41 0x29 U+0029   right parenthesis - " -
* 42 42 0x2A U+002A   asterisk - " -
+ 43 43 0x2B U+002B   plus sign - " -
, 44 44 0x2C U+002C   comma - " -
- 45 45 0x2D U+002D   hyphen-minus - " -
. 46 46 0x2E U+002E   full stop - " -
/ 47 47 0x2F U+002F   solidus - " -
0 48 48 0x30 U+0030   digit zero - " -
1 49 49 0x31 U+0031   digit one - " -
2 50 50 0x32 U+0032   digit two - " -
3 51 51 0x33 U+0033   digit three - " -
4 52 52 0x34 U+0034   digit four - " -
5 53 53 0x35 U+0035   digit five - " -
6 54 54 0x36 U+0036   digit six - " -
7 55 55 0x37 U+0037   digit seven - " -
8 56 56 0x38 U+0038   digit eight - " -
9 57 57 0x39 U+0039   digit nine - " -
: 58 58 0x3A U+003A   colon - " -
; 59 59 0x3B U+003B   semicolon - " -
< 60 60 0x3C U+003C &lt; less-than sign - " -
= 61 61 0x3D U+003D   equals sign - " -
> 62 62 0x3E U+003E &gt; greater-than sign - " -
? 63 63 0x3F U+003F   question mark - " -
@ 64 64 0x40 U+0040   commercial at - " -
A 65 65 0x41 U+0041   Latin capital letter A - " -
B 66 66 0x42 U+0042   - " - B - " -
C 67 67 0x43 U+0043   - " - C - " -
D 68 68 0x44 U+0044   - " - D - " -
E 69 69 0x45 U+0045   - " - E - " -
F 70 70 0x46 U+0046   - " - F - " -
G 71 71 0x47 U+0047   - " - G - " -
H 72 72 0x48 U+0048   - " - H - " -
I 73 73 0x49 U+0049   - " - I - " -
J 74 74 0x4A U+004A   - " - J - " -
K 75 75 0x4B U+004B   - " - K - " -
L 76 76 0x4C U+004C   - " - L - " -
M 77 77 0x4D U+004D   - " - M - " -
N 78 78 0x4E U+004E   - " - N - " -
O 79 79 0x4F U+004F   - " - O - " -
P 80 80 0x50 U+0050   - " - P - " -
Q 81 81 0x51 U+0051   - " - Q - " -
R 82 82 0x52 U+0052   - " - R - " -
S 83 83 0x53 U+0053   - " - S - " -
T 84 84 0x54 U+0054   - " - T - " -
U 85 85 0x55 U+0055   - " - U - " -
V 86 86 0x56 U+0056   - " - V - " -
W 87 87 0x57 U+0057   - " - W - " -
X 88 88 0x58 U+0058   - " - X - " -
Y 89 89 0x59 U+0059   - " - Y - " -
Z 90 90 0x5A U+005A   - " - Z - " -
[ 91 91 0x5B U+005B   left square bracket - " -
\ 92 92 0x5C U+005C   reverse solidus - " -
] 93 93 0x5D U+005D   right square bracket - " -
^ 94 94 0x5E U+005E   circumflex accent - " -
_ 95 95 0x5F U+005F   low line - " -
` 96 96 0x60 U+0060   grave accent - " -
a 97 97 0x61 U+0061   Latin small letter a - " -
b 98 98 0x62 U+0062   - " - b - " -
c 99 99 0x63 U+0063   - " - c - " -
d 100 100 0x64 U+0064   - " - d - " -
e 101 101 0x65 U+0065   - " - e - " -
f 102 102 0x66 U+0066   - " - f - " -
g 103 103 0x67 U+0067   - " - g - " -
h 104 104 0x68 U+0068   - " - h - " -
i 105 105 0x69 U+0069   - " - i - " -
j 106 106 0x6A U+006A   - " - j - " -
k 107 107 0x6B U+006B   - " - k - " -
l 108 108 0x6C U+006C   - " - l - " -
m 109 109 0x6D U+006D   - " - m - " -
n 110 110 0x6E U+006E   - " - n - " -
o 111 111 0x6F U+006F   - " - o - " -
p 112 112 0x70 U+0070   - " - p - " -
q 113 113 0x71 U+0071   - " - q - " -
r 114 114 0x72 U+0072   - " - r - " -
s 115 115 0x73 U+0073   - " - s - " -
t 116 116 0x74 U+0074   - " - t - " -
u 117 117 0x75 U+0075   - " - u - " -
v 118 118 0x76 U+0076   - " - v - " -
w 119 119 0x77 U+0077   - " - w - " -
x 120 120 0x78 U+0078   - " - x - " -
y 121 121 0x79 U+0079   - " - y - " -
z 122 122 0x7A U+007A   - " - z - " -
{ 123 123 0x7B U+007B   left curly bracket - " -
| 124 124 0x7C U+007C   vertical line - " -
} 125 125 0x7D U+007D   right curly bracket - " -
~ 126 126 0x7E U+007E   tilde - " -

Character set: 127 - 255
Unicode U+007F - U+00FF

Ch AN UN AH U. Hex HTML Unicode Name U. Range
 127 127 0x7F U+007F   (not used) Basic Latin
€ 128 8364 0x80 U+20AC &euro; euro sign Currency Symbols
129 129 0x81 U+0081   (not used)
‚ 130 8218 0x82 U+201A &sbquo; single low-9 q. m. G. Punctuation
ƒ 131 402 0x83 U+0192 &fnof; L.s.l. f with hook Latin Extended-B
„ 132 8222 0x84 U+201E &bdquo; double low-9 q. m. G. Punctuation
… 133 8230 0x85 U+2026 &hellip; horizontal ellipsis - " -
† 134 8224 0x86 U+2020 &dagger; dagger - " -
‡ 135 8225 0x87 U+2021 &Dagger; double dagger - " -
ˆ 136 710 0x88 U+02C6 &circ; m.l. circumflex accent Spacing m.l.
‰ 137 8240 0x89 U+2030 &permil; per mille sign G. Punctuation
Š 138 352 0x8A U+0160 &Scaron; L.c.l. S with caron Latin Extended-A
‹ 139 8249 0x8B U+2039 &lsaquo; single l.-p. angle q. m. G. Punctuation
Œ 140 338 0x8C U+0152 &OElig; Latin capital ligature OE Latin Extended-A
141 141 0x8D U+008D   (not used)
Ž 142 381 0x8E U+017D   L.c.l. Z with caron Latin Extended-A
143 143 0x8F U+008F   (not used)
144 144 0x90 U+0090   (not used)
‘ 145 8216 0x91 U+2018 &lsquo; left single q. m. G. Punctuation
’ 146 8217 0x92 U+2019 &rsquo; right single q. m. - " -
“ 147 8220 0x93 U+201C &ldquo; left double q. m. - " -
” 148 8221 0x94 U+201D &rdquo; right double q. m. - " -
• 149 8226 0x95 U+2022 &bull; bullet - " -
– 150 8211 0x96 U+2013 &ndash; en dash - " -
— 151 8212 0x97 U+2014 &mdash; em dash - " -
˜ 152 732 0x98 U+02DC &tilde; small tilde Spacing m.l.
™ 153 8482 0x99 U+2122 &trade; trade mark sign Letterlike Symbols
š 154 353 0x9A U+0161 &scaron; L.s.l. s with caron Latin Extended-A
› 155 8250 0x9B U+203A &rsaquo; single r.-p. angle q. m. G. Punctuation
œ 156 339 0x9C U+0153 &oelig; Latin small ligature oe Latin Extended-A
157 157 0x9D U+009D   (not used)
ž 158 382 0x9E U+017E   L.s.l. z with caron Latin Extended-A
Ÿ 159 376 0x9F U+0178 &Yuml; L.c.l. Y with diaeresis Latin Extended-A
160 160 0xA0 U+00A0 &nbsp; no-break space Latin-1 Supplement
¡ 161 161 0xA1 U+00A1 &iexcl; inverted exclamation mark - " -
¢ 162 162 0xA2 U+00A2 &cent; cent sign - " -
£ 163 163 0xA3 U+00A3 &pound; pound sign - " -
¤ 164 164 0xA4 U+00A4 &curren; currency sign - " -
¥ 165 165 0xA5 U+00A5 &yen; yen sign - " -
¦ 166 166 0xA6 U+00A6 &brvbar; broken bar - " -
§ 167 167 0xA7 U+00A7 &sect; section sign - " -
¨ 168 168 0xA8 U+00A8 &uml; diaeresis - " -
© 169 169 0xA9 U+00A9 &copy; copyright sign - " -
ª 170 170 0xAA U+00AA &ordf; feminine ordinal indicator - " -
« 171 171 0xAB U+00AB &laquo; l.-p. double angle q. m. - " -
¬ 172 172 0xAC U+00AC &not; not sign - " -
­ 173 173 0xAD U+00AD &shy; soft hyphen - " -
® 174 174 0xAE U+00AE &reg; registered sign - " -
¯ 175 175 0xAF U+00AF &macr; macron - " -
° 176 176 0xB0 U+00B0 &deg; degree sign - " -
± 177 177 0xB1 U+00B1 &plusmn; plus-minus sign - " -
² 178 178 0xB2 U+00B2 &sup2; superscript two - " -
³ 179 179 0xB3 U+00B3 &sup3; superscript three - " -
´ 180 180 0xB4 U+00B4 &acute; acute accent - " -
µ 181 181 0xB5 U+00B5 &micro; micro sign - " -
¶ 182 182 0xB6 U+00B6 &para; pilcrow sign - " -
· 183 183 0xB7 U+00B7 &middot; middle dot - " -
¸ 184 184 0xB8 U+00B8 &cedil; cedilla - " -
¹ 185 185 0xB9 U+00B9 &sup1; superscript one - " -
º 186 186 0xBA U+00BA &ordm; masculine ordinal indicator - " -
» 187 187 0xBB U+00BB &raquo; r.-p. double angle q. m. - " -
¼ 188 188 0xBC U+00BC &frac14; vulgar fraction one quarter - " -
½ 189 189 0xBD U+00BD &frac12; vulgar fraction one half - " -
¾ 190 190 0xBE U+00BE &frac34; vulgar fraction three quarters - " -
¿ 191 191 0xBF U+00BF &iquest; inverted question mark - " -
À 192 192 0xC0 U+00C0 &Agrave; L.c.l. A with grave - " -
Á 193 193 0xC1 U+00C1 &Aacute; L.c.l. A with acute - " -
Â 194 194 0xC2 U+00C2 &Acirc; L.c.l. A with circumflex - " -
Ã 195 195 0xC3 U+00C3 &Atilde; L.c.l. A with tilde - " -
Ä 196 196 0xC4 U+00C4 &Auml; L.c.l. A with diaeresis - " -
Å 197 197 0xC5 U+00C5 &Aring; L.c.l. A with ring above - " -
Æ 198 198 0xC6 U+00C6 &AElig; L.c.l. AE - " -
Ç 199 199 0xC7 U+00C7 &Ccedil; L.c.l. C with cedilla - " -
È 200 200 0xC8 U+00C8 &Egrave; L.c.l. E with grave - " -
É 201 201 0xC9 U+00C9 &Eacute; L.c.l. E with acute - " -
Ê 202 202 0xCA U+00CA &Ecirc; L.c.l. E with circumflex - " -
Ë 203 203 0xCB U+00CB &Euml; L.c.l. E with diaeresis - " -
Ì 204 204 0xCC U+00CC &Igrave; L.c.l. I with grave - " -
Í 205 205 0xCD U+00CD &Iacute; L.c.l. I with acute - " -
Î 206 206 0xCE U+00CE &Icirc; L.c.l. I with circumflex - " -
Ï 207 207 0xCF U+00CF &Iuml; L.c.l. I with diaeresis - " -
Ð 208 208 0xD0 U+00D0 &ETH; L.c.l. Eth - " -
Ñ 209 209 0xD1 U+00D1 &Ntilde; L.c.l. N with tilde - " -
Ò 210 210 0xD2 U+00D2 &Ograve; L.c.l. O with grave - " -
Ó 211 211 0xD3 U+00D3 &Oacute; L.c.l. O with acute - " -
Ô 212 212 0xD4 U+00D4 &Ocirc; L.c.l. O with circumflex - " -
Õ 213 213 0xD5 U+00D5 &Otilde; L.c.l. O with tilde - " -
Ö 214 214 0xD6 U+00D6 &Ouml; L.c.l. O with diaeresis - " -
× 215 215 0xD7 U+00D7 &times; multiplication sign - " -
Ø 216 216 0xD8 U+00D8 &Oslash; L.c.l. O with stroke - " -
Ù 217 217 0xD9 U+00D9 &Ugrave; L.c.l. U with grave - " -
Ú 218 218 0xDA U+00DA &Uacute; L.c.l. U with acute - " -
Û 219 219 0xDB U+00DB &Ucirc; L.c.l. U with circumflex - " -
Ü 220 220 0xDC U+00DC &Uuml; L.c.l. U with diaeresis - " -
Ý 221 221 0xDD U+00DD &Yacute; L.c.l. Y with acute - " -
Þ 222 222 0xDE U+00DE &THORN; L.c.l. Thorn - " -
ß 223 223 0xDF U+00DF &szlig; L.s.l. sharp s - " -
à 224 224 0xE0 U+00E0 &agrave; L.s.l. a with grave - " -
á 225 225 0xE1 U+00E1 &aacute; L.s.l. a with acute - " -
â 226 226 0xE2 U+00E2 &acirc; L.s.l. a with circumflex - " -
ã 227 227 0xE3 U+00E3 &atilde; L.s.l. a with tilde - " -
ä 228 228 0xE4 U+00E4 &auml; L.s.l. a with diaeresis - " -
å 229 229 0xE5 U+00E5 &aring; L.s.l. a with ring above - " -
æ 230 230 0xE6 U+00E6 &aelig; L.s.l. ae - " -
ç 231 231 0xE7 U+00E7 &ccedil; L.s.l. c with cedilla - " -
è 232 232 0xE8 U+00E8 &egrave; L.s.l. e with grave - " -
é 233 233 0xE9 U+00E9 &eacute; L.s.l. e with acute - " -
ê 234 234 0xEA U+00EA &ecirc; L.s.l. e with circumflex - " -
ë 235 235 0xEB U+00EB &euml; L.s.l. e with diaeresis - " -
ì 236 236 0xEC U+00EC &igrave; L.s.l. i with grave - " -
í 237 237 0xED U+00ED &iacute; L.s.l. i with acute - " -
î 238 238 0xEE U+00EE &icirc; L.s.l. i with circumflex - " -
ï 239 239 0xEF U+00EF &iuml; L.s.l. i with diaeresis - " -
ð 240 240 0xF0 U+00F0 &eth; L.s.l. eth - " -
ñ 241 241 0xF1 U+00F1 &ntilde; L.s.l. n with tilde - " -
ò 242 242 0xF2 U+00F2 &ograve; L.s.l. o with grave - " -
ó 243 243 0xF3 U+00F3 &oacute; L.s.l. o with acute - " -
ô 244 244 0xF4 U+00F4 &ocirc; L.s.l. o with circumflex - " -
õ 245 245 0xF5 U+00F5 &otilde; L.s.l. o with tilde - " -
ö 246 246 0xF6 U+00F6 &ouml; L.s.l. o with diaeresis - " -
÷ 247 247 0xF7 U+00F7 &divide; division sign - " -
ø 248 248 0xF8 U+00F8 &oslash; L.s.l. o with stroke - " -
ù 249 249 0xF9 U+00F9 &ugrave; L.s.l. u with grave - " -
ú 250 250 0xFA U+00FA &uacute; L.s.l. u with acute - " -
û 251 251 0xFB U+00FB &ucirc; L.s.l. u with circumflex - " -
ü 252 252 0xFC U+00FC &uuml; L.s.l. u with diaeresis - " -
ý 253 253 0xFD U+00FD &yacute; L.s.l. y with acute - " -
þ 254 254 0xFE U+00FE &thorn; L.s.l. thorn - " -
ÿ 255 255 0xFF U+00FF &yuml; L.s.l. y with diaeresis - " -
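
Rows like those in the tables above can be regenerated from standard data rather than typed by hand. The Python sketch below is one possible approach, not the method used to build this page. It assumes Python's `cp1252` codec for the ANSI mapping, `unicodedata` for character names and `html.entities.codepoint2name` for HTML 4 entities. The function name `table_row`, the `-` placeholder for a missing entity and the omission of the Unicode range column are choices made for this example.

```python
import unicodedata
from html.entities import codepoint2name


def table_row(ansi: int) -> str:
    """Build one row: Ch, AN, UN, AH, U. Hex, HTML entity, Unicode name."""
    ansi_hex = f"0x{ansi:02X}"
    try:
        char = bytes([ansi]).decode("cp1252")
    except UnicodeDecodeError:
        # Position not assigned in Windows-1252.
        return f"{ansi} {ansi} {ansi_hex} U+{ansi:04X} (not used)"
    code_point = ord(char)
    entity = codepoint2name.get(code_point)
    entity_text = f"&{entity};" if entity else "-"
    # Control characters have no name, so supply a default.
    name = unicodedata.name(char, "(no name)").lower()
    return (f"{char} {ansi} {code_point} {ansi_hex} "
            f"U+{code_point:04X} {entity_text} {name}")


if __name__ == "__main__":
    for ansi in range(128, 160):
        print(table_row(ansi))
```

The names come out in full (for example "latin small letter e with acute") rather than in the abbreviated form used in the tables.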

Problems with non-Latin scripts

If you use the Arabic, Greek, Hebrew, Russian or Thai versions of Microsoft Windows to view a file that uses Latin script and includes accented characters, then the accented characters may be replaced or omitted. For example:

[Figure: the same ANSI characters as displayed by US Windows, Russian Windows and Thai Windows]

This happens because the characters for these non-Latin scripts are coded to the same numbers as the accented Latin characters in the ANSI character set; this problem will be resolved when Unicode becomes more widely used, because it provides a unique numeric identifier for each character.
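
The collision can be simulated, and in simple cases undone, with a few lines of Python. This is a minimal sketch that assumes the file was written in Windows-1252 and viewed as Windows-1251 (Cyrillic). The function names `misread` and `repair` are hypothetical, and the repair only works when every byte survived the wrong decoding.

```python
def misread(text, written_as="cp1252", viewed_as="cp1251"):
    """Show how text saved in one code page looks when read in another."""
    return text.encode(written_as).decode(viewed_as, errors="replace")


def repair(garbled, viewed_as="cp1251", written_as="cp1252"):
    """Reverse the wrong decoding to recover the original text."""
    return garbled.encode(viewed_as).decode(written_as)


if __name__ == "__main__":
    seen = misread("café")
    print(seen)            # the accented letter shows as a Cyrillic letter
    print(repair(seen))    # back to the original text

    # In Unicode the two letters have different numbers, so their
    # UTF-8 encodings differ and no such collision can occur.
    print("é".encode("utf-8"), "й".encode("utf-8"))
```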

________
Copyright © 1997 - 2001 Alan Wood
URL: http://www.hclrss.demon.co.uk/demos/