Input was: numeral
 Input does not describe a codepoint...
Character search 𝍠 𝍡 𝍢 𝍣 𝍤 𝍥 𝍦 𝍫 𝍬 𝍭 𝍰 𝍱 𝍨 𝍮 𝍯 𝍧 𝍩 𝍪 𐅖 𐅚 𐅝 0 1 2 3 4 5 6 7 𐅢 𐅣 𐅤 𐅥 𐅘 𐹦 ◌ ◌ 𐅙 ٩ 𐅰 𐅱 𒑀 𐅲 𐅳 𐅴 𐅵 𐅶 𐅷 𐅸 # 𐅹 𐅜 𐅺 𐅻 𐅼 𒐧 𐅽 𐅾 𐅿 𐄀 𐄁 𐄂 𐄇 𐄈 𐄉 𐄊 𐄋 𐄌 𐄍 𐄎 𐄏 𐄐 𐄑 𐄒 𐄓 𐄔 𐄕 𐄖 𐄗 𐄘 𐄙 𐄚 𐄛 𐄜 𐄝 𐄞 𐄟 𐄠 𐄢 𐄣 𐄤 𐄥 𐄦 𐄨 𐄩 𐄪 𐄫 𐄬 𐄮 𐄯 𐄰 𐄱 𐄲 𐄷 𐄸 𐄺 𐄻 𐄼 𐄽 𐄾 𐅀 𐅁 𐅂 𐅃 𐅄 𐅆 𐅇 𐅈 𐅉 𐅊 𐅌 𐅍 𐅎 𐅏 8 9 𐅛 𐅟 𐆈 𐆉 𐆊 𐅗 𒑔 𐅦 𐅧 𐅠 𒐉 𐅨 𒑡 𒑢 𐅩 𐅪 𒑱 𐅬 𐅭 𐅮 𐅯 𐌠 𐌡 𐌢 𐌣 𐆃 ʹ ◌͵ 𒐀 𒐁 𒐂 𒐃 𒐄 𒐅 𒐆 𒐇 𒐈 𐄳 𒐊 𒐋 𒐌 𒐍 𒐎 𒐏 𒐐 𒐑 𒐒 𒐓 𒐔 𒐕 𒐖 𒐗 𒐘 𒐙 𒐚 𒐛 𒐜 𒐝 𒐞 𒐟 𒐠 𒐡 𒐢 𒐣 𒐤 𒐥 𒐦 𒐨 𒐩 𒐪 𒐫 𒐬 𒐭 𒐮 𒐯 𒐰 𒐱 𒐲 𒐳 𒐴 𒐶 𒐷 𒐸 𒐹 𒐺 𒐻 𒐼 𒐽 𒐾 𒐿 𒑁 𒑂 𒑃 𒑄 𒑅 𒑆 𒑇 𒑈 𒑉 𒑊 𒑋 𒑌 𒑍 𒑎 𒑏 𒑐 𒑑 𒑒 𒑓 𒑕 𒑖 𒑗 𒑘 𒑙 𒑚 𒑛 𒑜 𒑝 𒑞 𒑟 𒑰 𒑲 𐆀 𐆁 𐅫 𐆂 𐄿 𐆄 𐅞 𐆅 𐆆 𐆇 𐅑 ٠ ١ ٢ ٣ 𐹥 𐹠 𐹡 𐹢 𐹣 𐹤 ٥ ٦ ٧ ٨ 𐹩 𐹪 𐹧 𐹬 𐹭 𐹮 𐹯 𐹰 𐹨 𐹲 𐹳 𐹴 𐹵 𐹶 𐹷 𐹸 𐹹 𐹺 𐹻 𐹼 𐹽 𐹾 𐹫 𒑳 𐹱 𐄡 𐄧 ۰ ۱ ۲ ۳ ۴ ۵ ۶ ۷ ۸ ۹ 𐄭 𐄹 𒐵 𐅡 𐅐 𐅅 𒑠 𐅋 𐅒 ٤ 𐅓 𐅔 𐅕 齿
 ...describing input string as-is (full input string)
Constituent codepoints:
   006E   LATIN SMALL LETTER N
   0075   LATIN SMALL LETTER U
   006D   LATIN SMALL LETTER M
   0065   LATIN SMALL LETTER E
   0072   LATIN SMALL LETTER R
   0061   LATIN SMALL LETTER A
   006C   LATIN SMALL LETTER L
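
A listing like the one above can be reproduced with Python's standard `unicodedata` module. This is a minimal sketch, not the tool's own code; the helper name `describe_codepoints` and the `<unnamed>` fallback are assumptions made for illustration.

```python
import unicodedata

def describe_codepoints(text):
    """Return (hex codepoint, Unicode name) pairs, one per character."""
    # unicodedata.name() raises ValueError for unnamed codepoints unless
    # a default is given, so "<unnamed>" is used as a placeholder here.
    return [(f"{ord(ch):04X}", unicodedata.name(ch, "<unnamed>")) for ch in text]

for cp, name in describe_codepoints("numeral"):
    print(f"   {cp}   {name}")
```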
 
Some unicode data
Normalization: No normalisations change the data
(does not necessarily mean nothing decomposes to this form)
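
The normalization check can be sketched as follows, assuming it compares the input against each of the four standard forms. The helper name `changed_by_normalization` is hypothetical.

```python
import unicodedata

FORMS = ("NFC", "NFD", "NFKC", "NFKD")

def changed_by_normalization(text):
    """Return the normalization forms whose output differs from the input."""
    return [form for form in FORMS if unicodedata.normalize(form, text) != text]

# Plain ASCII letters are already in every normal form.
print(changed_by_normalization("numeral"))   # → []
# A precomposed letter such as U+00E9 is split up by the decomposing forms.
print(changed_by_normalization("\u00e9"))    # → ['NFD', 'NFKD']
```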
 
Font info (experiment)
Some fonts with all of these characters: unifont, Code2000, HanaMinB, IPAmjMincho, 'Noto Sans CJK TC Black', 'Noto Sans CJK KR Bold', 'Noto Sans CJK KR Medium', 'Noto Sans CJK SC Light', 'Noto Sans CJK TC Thin', 'Noto Sans CJK JP Thin', 'Noto Sans CJK KR DemiLight', 'Noto Sans CJK KR Light', 'Noto Sans CJK JP Light', 'Noto Sans CJK KR Black', 'Noto Sans CJK TC Light', 'Noto Sans CJK SC Thin', 'Noto Sans CJK TC DemiLight', 'Noto Sans CJK TC Medium', 'Noto Sans CJK KR Regular', 'Noto Sans CJK JP DemiLight', 'Noto Sans CJK SC Bold', 'Noto Sans CJK SC Black', 'Noto Sans CJK JP Regular', 'Noto Sans CJK JP Black', 'Noto Sans CJK TC Bold', 'Noto Sans CJK SC Medium', 'Noto Sans CJK TC Regular', 'Noto Sans CJK KR Thin', 'Noto Sans CJK JP Bold', 'Noto Sans CJK JP Medium', 'Noto Sans CJK SC DemiLight', 'Noto Sans CJK SC Regular', TW-Kai, TW-Sung, HanaMinA, 'WenQuanYi Zen Hei', 'Bitstream Cyberbit', 'UKIJ CJK', 'Droid Sans Fallback', cwTeXMing, cwTeXKai, cwTeXHeiBold, cwTeXHei, MOESongUN, SeoulNamsan, SeoulHangang, 'SeoulHangang CM', 'SeoulHangang CL', 'SeoulNamsan CBL', 'SeoulNamsan CB', 'SeoulNamsan CL', 'SeoulNamsan CM', 'SeoulNamsan CEB', 'SeoulHangang CEB', 'SeoulHangang CB', 'SeoulHangang CBL', UnBatang, UnDotum, UnGungseo, UnShinmun, JejuMyeongjoOTF, JejuHallasanOTF, JejuGothicOTF, 'NanumGothic Eco', 'Nanum Brush Script OTF', 'Nanum Pen Script OTF', UnYetgul, NanumBarunGothic, CTChaoHeiSF, CTXianTiSF, CTCuYuanSF, KoPubBatang_Pro, 'Baekmuk Gulim', Code2002, 'SeoulNamsan B', 'SeoulHangang L', 'SeoulHangang M', 'SeoulNamsan L', 'SeoulNamsan M', 'SeoulHangang B', 'SeoulHangang EB', 'SeoulNamsan EB', NanumGothic, 'SeoulNamsan vert', JejuMyeongjo, JejuHallasan, JejuGothic, 'Nanum Brush Script', 'Nanum Pen', 'Nanum Pen Script', 'Baekmuk Headline', 'Baekmuk Batang', 'Baekmuk Dotum', 'KoPub Batang', 'VL PGothic', 'UmePlus P Gothic', UnGraphic, 'UmePlus Gothic', 'Komatuna P', 'VL Gothic'
 
Encoding
HTML/XML
numeric entities
All but basic alphanumeric encoded (hexadecimal and decimal):
numeral
numeral
UTF8 bytestring as hex: 6e756d6572616c
(UTF8 bytestring length is 7)
URL-encoded UTF8: numeral
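
The three encodings above can be reproduced roughly as follows. This is a sketch under the assumption that "basic alphanumeric" means ASCII letters and digits; the function names are hypothetical and not taken from the tool.

```python
from urllib.parse import quote

def numeric_entities(text, hexadecimal=True):
    """Encode everything except ASCII letters and digits as numeric entities."""
    out = []
    for ch in text:
        if ch.isascii() and ch.isalnum():
            out.append(ch)                    # left as-is
        elif hexadecimal:
            out.append(f"&#x{ord(ch):x};")    # hexadecimal entity
        else:
            out.append(f"&#{ord(ch)};")       # decimal entity
    return "".join(out)

def utf8_hex(text):
    """Hex dump of the UTF-8 bytestring."""
    return text.encode("utf-8").hex()

def url_encoded(text):
    """Percent-encode the UTF-8 bytes, leaving only unreserved characters."""
    return quote(text, safe="")

print(numeric_entities("numeral"))   # → numeral
print(utf8_hex("numeral"))           # → 6e756d6572616c
print(url_encoded("numeral"))        # → numeral
```

For an all-ASCII input like this one, every form is either identical to the input or a plain hex dump of it; the differences only show up for characters outside the ASCII alphanumerics.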
Python string
before py3k
Unicode string:
  u'numeral'
UTF8 bytestring:
  'numeral'
...in py3k
Unicode string:
  'numeral'
UTF8 bytestring:
  b'numeral'
Javascript (≥1.3): "numeral"
LaTeX
(incomplete experiment)
nothing interesting to report here
Encodings that can encode this properly: utf_8   utf_16   ascii   latin_1   iso8859_2   iso8859_3   iso8859_4   iso8859_5   iso8859_6   iso8859_7   iso8859_8   iso8859_9   iso8859_10   iso8859_13   iso8859_14   iso8859_15   iso2022_jp   iso2022_jp_1   iso2022_jp_2   iso2022_jp_2004   iso2022_jp_3   iso2022_jp_ext   iso2022_kr   gb2312   gbk   gb18030   big5   big5hkscs   euc_jp   euc_jis_2004   euc_jisx0213   euc_kr   hz   johab   koi8_r   koi8_u   mac_cyrillic   mac_greek   mac_iceland   mac_latin2   mac_roman   mac_turkish   ptcp154   shift_jis   shift_jis_2004   shift_jisx0213   cp037   cp424   cp437   cp500   cp737   cp775   cp850   cp852   cp855   cp856   cp857   cp860   cp861   cp862   cp863   cp864   cp865   cp866   cp869   cp874   cp875   cp932   cp949   cp950   cp1006   cp1026   cp1140   cp1250   cp1251   cp1252   cp1253   cp1254   cp1255   cp1256   cp1257   cp1258
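
One way to arrive at such a list is to try a round trip through each codec. This is a sketch that assumes "properly" means the text survives encoding and decoding unchanged; `CANDIDATES` is a short illustrative subset and `lossless_encodings` is a hypothetical name.

```python
# A small subset of Python codec names, for illustration only.
CANDIDATES = ("utf_8", "utf_16", "ascii", "latin_1", "koi8_r", "shift_jis", "cp1252")

def lossless_encodings(text, candidates=CANDIDATES):
    """Return the codecs that encode and decode `text` without loss."""
    ok = []
    for name in candidates:
        try:
            if text.encode(name).decode(name) == text:
                ok.append(name)
        except (UnicodeError, LookupError):
            # UnicodeError: the codec cannot represent a character.
            # LookupError: the codec is not available in this Python build.
            pass
    return ok

print(lossless_encodings("numeral"))
```

Because the input is plain ASCII, every codec in the subset handles it; a string with non-ASCII characters narrows the result considerably.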
Encodings that will cripple your data

Layout

Note that of the ~1.1 million codepoints up to U+10FFFF (the current cap), ~900K are unused, ~130K are private use, and only ~100K are general-purpose graphic codepoints (about half of them in the BMP).
The grouping used below is somewhat arbitrary, but looks halfway sensible.
For more on planes, see http://en.wikipedia.org/wiki/Mapping_of_Unicode_characters#Planes
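
Since each plane holds 65536 codepoints, the plane of a codepoint is simply its value shifted right by 16 bits. The sketch below assumes the plane abbreviations used on this page; `PLANE_NAMES` and `plane_of` are hypothetical names.

```python
# Plane abbreviations as used in the layout on this page.
PLANE_NAMES = {0: "BMP", 1: "SMP", 2: "SIP", 3: "TIP",
               14: "SSP", 15: "PUA-A", 16: "PUA-B"}

TOTAL_CODEPOINTS = 0x10FFFF + 1   # 17 planes of 65536 codepoints each

def plane_of(codepoint):
    """Return (plane number, abbreviation) for a codepoint."""
    if not 0 <= codepoint <= 0x10FFFF:
        raise ValueError("codepoint outside the Unicode range")
    number = codepoint >> 16          # each plane spans 0x10000 codepoints
    return number, PLANE_NAMES.get(number, "not allocated")

print(TOTAL_CODEPOINTS)       # → 1114112
print(plane_of(ord("n")))     # → (0, 'BMP')
print(plane_of(0x1D360))      # → (1, 'SMP')
```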

Mouseover shows a range's codepoint range, links are to unicode.org PDFs.
Nameless gray blocks are reserved, unused or restricted areas, and show their size.
If we are showing a single codepoint above, the range it is in is bolded below.


BMP - Basic Multilingual Plane:






not allocated (63)
not allocated (63)








not allocated (15)
not allocated (79)










(End of the range that UCS-2, and Unicode implementations based on it, can store)
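
Codepoints beyond this limit do not fit in a single 16-bit unit; UTF-16 represents them as a surrogate pair instead. A minimal sketch of that split follows, with `utf16_units` as a hypothetical helper name.

```python
def utf16_units(ch):
    """Return the UTF-16 code units (as integers) for a single character."""
    cp = ord(ch)
    if cp <= 0xFFFF:
        return [cp]                   # fits in one 16-bit unit
    cp -= 0x10000                     # 20 bits remain
    high = 0xD800 + (cp >> 10)        # top 10 bits -> high surrogate
    low = 0xDC00 + (cp & 0x3FF)       # bottom 10 bits -> low surrogate
    return [high, low]

print([hex(u) for u in utf16_units("n")])            # → ['0x6e']
print([hex(u) for u in utf16_units("\U0001D360")])   # → ['0xd834', '0xdf60']
```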




SMP - Supplementary Multilingual Plane:
not allocated (127)
not allocated (31)
not allocated (79)
not allocated (143)
not allocated (127)
not allocated (79)
not allocated (63)
not allocated (31)
not allocated (79)
not allocated (527)
not allocated (383)
not allocated (95)
not allocated (255)
not allocated (159)
not allocated (31)
not allocated (463)
not allocated (447)
not allocated (1279)
not allocated (2943)
not allocated (13263)
not allocated (95)
not allocated (879)
not allocated (16479)
not allocated (2815)
not allocated (4943)


not allocated (175)


not allocated (127)
not allocated (4095)
not allocated (1311)
not allocated (255)






not allocated (511)
not allocated (1791)




SIP - Supplementary Ideographic Plane:
not allocated (31)
not allocated (16351)
not allocated (1503)




TIP - Tertiary Ideographic Plane (plane 3, tentatively allocated):
Small Seal Script (tentative allocation) (10528)
not allocated (223)
Oracle Bone Script (tentative allocation) (6144)
not allocated (48639)




Planes 4 through 13 - not allocated:
plane 4 (not allocated) (65536)
plane 5 (not allocated) (65536)
plane 6 (not allocated) (65536)
plane 7 (not allocated) (65536)
plane 8 (not allocated) (65536)
plane 9 (not allocated) (65536)
plane 10 (not allocated) (65536)
plane 11 (not allocated) (65536)
plane 12 (not allocated) (65536)
plane 13 (not allocated) (65536)




SSP - Supplementary Special-purpose Plane:
not allocated (127)
not allocated (65039)




PUA-A - Private Use Area A:




PUA-B - Private Use Area B: