Inspect Unicode code points, names, blocks, and escape sequences for any character or string. Free, browser-based, no sign-up required.

HOW TO USE

Paste your string

Type or paste any text into the input box — plain text, emoji, symbols, CJK, RTL, or any Unicode characters.

Choose escape format

Select your preferred output format: JavaScript, Python, CSS, HTML entities, URL encoding, or plain U+ code points.

Inspect and copy

Click Inspect to see every code point, its Unicode name, block, plane, and escape sequence. Copy individual rows or the full table.

FEATURES

Code Points

Unicode Names

Block Detection

Multi-format Escapes

Emoji Support

Surrogate Pairs

USE CASES

🔧 Debugging Unicode encoding issues in code

🔧 Finding the escape sequence for a special character

🔧 Identifying unknown glyphs or symbols

🔧 Working with emoji and multi-codepoint sequences

🔧 Preparing Unicode strings for the CSS content property

🔧 Auditing user input for invisible/control characters

WHAT IS THIS?

The Unicode Code Point Inspector breaks any string into its individual Unicode code points, revealing the name, block, category, plane, and escape sequences for every character — including emoji, CJK ideographs, combining marks, and invisible control characters.
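For readers who want to reproduce this kind of breakdown in a script, here is a minimal Python sketch using the standard unicodedata module. It is not the tool's own implementation; the function name inspect and the "<unnamed>" fallback label are assumptions made for illustration. Block and plane are left out because unicodedata does not expose them directly.

```python
import unicodedata


def inspect(text: str) -> list[dict]:
    """Return one record per code point in text."""
    rows = []
    for ch in text:  # iterating a Python str yields code points, not bytes
        cp = ord(ch)
        rows.append({
            "char": ch,
            "code_point": f"U+{cp:04X}",
            # Control characters have no name in unicodedata, so a fallback
            # label is supplied (the label text is this sketch's own choice).
            "name": unicodedata.name(ch, "<unnamed>"),
            "category": unicodedata.category(ch),
        })
    return rows


# "A" followed by a combining acute accent, then a zero-width space
for row in inspect("A\u0301\u200b"):
    print(row["code_point"], row["category"], row["name"])
```

The combining mark and the zero-width space each get their own row, which is the behaviour that makes this kind of breakdown useful for debugging.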

FREQUENTLY ASKED QUESTIONS

What is a Unicode code point?

A Unicode code point is the unique number that the Unicode standard assigns to a character. Written as U+XXXX (e.g. U+0041 for "A"), code points range from U+0000 to U+10FFFF, giving 1,114,112 possible values across 17 planes of 65,536 code points each.
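The arithmetic behind the range and the planes is simple enough to check by hand. The following Python sketch shows it; the helper names plane_of and format_code_point are hypothetical and not part of the tool.

```python
MAX_CODE_POINT = 0x10FFFF
PLANE_SIZE = 0x10000  # 65,536 code points per plane


def plane_of(cp: int) -> int:
    """Return the plane number (0 to 16) that contains code point cp."""
    if not 0 <= cp <= MAX_CODE_POINT:
        raise ValueError(f"not a Unicode code point: {cp:#x}")
    return cp >> 16  # same as cp // PLANE_SIZE


def format_code_point(cp: int) -> str:
    """Format cp in U+ notation, padded to at least four hex digits."""
    return f"U+{cp:04X}"


print(format_code_point(ord("A")), "is in plane", plane_of(ord("A")))
print(format_code_point(0x1F600), "is in plane", plane_of(0x1F600))
print("total code points:", MAX_CODE_POINT + 1)
```

Plane 0 is the Basic Multilingual Plane; most emoji live in plane 1.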

Why does one emoji show as multiple code points?

Many emoji are sequences of multiple code points — for example, family emoji combine base characters with Zero Width Joiners (U+200D), and flag emoji use pairs of Regional Indicator letters. This tool displays each code point individually so you can see every component.
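To see this outside the tool, the sketch below splits a family emoji and a flag emoji into their code points in Python. The helper names code_points and region_code are assumptions for illustration; the flag decoding relies on the Regional Indicator letters starting at U+1F1E6 for "A".

```python
import unicodedata

REGIONAL_INDICATOR_A = 0x1F1E6  # REGIONAL INDICATOR SYMBOL LETTER A


def code_points(text: str) -> list[str]:
    """List every code point of text in U+ notation."""
    return [f"U+{ord(ch):04X}" for ch in text]


def region_code(flag: str) -> str:
    """Map a pair of Regional Indicator letters back to ASCII letters."""
    return "".join(chr(ord(ch) - REGIONAL_INDICATOR_A + ord("A")) for ch in flag)


# MAN + ZWJ + WOMAN + ZWJ + GIRL renders as one family glyph where supported
family = "\U0001F468\u200D\U0001F469\u200D\U0001F467"
# Two Regional Indicator letters render as one flag where supported
flag = "\U0001F1EF\U0001F1F5"

for ch in family:
    print(f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>"))

print(code_points(flag), "->", region_code(flag))
```

A single visible glyph here is five code points for the family and two for the flag, which is why string length and visible character count often disagree.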

What is a Unicode block?

Unicode is divided into named blocks — contiguous ranges of code points grouped by script or purpose. Examples include "Basic Latin" (U+0000–U+007F), "CJK Unified Ideographs" (U+4E00–U+9FFF), and "Emoticons" (U+1F600–U+1F64F). Knowing the block helps identify the script a character belongs to.
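Because blocks are contiguous ranges, block detection reduces to a range lookup. The Python sketch below uses a deliberately tiny table containing only the three blocks named above; a real lookup would load the full list from the Unicode Character Database. The table name BLOCKS and the function block_of are hypothetical.

```python
from typing import Optional

# Illustrative subset only: (first code point, last code point, block name)
BLOCKS = [
    (0x0000, 0x007F, "Basic Latin"),
    (0x4E00, 0x9FFF, "CJK Unified Ideographs"),
    (0x1F600, 0x1F64F, "Emoticons"),
]


def block_of(ch: str) -> Optional[str]:
    """Return the block name for ch, or None if it is not in the table."""
    cp = ord(ch)
    for start, end, name in BLOCKS:
        if start <= cp <= end:
            return name
    return None


for ch in ["A", "\u4E2D", "\U0001F600", "\u00E9"]:
    print(f"U+{ord(ch):04X}", block_of(ch))
```

U+00E9 returns None here only because its block (Latin-1 Supplement) is missing from the sample table, not because it lacks a block.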

What is the difference between UTF-8, UTF-16, and code points?

A code point is the abstract number for a character (e.g. U+1F600). UTF-8 and UTF-16 are encoding schemes that store those numbers as bytes. UTF-8 uses 1–4 bytes per code point; UTF-16 uses 2 or 4 bytes. The code point value is the same whichever encoding is used; alongside it, this tool shows how many bytes the code point occupies in UTF-8.
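The difference is easy to observe by encoding the same characters both ways. This Python sketch counts the bytes per code point and, for supplementary characters, computes the UTF-16 surrogate pair; the function names encoded_sizes and surrogate_pair are assumptions for illustration.

```python
def encoded_sizes(ch: str) -> dict[str, int]:
    """Return the byte length of one code point in UTF-8 and UTF-16."""
    return {
        "utf8_bytes": len(ch.encode("utf-8")),
        # "utf-16-le" is used so no byte order mark is added to the count
        "utf16_bytes": len(ch.encode("utf-16-le")),
    }


def surrogate_pair(cp: int) -> tuple[int, int]:
    """Split a supplementary code point into its UTF-16 high/low surrogates."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("only code points above U+FFFF use surrogate pairs")
    offset = cp - 0x10000
    return 0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)


for ch in "A\u00E9\u20AC\U0001F600":
    print(f"U+{ord(ch):04X}", encoded_sizes(ch))

high, low = surrogate_pair(0x1F600)
print(f"U+1F600 in UTF-16: {high:04X} {low:04X}")
```

The four sample characters take 1, 2, 3, and 4 bytes in UTF-8 respectively, while only the last one needs 4 bytes in UTF-16.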

What escape formats are supported?

The tool supports JavaScript (\uXXXX / \u{XXXXX} for supplementary characters), Python (\uXXXX / \UXXXXXXXX), CSS (\XXXXXX), HTML numeric entities (&#xXXXXX;), URL percent-encoding, and plain U+ notation.
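As a rough illustration of how these formats relate to one another, the Python sketch below builds each escape for a single character. It is a sketch under stated assumptions rather than the tool's code: the function name escapes and the dictionary keys are hypothetical, the CSS form is always padded to six hex digits, and lone surrogate code points are not handled.

```python
from urllib.parse import quote


def escapes(ch: str) -> dict[str, str]:
    """Return the escape sequence for one character in several formats."""
    cp = ord(ch)
    in_bmp = cp <= 0xFFFF  # supplementary characters need the longer forms
    return {
        "javascript": f"\\u{cp:04X}" if in_bmp else f"\\u{{{cp:X}}}",
        "python": f"\\u{cp:04X}" if in_bmp else f"\\U{cp:08X}",
        "css": f"\\{cp:06X}",
        "html": f"&#x{cp:X};",
        # Percent-encode the UTF-8 bytes; safe="" leaves nothing unescaped
        # except unreserved ASCII characters.
        "url": quote(ch, safe=""),
        "u_plus": f"U+{cp:04X}",
    }


for fmt, value in escapes("\U0001F600").items():
    print(f"{fmt:>10}: {value}")
```

Padding the CSS escape to six digits avoids the need for a trailing space when the next character happens to be a hex digit.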

Can it handle right-to-left (RTL) text?

Yes. Arabic, Hebrew, and other RTL scripts are fully supported. The inspector identifies each code point regardless of writing direction, and includes any directional control characters (like U+200F RIGHT-TO-LEFT MARK) that may be present in the string.

Are invisible or control characters shown?

Absolutely. Control characters, zero-width spaces, non-breaking spaces, directional marks, and other invisible characters are often the cause of hard-to-debug text issues. This tool makes them visible by displaying their code point, official Unicode name, and category.
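A similar audit can be scripted by filtering on the Unicode general category. The Python sketch below flags control, format, and separator characters other than the ordinary space; the category set, the function name find_invisible, and the "<control>" fallback label are assumptions chosen for this example, and other definitions of "invisible" are possible.

```python
import unicodedata

# Cc = control, Cf = format (zero-width and directional marks),
# Zs/Zl/Zp = space, line, and paragraph separators
INVISIBLE_CATEGORIES = {"Cc", "Cf", "Zs", "Zl", "Zp"}


def find_invisible(text: str) -> list[tuple[int, str, str]]:
    """Return (index, code point, name) for each suspicious character."""
    hits = []
    for index, ch in enumerate(text):
        if ch == " ":
            continue  # the ordinary space is expected, so it is not reported
        if unicodedata.category(ch) in INVISIBLE_CATEGORIES:
            name = unicodedata.name(ch, "<control>")
            hits.append((index, f"U+{ord(ch):04X}", name))
    return hits


sample = "a\u200bb\u00a0c\u200f"
for hit in find_invisible(sample):
    print(hit)
```

The sample string looks like "ab c" in most renderers, yet it contains a zero-width space, a no-break space, and a right-to-left mark.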

Is there a character limit?

The tool works entirely in your browser with no server-side processing, so there is no hard limit enforced. For very long strings (thousands of characters), rendering may slow slightly, but the inspection will still complete accurately.

{ Unicode Code Point Inspector }

What Is the Unicode Code Point Inspector?

Understanding Unicode Code Points

Escape Sequences Across Programming Languages

Unicode Blocks and Scripts

Emoji and Multi-Codepoint Sequences

Invisible and Control Characters

UTF-8 Byte Counts

Developer Use Cases