{ HTML Entities to Unicode }

// convert & ♥ and entities back to characters

Convert HTML entities like &, ♥, ♥ back to Unicode characters instantly. Supports named, decimal, and hex entities. Free browser-based tool.

TRY:
Supports named (&), decimal (♥), and hex (♥) entities
โ™ฅ

Ready to convert

Paste entities and click Decode

HOW TO USE

  1. 01
    Paste your text

    Paste any HTML code or text containing entities into the input field

  2. 02
    Choose mode

    Select Decode (entities โ†’ characters) or Encode (characters โ†’ entities)

  3. 03
    Click Convert

    Hit Decode Entities and copy your clean Unicode output instantly

SUPPORTED FORMATS

& Named ♥ Decimal ♥ Hex HTML5 Entities UTF-8 Output

USE CASES

  • ๐Ÿ”ง Clean up scraped HTML content
  • ๐Ÿ”ง Debug template or CMS output
  • ๐Ÿ”ง Convert database-stored HTML entities
  • ๐Ÿ”ง Prepare text for plain-text rendering
  • ๐Ÿ”ง Reverse-encode symbols for HTML output

WHAT IS THIS?

HTML entities are special codes that represent characters in HTML. For example, & represents &, ♥ represents โ™ฅ, and ♥ represents โ™ฅ using hexadecimal. This tool decodes them back to readable Unicode characters, or encodes text the other way around.

RELATED TOOLS

FREQUENTLY ASKED QUESTIONS

What's the difference between named and numeric entities?

Named entities use a keyword like & or ©. Numeric entities use the character's Unicode code point โ€” either decimal (©) or hexadecimal (©). All three forms represent the same characters and are fully supported by this tool.

Does this tool work with HTML5 entities?

Yes. The tool uses PHP's html_entity_decode() with the ENT_HTML5 flag, which covers the full HTML5 named character references including newer entities like 	, 
, and math symbols.

Is my data sent to a server?

Your text is sent to our PHP backend for processing, but it is never stored, logged, or shared. Processing happens instantly and the data is discarded immediately after the response is returned. For very sensitive content, you can also use the browser's built-in DOM API as an alternative.

Why does   sometimes not look different?

  decodes to a non-breaking space (Unicode U+00A0), which looks identical to a regular space visually. The difference matters in HTML layout and text processing โ€” it prevents line breaks at that position. The tool correctly converts it; the character is just invisible.

Can I encode text back to HTML entities?

Yes! Switch to "Encode Unicode โ†’ Entities" mode using the toggle at the top. This will convert special characters like <, >, &, and quotes into their safe HTML entity equivalents, useful for embedding code inside HTML.

What's the maximum input size?

The tool supports up to 200,000 characters per conversion. For larger files, consider splitting your content into chunks or using a server-side script with PHP's html_entity_decode() function directly.

HTML Entities to Unicode Converter

Working with HTML can be tricky when your text is littered with entity codes like &amp;, &#9829;, or &#x2665;. These codes are how web browsers safely represent special characters in HTML documents, but when you're processing text programmatically, debugging templates, or cleaning scraped content, you often need those raw Unicode characters back. This tool decodes all three entity formats instantly and shows you exactly what was converted.

๐Ÿ’ก Looking for premium HTML templates and themes? MonsterONE offers unlimited downloads of professional HTML themes, UI kits, and web assets โ€” worth checking out.

Understanding the Three Types of HTML Entities

HTML supports three distinct entity formats, and all of them decode to the same Unicode characters:

When Do HTML Entities Appear?

HTML entities are inserted by templating engines, content management systems, and sanitizers to prevent XSS attacks and ensure valid HTML. WordPress, Drupal, and most PHP frameworks will automatically encode user-submitted content using htmlspecialchars() or similar functions. When you retrieve that content from a database and display it in a non-HTML context โ€” like a PDF, an email, an API response, or a plain-text file โ€” the raw entity codes show up verbatim instead of the intended characters.

Similarly, web scraping tools often return HTML source with entities intact. If you extract text with a regex or basic string parser instead of a proper HTML parser, you'll end up with entities that need to be decoded before the text is usable.

How the Decode Process Works

Under the hood, this tool uses PHP's html_entity_decode() function with the ENT_QUOTES | ENT_HTML5 flags and UTF-8 encoding. This combination ensures that:

The Entity Map section shows you a side-by-side breakdown of each entity found in your input and the Unicode character it decoded to โ€” useful for auditing or learning what specific codes mean.

Encoding Mode: Converting Characters to Entities

The reverse operation โ€” encoding plain text into HTML entities โ€” is equally useful. If you're writing HTML documentation, blog posts about code, or generating HTML that will be displayed inside a <pre> or <code> block, you need to encode characters like <, >, and & so the browser renders them as literal characters instead of interpreting them as HTML tags. The Encode mode uses htmlspecialchars() with HTML5 flag for standards-compliant output.

Common HTML Entity Reference

Here are some of the most frequently encountered entities and their Unicode equivalents: &amp; โ†’ & (U+0026), &lt; โ†’ < (U+003C), &gt; โ†’ > (U+003E), &quot; โ†’ " (U+0022), &apos; โ†’ ' (U+0027), &nbsp; โ†’ non-breaking space (U+00A0), &copy; โ†’ ยฉ (U+00A9), &reg; โ†’ ยฎ (U+00AE), &trade; โ†’ โ„ข (U+2122), &euro; โ†’ โ‚ฌ (U+20AC), &pound; โ†’ ยฃ (U+00A3), &yen; โ†’ ยฅ (U+00A5), &mdash; โ†’ โ€” (U+2014), &ndash; โ†’ โ€“ (U+2013), &hellip; โ†’ โ€ฆ (U+2026), &laquo; โ†’ ยซ (U+00AB), &raquo; โ†’ ยป (U+00BB).

Frequently Used Emoji and Symbol Entities

Modern HTML5 named entities include many symbols you might not expect: &starf; or &#9733; โ†’ โ˜… (black star), &hearts; or &#9829; โ†’ โ™ฅ (heart suit), &check; โ†’ โœ“ (check mark), &cross; โ†’ โœ— (cross mark), &phone; โ†’ โ˜Ž (telephone), &clubs; โ†’ โ™ฃ (club suit), &spades; โ†’ โ™  (spade suit), &diams; โ†’ โ™ฆ (diamond suit). Emoji in the higher Unicode planes (U+1F000 and above) are typically written as decimal entities like &#128512; (๐Ÿ˜€) or hex like &#x1F600;.

Practical Tips for Working with HTML Entities

When building a PHP application, use html_entity_decode($string, ENT_QUOTES | ENT_HTML5, 'UTF-8') to decode entities safely. Always specify the encoding โ€” without it, PHP may default to ISO-8859-1, which will corrupt multi-byte UTF-8 characters. When encoding for HTML output, use htmlspecialchars() rather than htmlentities() โ€” the latter will encode every character with a named equivalent, producing much larger output unnecessarily. For JavaScript, DOMParser or creating a temporary textarea element are reliable ways to decode entities client-side without a library.

โ˜•