What characters are encoded?

At minimum , &, ", and ' are encoded. Non-ASCII characters can also be encoded as numeric entities.

What is the difference between named and numeric entities?

Named entities like & are human-readable; numeric entities like & are universally supported. Both are valid HTML.

Does this protect against XSS?

Encoding HTML entities is a key step in preventing XSS, but full security requires proper context-aware sanitization on the server side as well.

When should I encode HTML entities in web development?

Always encode user-supplied data before inserting it into HTML to prevent Cross-Site Scripting (XSS) attacks. The five characters with special meaning in HTML — < (less-than), > (greater-than), & (ampersand), " (double quote), and ' (apostrophe) — must be encoded as <, >, &, ", and ' respectively. Modern web frameworks (React, Vue, Angular) do this automatically for text content, but it is crucial to understand when working with raw HTML strings.

What is the difference between named entities and numeric entities?

Named entities use descriptive names like (non-breaking space), © (copyright ©), and € (€). Numeric entities use decimal (©) or hexadecimal (©) Unicode code points. Named entities are more readable but only cover a limited set of characters defined in the HTML specification. Numeric entities can represent any Unicode character, making them the universal option for encoding any character not already covered by a named entity.

HTML Entity Converter — Encode & Decode HTML Entities — Free Online Tool

What Are HTML Entities?

HTML entities are short codes that represent characters with special meaning in HTML syntax. The most critical ones are the angle brackets (< and >), which the browser interprets as the start and end of HTML tags, and the ampersand (&), which begins every entity reference. When you want to display these characters as visible text rather than having the browser parse them as markup, you must use their entity equivalents: < for <, > for >, and & for &. Other commonly encoded characters include the double quote (") and apostrophe ('), which matter inside attribute values. HTML also supports hundreds of named entities for symbols and accented letters — for example, © for ©, é for é, and — for an em dash. Numeric entities (< or <) work as an alternative when a named form does not exist. Understanding entities helps you write more robust, standards-compliant HTML that displays correctly across all browsers and character sets.

Named vs. Numeric Entities

HTML entities come in two flavors: named and numeric. Named entities use a descriptive keyword, such as & (ampersand), < (less-than), and   (non-breaking space). They are human-readable and widely recognized, but not every character has a named form. Numeric entities reference a character by its Unicode code point, either in decimal form (& for &) or hexadecimal form (& for &). Every Unicode character can be expressed numerically, making numeric entities more universal. Both forms are fully equivalent to browsers — & and & both render as an ampersand. The choice between them is mostly stylistic: use named entities for readability in hand-written HTML, and numeric entities when you need to represent a character that has no name or when generating HTML programmatically without worrying about entity name lookups.

Character Encoding vs. HTML Entities

It is important to distinguish between HTML entity encoding and character encoding. Character encoding (such as UTF-8) defines how text bytes are stored and transmitted at the byte level. As long as your page declares UTF-8 with a proper <meta charset="UTF-8"> tag and your server sends the correct Content-Type header, you can include most Unicode characters — including Japanese, emoji, and accented letters — directly in your HTML source without entity-encoding them. HTML entities, by contrast, are specifically for escaping characters that have syntactic significance in HTML (&, <, >) or that must appear inside attribute values safely. A common misconception is that all non-ASCII characters must be entity-encoded; with modern UTF-8 pages, only the handful of HTML syntax characters truly require escaping.

HTML Entity Converter — Encode & Decode HTML Entities

About HTML Entity Converter — Encode & Decode HTML Entities

How to Use

Features

Understanding HTML Entities and Character Encoding

What Are HTML Entities?

Named vs. Numeric Entities

Character Encoding vs. HTML Entities

HTML Encoding and XSS Prevention

How XSS Attacks Work

How HTML Encoding Prevents XSS

FAQ