kb/data/developer.mozilla.org/en-US/docs/Glossary/Unicode-0.md

10 KiB

title chunk source category tags date_saved instance
Unicode - Glossary | MDN 1/3 https://developer.mozilla.org/en-US/docs/Glossary/Unicode reference web, html, css, javascript, documentation 2026-05-05T05:48:12.080098+00:00 kb-cron

Unicode

Unicode is a standard character set that numbers and defines characters from the world's different languages, writing systems, and symbols. Because each character is assigned a number, programmers can create character encodings that let computers store, process, and transmit any combination of languages in the same file or program.

Before Unicode, it was difficult and error-prone to mix languages in the same data. For example, one character set would store Japanese characters, and another would store the Arabic alphabet. If it was not clearly marked which parts of the data were in which character set, other programs and computers would display the text incorrectly, or damage it during processing. If you've ever seen text where characters like curly quotes (“”) were replaced with gibberish like £, then you've seen this problem, known as Mojibake.

The most common Unicode character encoding on the Web is UTF-8. Other encodings exist, like UTF-16 or the obsolete UCS-2, but UTF-8 is recommended.
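The ideas above can be illustrated with a minimal Python sketch. It shows the number (code point) Unicode assigns to a few characters, how UTF-8 turns them into bytes, and how Mojibake appears when those bytes are read with the wrong character set. The sample characters and the choice of Windows-1252 as the wrongly assumed legacy encoding are assumptions made for illustration only.

```python
# Sample text: a curly quote, an accented Latin letter, and a Japanese character.
text = "“é日"

# Unicode assigns each character a number, called a code point.
code_points = [f"U+{ord(ch):04X}" for ch in text]
print(code_points)

# A character encoding such as UTF-8 maps those numbers to bytes.
utf8_bytes = text.encode("utf-8")
print(len(text), "characters become", len(utf8_bytes), "bytes in UTF-8")

# Mojibake: UTF-8 bytes decoded with the wrong character set.
# Windows-1252 ("cp1252") is an assumed legacy encoding for this demo.
garbled = "“".encode("utf-8").decode("cp1252")
print(garbled)

# If no bytes were lost, reversing the wrong step recovers the original text.
repaired = garbled.encode("cp1252").decode("utf-8")
print(repaired)
```

Declaring the encoding explicitly wherever text is read or written, and using UTF-8 throughout, avoids the mismatch shown in the third step.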
