Convertidor Unicode

Convertir texto a secuencias de escape Unicode o viceversa.

Caracteres (Unicode)

Unicode is an international standard for encoding, representing, and handling text. It assigns a unique number (codepoint) to every character from every writing system, plus symbols, emojis, and control characters. Unicode aims to be the universal character set, currently containing over 149,000 characters covering 161 scripts.

Unicode Encodings

Unicode codepoints can be encoded in different formats: UTF-8 (variable 1-4 bytes, ASCII-compatible), UTF-16 (2 or 4 bytes, used by JavaScript/Windows), and UTF-32 (fixed 4 bytes). UTF-8 has become the dominant encoding on the web, handling all languages while remaining efficient for ASCII text.

Analizador Cron
  • Debugging encoding issues in text
  • Finding special character codepoints
  • Converting escaped Unicode sequences
  • Analyzing character composition
  • Working with internationalization (i18n)
Formatos soportados
U+XXXX Standard Unicode notation (e.g., U+0041 for 'A')
\uXXXX JavaScript/JSON escape sequence
&#DDDD; HTML decimal entity
&#xHHHH; HTML hexadecimal entity
Frequently Asked Questions

What's the difference between UTF-8 and Unicode?

Unicode is the character set (mapping of characters to numbers). UTF-8 is one way to encode those numbers as bytes. Other encodings include UTF-16 and UTF-32.

Why do some characters look like boxes or question marks?

This happens when your system doesn't have a font that includes that character, or when encoding is misdetected. The character exists but can't be displayed.