Audisto Crawler Hints
Character Encoding Analyzer: Content & Technical Implementation Tool
What is the encoding checker for?
Ensure Proper Character Display: Detect issues with character encoding, charset declarations, and special character handling that affect content readability.
Validate Technical Implementation: Find problems with BOM markers, non-printable characters, and content compression that impact content delivery.
Maintain Content Integrity: Identify encoding mismatches between HTTP headers and HTML documents that could cause display issues.
Monitor Encoding Health: Track character encoding implementation across your website to maintain consistent content display and prevent rendering issues.
How to check encoding issues at scale

Proper encoding ensures readability
Correct character encoding is crucial for content display and interpretation. Our analysis helps you maintain proper encoding across your website. While our quick check provides immediate encoding insights for single pages, our full crawler ensures consistent character handling across your entire website.
Critical encoding-related hints

<html> contains too many uncommon non-printable characters
The HTML document contains too many uncommon non-printable characters, and not all of them will be shown in the live analysis.
<html> contains uncommon non-printable characters
If uncommon non-printable characters are detected, the URL of the document containing them is flagged.
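As a rough illustration of this kind of check, the sketch below scans decoded HTML for control characters. What counts as "uncommon" is an assumption here (Unicode category Cc, excluding tab, line feed and carriage return). The crawler's actual rules are not stated on this page, and `find_non_printable` is a hypothetical helper name.

```python
import unicodedata

# Assumption: tab, line feed and carriage return are treated as "common".
COMMON_CONTROLS = {"\t", "\n", "\r"}


def find_non_printable(text):
    """Return (offset, code point label) for each uncommon control character."""
    hits = []
    for offset, ch in enumerate(text):
        if unicodedata.category(ch) == "Cc" and ch not in COMMON_CONTROLS:
            hits.append((offset, f"U+{ord(ch):04X}"))
    return hits


sample = "<p>Hello\x00 world\x0b</p>\n"
print(find_non_printable(sample))  # [(8, 'U+0000'), (15, 'U+000B')]
```

Reporting offsets rather than just a yes/no flag makes it easier to locate the offending bytes in the source template.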
<html> starts with BOM
There is a Unicode byte order mark (BOM) at the top of the HTML.
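A minimal sketch of how a BOM could be detected on the raw response body, using the BOM constants from Python's standard `codecs` module. The function name `detect_bom` is hypothetical and not part of any Audisto API.

```python
import codecs

# Longer BOMs come first: the UTF-32 LE BOM begins with the UTF-16 LE BOM bytes.
BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def detect_bom(body: bytes):
    """Return the encoding implied by a leading BOM, or None if there is none."""
    for bom, name in BOMS:
        if body.startswith(bom):
            return name
    return None


print(detect_bom(b"\xef\xbb\xbf<!doctype html>"))  # utf-8
print(detect_bom(b"<!doctype html>"))              # None
```

The check has to run on bytes, before decoding, because many decoders silently strip or reinterpret the BOM.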
Charset: Charset set in HTTP Content-Type header and document differ
Both the document and the HTTP Content-Type header specify a charset, but the two are not identical.
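The comparison could be sketched roughly as follows. This is an illustration under assumptions, not the crawler's implementation: the `<meta>` declaration is found with a simple regular expression over the first 1024 bytes, and aliases such as `utf8` and `UTF-8` are treated as equal by normalising them through Python's codec registry. All function names are hypothetical.

```python
import codecs
import re
from email.message import Message

# Simplified pattern: matches <meta charset="..."> and the http-equiv form.
META_CHARSET = re.compile(rb"<meta[^>]+charset\s*=\s*[\"']?([\w-]+)", re.IGNORECASE)


def header_charset(content_type):
    """Extract the charset parameter from a Content-Type header value."""
    msg = Message()
    msg["Content-Type"] = content_type
    return msg.get_content_charset()


def document_charset(body: bytes):
    """Extract the charset declared in a <meta> tag near the top of the document."""
    match = META_CHARSET.search(body[:1024])
    return match.group(1).decode("ascii") if match else None


def normalise(name):
    """Map aliases to one canonical name; unknown labels are only lowercased."""
    try:
        return codecs.lookup(name).name
    except LookupError:
        return name.lower()


def charsets_differ(content_type, body):
    header, document = header_charset(content_type), document_charset(body)
    if header is None or document is None:
        return False  # fewer than two declarations: nothing to compare
    return normalise(header) != normalise(document)


print(charsets_differ("text/html; charset=ISO-8859-1", b'<meta charset="utf-8">'))  # True
print(charsets_differ("text/html; charset=UTF-8", b'<meta charset="utf8">'))        # False
```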
Charset: Invalid charset in Content-Type HTTP header
The Content-Type HTTP header specifies an invalid charset.
Charset: Invalid charset in Document
This hint identifies all HTML documents where the charset declaration in the HTML document specifies an invalid charset.
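One simple way to approximate a validity check is to ask whether a codec is registered for the declared label. The sketch below uses Python's codec registry as a stand-in. That is an assumption: browsers follow their own list of encoding labels, which does not match Python's registry exactly, and the crawler's own list is not given here.

```python
import codecs


def is_known_charset(label: str) -> bool:
    """Return True if Python's codec registry recognises the charset label."""
    try:
        codecs.lookup(label.strip())
    except LookupError:
        return False
    return True


for label in ("utf-8", "UTF8", "iso-8859-1", "utf-9"):
    print(label, is_known_charset(label))
```

Typos such as `utf-9` are the kind of declaration this check would catch.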
Charset: Not set
No charset is set, either in the Content-Type HTTP header or in the document (e.g. through a <meta> tag).
Problem-indicating encoding-related hints

<html> contains unencoded Left-To-Right-Mark or Right-To-Left-Mark
A Left-To-Right Mark or Right-To-Left Mark was found, but it is not encoded as an HTML entity.
<html> contains unencoded joiner chars
A zero-width joiner or zero-width non-joiner was found, but it is not encoded as an HTML entity.
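To show what "encoded as an HTML entity" means for these invisible characters, here is a minimal sketch that replaces the raw code points with their named character references. The helper name `encode_invisible_marks` is hypothetical, and whether to rewrite markup this way is an editorial choice rather than something this page prescribes.

```python
import html

# Raw invisible characters mapped to their HTML named character references.
INVISIBLE_MARKS = {
    "\u200e": "&lrm;",   # Left-To-Right Mark
    "\u200f": "&rlm;",   # Right-To-Left Mark
    "\u200d": "&zwj;",   # zero-width joiner
    "\u200c": "&zwnj;",  # zero-width non-joiner
}

TRANSLATION = str.maketrans(INVISIBLE_MARKS)


def encode_invisible_marks(markup: str) -> str:
    """Replace raw directional marks and joiners with visible entities."""
    return markup.translate(TRANSLATION)


encoded = encode_invisible_marks("a\u200db\u200e")
print(encoded)                                   # a&zwj;b&lrm;
print(html.unescape(encoded) == "a\u200db\u200e")  # True
```

Writing the entity keeps the character visible in the source, so it cannot be lost or introduced unnoticed during editing.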
Charset: Not set in Content-Type HTTP header
The Content-Type HTTP header does not specify a charset.
Charset: Not set in document
No charset is set in the document, e.g. through a <meta> tag.
Compression not enabled
Content compression was not enabled, or gzip, deflate or brotli compression is not supported by the server.
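A compression check can be approximated by inspecting the `Content-Encoding` response header. The sketch below assumes the request advertised `Accept-Encoding: gzip, deflate, br` and that the response headers are available as a plain dictionary. `compression_enabled` is a hypothetical helper, not part of any Audisto API.

```python
# Content-coding tokens for gzip, deflate and brotli ("br").
SUPPORTED_CODINGS = {"gzip", "deflate", "br"}


def compression_enabled(headers: dict) -> bool:
    """Return True if the response declares gzip, deflate or brotli encoding."""
    value = ""
    for name, header_value in headers.items():
        if name.lower() == "content-encoding":  # header names are case-insensitive
            value = header_value
    codings = {token.strip().lower() for token in value.split(",") if token.strip()}
    return bool(codings & SUPPORTED_CODINGS)


print(compression_enabled({"Content-Encoding": "gzip"}))   # True
print(compression_enabled({"Content-Type": "text/html"}))  # False
```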
Informational encoding-related hints

<html> contains unencoded soft hyphen (SHY)
A soft hyphen was found, but it is not encoded as an HTML entity.
Content-Type does not match file extension
This hint identifies all downloaded URLs where the Content-Type HTTP header does not match the file extension.
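A comparison of this kind might look like the sketch below, which uses Python's standard `mimetypes` table to guess the expected media type from the URL path. The mapping from extensions to types is an assumption (the crawler's own table is not documented here), and the example URLs are placeholders.

```python
import mimetypes
from urllib.parse import urlsplit


def content_type_mismatch(url: str, content_type: str) -> bool:
    """Return True if the header's media type contradicts the file extension."""
    expected, _ = mimetypes.guess_type(urlsplit(url).path)
    if expected is None:
        return False  # no or unknown extension: nothing to compare against
    actual = content_type.split(";", 1)[0].strip().lower()
    return actual != expected


print(content_type_mismatch("https://example.com/logo.png", "text/html; charset=utf-8"))
print(content_type_mismatch("https://example.com/logo.png", "image/png"))
```

URLs without an extension, such as `/about`, are skipped rather than flagged, since there is no extension to contradict.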