charset-normalizer

Here are 3 public repositories matching this topic...

A lightweight, fast XML file splitter with built-in tag data validation, written with the XMLParser library. Its main goal is to split an XML file into multiple small chunks (hence the name) and save each chunk as a separate, smaller XML file.

  • Updated Oct 26, 2023
  • PHP

bytesense

Charset and encoding detection for Python. Identifies the encoding of any byte sequence using byte-distribution fingerprinting, language coherence scoring, and mess detection. Supports streaming detection, mojibake repair, analysis of documents that mix several encodings, and in-band HTML/XML encoding hints. Described as a drop-in replacement for chardet and charset-normalizer.

  • Updated Mar 26, 2026
  • Python