All Versions
40
Latest Version
Avg Release Cycle
26 days
Latest Release
588 days ago

Changelog History
Page 4

  • v1.3.1 Changes

    October 11, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ‘Œ Improvement : Expose version in package (#18)
    • ๐Ÿ›  Bugfix : Division by zero (#23)
    • ๐Ÿ‘Œ Improvement : Prefers unicode (utf-8) when detected (#19)
  • v1.3.0 Changes

    September 30, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ”‹ Feature : Now support unicodedata2 backport. To benefit from it install using pip install charset-normalizer[UnicodeDataBackport]. Python 3.7 have UnicodeData v11. You could upgrade it to v12.
    • ๐Ÿ”‹ Feature : Added preemptive behaviour. Looking for a declared encoding. Using positional parameter preemptive_behaviour. Default to True. Does not take declared encoding for it, testing it first.
    • ๐Ÿ‘Œ Improvement : Added aliases to CharsetNormalizerMatches class. CharsetDetector; EncodingDetector and CharsetDoctor.
  • v1.2.0 Changes

    September 28, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ”‹ Feature : Added has_submatch, percent_chaos and percent_coherence properties on single match object.
    • ๐Ÿ‘Œ Improvement : best() method of CharsetNormalizerMatches has been rewritten for better readability.
    • ๐Ÿ”‹ Feature : Added explain boolean positional parameter to print out what actually happen when searching for a match.
    • ๐Ÿ‘Œ Improvement : Detection has been globally improved.
    • ๐Ÿ”‹ Feature : You can exclude some encoding when searching for a match with parameter cp_exclusion. List of str. for from_bytes from_path and from_fp.
    • ๐Ÿ”‹ Feature : You can limit the search to some encoding when looking for a match with parameter cp_isolation. List of str. for from_bytes from_path and from_fp.
    • ๐Ÿ”‹ Feature : import charset_normalizer is enough to provide additional help when you encounter UnicodeDecodeError exception.
  • v1.1.1 Changes

    September 23, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ›  Bugfix : from_bytes parameters steps and chunk_size were not adapted to sequence len if provided values were not fitted to content. Therefore could lead to misdetection on small content.
  • v1.1.0 Changes

    September 21, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ›  Bugfix : Sequence having lenght bellow 10 chars was not checked by ProbeChaos at all. (#14)
    • ๐Ÿ›  Bugfix : Legacy detect method inspired by chardet was not returning intended result when having no result. (#14)
  • v1.0.0

    September 17, 2019
  • v0.3 Changes

    September 12, 2019
    • ๐Ÿ‘Œ Improvement on detection
    • ๐ŸŽ Performance loss to expect
    • โž• Added --threshold option to CLI
  • v0.2.3 Changes

    September 06, 2019
    • ๐Ÿ›  Bugfix on UTF 7 support
    • Legacy detect(byte_str) method
  • v0.2.2 Changes

    September 04, 2019

    RC 5

    • ๐Ÿ‘ BOM support (Unicode mostly)
    • Chaos prober improved on small text
  • v0.2.1 Changes

    September 03, 2019

    RC 4

    • ๐ŸŽ Probe Chaos: Code cleanup, performance review and accuracy improved