All Versions
13
Latest Version
Avg Release Cycle
9 days
Latest Release
679 days ago

Changelog History
Page 1

  • v1.3.4 Changes

    December 16, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ‘Œ Improvement/Bugfix : False positive when searching for successive upper, lower char. (ProbeChaos) (#31)
  • v1.3.3 Changes

    December 16, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ‘Œ Improvement : Noticeable better detection for jp #30
  • v1.3.2 Changes

    December 13, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ›  Bugfix : Passing zero-length bytes to from_bytes (#29)
  • v1.3.1 Changes

    October 11, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ‘Œ Improvement : Expose version in package (#18)
    • ๐Ÿ›  Bugfix : Division by zero (#23)
    • ๐Ÿ‘Œ Improvement : Prefers unicode (utf-8) when detected (#19)
  • v1.3.0 Changes

    September 30, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ”‹ Feature : Now support unicodedata2 backport. To benefit from it install using pip install charset-normalizer[UnicodeDataBackport]. Python 3.7 have UnicodeData v11. You could upgrade it to v12.
    • ๐Ÿ”‹ Feature : Added preemptive behaviour. Looking for a declared encoding. Using positional parameter preemptive_behaviour. Default to True. Does not take declared encoding for it, testing it first.
    • ๐Ÿ‘Œ Improvement : Added aliases to CharsetNormalizerMatches class. CharsetDetector; EncodingDetector and CharsetDoctor.
  • v1.2.0 Changes

    September 28, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ”‹ Feature : Added has_submatch, percent_chaos and percent_coherence properties on single match object.
    • ๐Ÿ‘Œ Improvement : best() method of CharsetNormalizerMatches has been rewritten for better readability.
    • ๐Ÿ”‹ Feature : Added explain boolean positional parameter to print out what actually happen when searching for a match.
    • ๐Ÿ‘Œ Improvement : Detection has been globally improved.
    • ๐Ÿ”‹ Feature : You can exclude some encoding when searching for a match with parameter cp_exclusion. List of str. for from_bytes from_path and from_fp.
    • ๐Ÿ”‹ Feature : You can limit the search to some encoding when looking for a match with parameter cp_isolation. List of str. for from_bytes from_path and from_fp.
    • ๐Ÿ”‹ Feature : import charset_normalizer is enough to provide additional help when you encounter UnicodeDecodeError exception.
  • v1.1.1 Changes

    September 23, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ›  Bugfix : from_bytes parameters steps and chunk_size were not adapted to sequence len if provided values were not fitted to content. Therefore could lead to misdetection on small content.
  • v1.1.0 Changes

    September 21, 2019

    ๐Ÿ”„ Changes :

    • ๐Ÿ›  Bugfix : Sequence having lenght bellow 10 chars was not checked by ProbeChaos at all. (#14)
    • ๐Ÿ›  Bugfix : Legacy detect method inspired by chardet was not returning intended result when having no result. (#14)
  • v1.0.0

    September 17, 2019
  • v0.3 Changes

    September 12, 2019
    • ๐Ÿ‘Œ Improvement on detection
    • ๐ŸŽ Performance loss to expect
    • โž• Added --threshold option to CLI