Charset Normalizer v3.0.0.rc1 Release Notes

Release Date: 2022-10-18 // over 1 year ago
  • โž• Added

    • ๐ŸŒฒ Extend the capability of explain=True when cp_isolation contains at most two entries (min one), will log in details of the Mess-detector results
    • ๐Ÿ‘Œ Support for alternative language frequency set in charset_normalizer.assets.FREQUENCIES
    • Add parameter language_threshold in from_bytes, from_path and from_fp to adjust the minimum expected coherence ratio

    ๐Ÿ”„ Changed

    • ๐Ÿ“‡ Build with static metadata using 'build' frontend
    • ๐Ÿ‘‰ Make the language detection stricter

    ๐Ÿ›  Fixed

    • CLI with opt --normalize fail when using full path for files
    • ๐Ÿ”Œ TooManyAccentuatedPlugin induce false positive on the mess detection when too few alpha character have been fed to it

    โœ‚ Removed

    • Coherence detector no longer return 'Simple English' instead return 'English'
    • Coherence detector no longer return 'Classical Chinese' instead return 'Chinese'