Charset Normalizer v3.0.0.rc1 Release Notes
Release Date: 2022-10-18 // over 1 year ago-
โ Added
- ๐ฒ Extend the capability of explain=True when cp_isolation contains at most two entries (min one), will log in details of the Mess-detector results
- ๐ Support for alternative language frequency set in charset_normalizer.assets.FREQUENCIES
- Add parameter
language_threshold
infrom_bytes
,from_path
andfrom_fp
to adjust the minimum expected coherence ratio
๐ Changed
- ๐ Build with static metadata using 'build' frontend
- ๐ Make the language detection stricter
๐ Fixed
- CLI with opt --normalize fail when using full path for files
- ๐ TooManyAccentuatedPlugin induce false positive on the mess detection when too few alpha character have been fed to it
โ Removed
- Coherence detector no longer return 'Simple English' instead return 'English'
- Coherence detector no longer return 'Classical Chinese' instead return 'Chinese'