All Versions
33
Latest Version
Avg Release Cycle
37 days
Latest Release
134 days ago
Changelog History
Page 1
Changelog History
Page 1
-
v2.0.12 Changes
February 12, 2022๐ Fixed
- ASCII miss-detection on rare cases (PR #170)
-
v2.0.11 Changes
January 30, 2022โ Added
- ๐ Explicit support for Python 3.11 (PR #164)
๐ Changed
- ๐ฒ The logging behavior have been completely reviewed, now using only TRACE and DEBUG levels (PR #163 #165)
-
v2.0.10 Changes
January 04, 2022๐ Fixed
- Fallback match entries might lead to UnicodeDecodeError for large bytes sequence (PR #154)
๐ Changed
- Skipping the language-detection (CD) on ASCII (PR #155)
-
v2.0.9 Changes
December 03, 2021๐ Changed
- ๐ฒ Moderating the logging impact (since 2.0.8) for specific environments (PR #147)
๐ Fixed
- ๐ฒ Wrong logging level applied when setting kwarg
explain
to True (PR #146)
-
v2.0.8 Changes
November 24, 2021๐ Changed
- ๐ Improvement over Vietnamese detection (PR #126)
- MD improvement on trailing data and long foreign (non-pure latin) data (PR #124)
- Efficiency improvements in cd/alphabet_languages from @adbar (PR #122)
- call sum() without an intermediary list following PEP 289 recommendations from @adbar (PR #129)
- ๐ Code style as refactored by Sourcery-AI (PR #131)
- Minor adjustment on the MD around european words (PR #133)
- โ Remove and replace SRTs from assets / tests (PR #139)
- ๐ Initialize the library logger with a
NullHandler
by default from @nmaynes (PR #135) - Setting kwarg
explain
to True will add provisionally (bounded to function lifespan) a specific stream handler (PR #135)
๐ Fixed
- ๐ Fix large (misleading) sequence giving UnicodeDecodeError (PR #137)
- Avoid using too insignificant chunk (PR #137)
โ Added
- ๐ฒ Add and expose function
set_logging_handler
to configure a specific StreamHandler from @nmaynes (PR #135) - โ Add
CHANGELOG.md
entries, format is based on Keep a Changelog (PR #141)
-
v2.0.7 Changes
October 11, 2021โ Added
- โ Add support for Kazakh (Cyrillic) language detection (PR #109)
๐ Changed
- Further, improve inferring the language from a given single-byte code page (PR #112)
- ๐ Vainly trying to leverage PEP263 when PEP3120 is not supported (PR #116)
- ๐ Refactoring for potential performance improvements in loops from @adbar (PR #113)
- Various detection improvement (MD+CD) (PR #117)
โ Removed
- โ Remove redundant logging entry about detected language(s) (PR #115)
๐ Fixed
- ๐ Fix a minor inconsistency between Python 3.5 and other versions regarding language detection (PR #117 #102)
-
v2.0.6 Changes
September 18, 2021๐ Fixed
- ๐ Unforeseen regression with the loss of the backward-compatibility with some older minor of Python 3.5.x (PR #100)
- ๐ Fix CLI crash when using --minimal output in certain cases (PR #103)
๐ Changed
- Minor improvement to the detection efficiency (less than 1%) (PR #106 #101)
-
v2.0.5 Changes
September 14, 2021๐ Changed
- ๐ The project now comply with: flake8, mypy, isort and black to ensure a better overall quality (PR #81)
- โช The BC-support with v1.x was improved, the old staticmethods are restored (PR #82)
- The Unicode detection is slightly improved (PR #93)
- Add syntax sugar __bool__ for results CharsetMatches list-container (PR #91)
โ Removed
- โ The project no longer raise warning on tiny content given for detection, will be simply logged as warning instead (PR #92)
๐ Fixed
- In some rare case, the chunks extractor could cut in the middle of a multi-byte character and could mislead the mess detection (PR #95)
- ๐ Some rare 'space' characters could trip up the UnprintablePlugin/Mess detection (PR #96)
- The MANIFEST.in was not exhaustive (PR #78)
-
v2.0.4 Changes
July 30, 2021๐ Fixed
- ๐ป The CLI no longer raise an unexpected exception when no encoding has been found (PR #70)
- ๐ Fix accessing the 'alphabets' property when the payload contains surrogate characters (PR #68)
- The logger could mislead (explain=True) on detected languages and the impact of one MBCS match (PR #72)
- Submatch factoring could be wrong in rare edge cases (PR #72)
- Multiple files given to the CLI were ignored when publishing results to STDOUT. (After the first path) (PR #72)
- ๐ Fix line endings from CRLF to LF for certain project files (PR #67)
๐ Changed
- Adjust the MD to lower the sensitivity, thus improving the global detection reliability (PR #69 #76)
- ๐ Allow fallback on specified encoding if any (PR #71)
-
v2.0.3 Changes
July 16, 2021๐ Changed
- Part of the detection mechanism has been improved to be less sensitive, resulting in more accurate detection results. Especially ASCII. (PR #63)
- According to the community wishes, the detection will fall back on ASCII or UTF-8 in a last-resort case. (PR #64)