Changelog History

  • v2.0.12 Changes

    February 12, 2022

    Fixed

    • ASCII misdetection in rare cases (PR #170)
  • v2.0.11 Changes

    January 30, 2022

    Added

    • Explicit support for Python 3.11 (PR #164)

    Changed

    • The logging behavior has been completely reviewed and now uses only TRACE and DEBUG levels (PR #163 #165)
  • v2.0.10 Changes

    January 04, 2022

    Fixed

    • Fallback match entries might lead to UnicodeDecodeError for large byte sequences (PR #154)

    Changed

    • Skipping the language-detection (CD) on ASCII (PR #155)
  • v2.0.9 Changes

    December 03, 2021

    Changed

    • Moderating the logging impact (since 2.0.8) for specific environments (PR #147)

    Fixed

    • Wrong logging level applied when setting kwarg explain to True (PR #146)
  • v2.0.8 Changes

    November 24, 2021

    Changed

    • Improvement over Vietnamese detection (PR #126)
    • MD improvement on trailing data and long foreign (non-pure Latin) data (PR #124)
    • Efficiency improvements in cd/alphabet_languages from @adbar (PR #122)
    • Call sum() without an intermediary list following PEP 289 recommendations from @adbar (PR #129)
    • Code style as refactored by Sourcery-AI (PR #131)
    • Minor adjustment on the MD around European words (PR #133)
    • Remove and replace SRTs from assets / tests (PR #139)
    • Initialize the library logger with a NullHandler by default from @nmaynes (PR #135)
    • Setting kwarg explain to True will provisionally add a specific stream handler (bounded to the function lifespan) (PR #135)
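
The PEP 289 item above can be illustrated with a minimal sketch. The function names `count_alpha_with_list` and `count_alpha_with_generator` are hypothetical and are not taken from the library; they only contrast the two call styles.

```python
def count_alpha_with_list(text: str) -> int:
    # Builds a full intermediate list before summing.
    return sum([1 for ch in text if ch.isalpha()])


def count_alpha_with_generator(text: str) -> int:
    # PEP 289 style: the generator expression feeds sum() lazily,
    # so no intermediate list is allocated.
    return sum(1 for ch in text if ch.isalpha())


sample = "abc 123 é"
same_result = count_alpha_with_list(sample) == count_alpha_with_generator(sample)
```

Both forms return the same count; the generator form simply avoids materializing the list.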

    Fixed

    • Fix large (misleading) sequence giving UnicodeDecodeError (PR #137)
    • Avoid using chunks that are too insignificant (PR #137)

    Added

    • Add and expose function set_logging_handler to configure a specific StreamHandler from @nmaynes (PR #135)
    • Add CHANGELOG.md entries, format is based on Keep a Changelog (PR #141)
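
The logging entries for this release describe a common pattern for libraries: attach a NullHandler by default, and add a stream handler only for the duration of a call when the caller asks for an explanation. The sketch below uses only the standard `logging` module. The logger name `example_library`, the function `detect`, its `stream` parameter, and its placeholder return value are assumptions made for illustration; this is not the library's implementation.

```python
import io
import logging

# Hypothetical library logger; a NullHandler keeps the library silent
# unless the application configures logging itself.
logger = logging.getLogger("example_library")
logger.addHandler(logging.NullHandler())


def detect(payload: bytes, explain: bool = False, stream=None) -> str:
    """Hypothetical detection entry point with a temporary explain handler."""
    handler = None
    previous_level = logger.level
    if explain:
        # Attach a stream handler bounded to this call only.
        handler = logging.StreamHandler(stream)
        logger.addHandler(handler)
        logger.setLevel(logging.DEBUG)
    try:
        logger.debug("inspecting %d bytes", len(payload))
        return "utf_8"  # placeholder result, no real detection happens here
    finally:
        if handler is not None:
            # Restore the logger to its state before the call.
            logger.removeHandler(handler)
            logger.setLevel(previous_level)


buffer = io.StringIO()
detect(b"abc", explain=True, stream=buffer)
```

After the call returns, only the NullHandler remains attached, so later calls without `explain` produce no output.
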
  • v2.0.7 Changes

    October 11, 2021

    Added

    • Add support for Kazakh (Cyrillic) language detection (PR #109)

    Changed

    • Further improve inference of the language from a given single-byte code page (PR #112)
    • Vainly trying to leverage PEP 263 when PEP 3120 is not supported (PR #116)
    • Refactoring for potential performance improvements in loops from @adbar (PR #113)
    • Various detection improvements (MD+CD) (PR #117)

    Removed

    • Remove redundant logging entry about detected language(s) (PR #115)

    Fixed

    • Fix a minor inconsistency between Python 3.5 and other versions regarding language detection (PR #117 #102)
  • v2.0.6 Changes

    September 18, 2021

    Fixed

    • Unforeseen regression that broke backward compatibility with some older minor versions of Python 3.5.x (PR #100)
    • Fix CLI crash when using --minimal output in certain cases (PR #103)

    Changed

    • Minor improvement to the detection efficiency (less than 1%) (PR #106 #101)
  • v2.0.5 Changes

    September 14, 2021

    Changed

    • The project now complies with flake8, mypy, isort and black to ensure a better overall quality (PR #81)
    • Backward-compatibility (BC) support with v1.x was improved; the old staticmethods are restored (PR #82)
    • The Unicode detection is slightly improved (PR #93)
    • Add syntactic sugar __bool__ for the CharsetMatches results list-container (PR #91)
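
To show what a `__bool__` method on a results container buys the caller, here is a minimal sketch. The class `MatchList` is hypothetical and only mimics the idea; it is not the library's CharsetMatches class.

```python
class MatchList:
    """Hypothetical list-like container of detection results."""

    def __init__(self, results=None):
        self._results = list(results or [])

    def __len__(self) -> int:
        return len(self._results)

    def __bool__(self) -> bool:
        # Truthy only when at least one result is present.
        return len(self._results) > 0


empty = MatchList()
found = MatchList(["utf_8"])

# Callers can test the container directly instead of checking its length.
status = "match" if found else "no match"
```

With `__bool__` defined, `if results:` reads naturally and behaves like an emptiness check.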

    Removed

    • The project no longer raises a warning on tiny content given for detection; it is simply logged as a warning instead (PR #92)

    Fixed

    • In some rare cases, the chunks extractor could cut in the middle of a multi-byte character and could mislead the mess detection (PR #95)
    • Some rare 'space' characters could trip up the UnprintablePlugin/Mess detection (PR #96)
    • The MANIFEST.in was not exhaustive (PR #78)
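
The multi-byte chunk problem mentioned in the fixes above can be reproduced with the standard library alone. The sketch below cuts a UTF-8 payload in the middle of a two-byte character, shows that decoding the chunk in isolation fails, and then decodes the same pieces with an incremental decoder. The incremental decoder is a standard technique swapped in for illustration; it is an assumption and not necessarily how the library's chunk extractor was fixed.

```python
import codecs

payload = "héllo wörld".encode("utf-8")

# 'h' is byte 0 and 'é' occupies bytes 1-2, so cutting at offset 2
# leaves the first chunk ending on half of a character.
first_chunk, second_chunk = payload[:2], payload[2:]

try:
    first_chunk.decode("utf-8")
    naive_failed = False
except UnicodeDecodeError:
    naive_failed = True

# An incremental decoder buffers the dangling byte until the rest arrives.
decoder = codecs.getincrementaldecoder("utf-8")()
decoded = decoder.decode(first_chunk) + decoder.decode(second_chunk, final=True)
```

A chunk that fails to decode on its own can look like a sign of a wrong encoding, which is why a badly placed cut can mislead a detector.
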
  • v2.0.4 Changes

    July 30, 2021

    Fixed

    • The CLI no longer raises an unexpected exception when no encoding has been found (PR #70)
    • Fix accessing the 'alphabets' property when the payload contains surrogate characters (PR #68)
    • The logger could mislead (explain=True) on detected languages and the impact of one MBCS match (PR #72)
    • Submatch factoring could be wrong in rare edge cases (PR #72)
    • Files given to the CLI after the first path were ignored when publishing results to STDOUT (PR #72)
    • Fix line endings from CRLF to LF for certain project files (PR #67)

    Changed

    • Adjust the MD to lower the sensitivity, thus improving the global detection reliability (PR #69 #76)
    • Allow fallback on specified encoding if any (PR #71)
  • v2.0.3 Changes

    July 16, 2021

    Changed

    • Part of the detection mechanism has been improved to be less sensitive, resulting in more accurate detection results, especially for ASCII. (PR #63)
    • Following community requests, the detection will fall back on ASCII or UTF-8 as a last resort. (PR #64)
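
The last-resort fallback described above can be pictured roughly as follows. This is a minimal sketch under stated assumptions: the function name `last_resort_encoding` is hypothetical, and the order of checks (ASCII first, then UTF-8) is an illustration rather than the library's actual internal logic.

```python
from typing import Optional


def last_resort_encoding(payload: bytes) -> Optional[str]:
    """Hypothetical fallback: return 'ascii' or 'utf_8' if the payload decodes."""
    for encoding in ("ascii", "utf_8"):
        try:
            payload.decode(encoding)
        except UnicodeDecodeError:
            continue
        return encoding
    # Neither fallback applies; the caller has no result to report.
    return None


plain = last_resort_encoding(b"hello")
accented = last_resort_encoding("é".encode("utf-8"))
invalid = last_resort_encoding(b"\xff\xfe\xfa")
```

ASCII is tried first because every ASCII payload is also valid UTF-8, so checking in that order reports the narrower encoding when it applies.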