ftfy v5.6 Release Notes

Release Date: 2019-08-07 // over 4 years ago
    • ๐Ÿ‘ The unescape_html function now supports all the HTML5 entities that appear in html.entities.html5, including those with long names such as ˝.

    • Unescaping of numeric HTML entities now uses the standard library's html.unescape, making edge cases consistent.

    (The reason we don't run html.unescape on all text is that it's not always appropriate to apply, and can lead to false positive fixes. The text "This&NotThat" should not have "&Not" replaced by a symbol, as html.unescape would do.)

    • ๐Ÿ‘ On top of Python's support for HTML5 entities, ftfy will also convert HTML escapes of common Latin capital letters that are (nonstandardly) written in all caps, such as Ñ for ร‘.