feedparser v4.2 Release Notes

Release Date: 2008-03-12
    • Support for parsing microformats, including rel=enclosure, rel=tag, XFN, and hCard.
    • Updated the whitelist of acceptable HTML elements and attributes based on the latest draft of the HTML (HyperText Markup Language) 5 specification.
    • Support for CSS sanitization. (Previous versions of Universal Feed Parser simply stripped all inline styles.) Many thanks to Sam Ruby for implementing this, despite my insistence that it was impossible.
    • Support for SVG sanitization.
    • Support for MathML sanitization. Many thanks to Jacques Distler for patiently debugging this feature.
    • IRI (Internationalized Resource Identifier) support for every element that can contain a URI (Uniform Resource Identifier).
    • Ability to disable relative URI resolution.
    • Command-line arguments and alternate serializers, for manipulating Universal Feed Parser from shell scripts or other non-Python sources.
    • More robust parsing of author email addresses, misencoded win-1252 content, and rel=self links, plus better detection of HTML content in elements with ambiguous content types.
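
To make the "disable relative URI resolution" item concrete, here is a minimal standard-library sketch of what resolving (or not resolving) relative links against a feed's base URI looks like. It is an illustration only, not Universal Feed Parser's own code: the function `resolve_links`, its `resolve_relative` parameter, and the example URLs are all hypothetical names chosen for this sketch.

```python
from urllib.parse import urljoin


def resolve_links(links, base, resolve_relative=True):
    """Return the links, resolved against `base` unless resolution is disabled.

    Hypothetical helper for illustration; not part of feedparser.
    """
    if not resolve_relative:
        # Resolution disabled: hand back the links exactly as they appeared.
        return list(links)
    # urljoin leaves absolute URIs alone and resolves relative ones.
    return [urljoin(base, link) for link in links]


# Assumed example data: a feed URL acting as the base URI.
base = "http://example.org/feeds/atom.xml"
links = ["/images/logo.png", "entry/1", "http://other.example/abs"]

resolved = resolve_links(links, base)
untouched = resolve_links(links, base, resolve_relative=False)

print(resolved)
print(untouched)
```

Leaving links untouched can be useful when the consumer wants to apply its own base URI later, or needs to preserve the feed's content byte for byte.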