trafilatura v1.3.0 Release Notes
- fast and robust `html2txt()` function added (#221)
- more robust parsing (#228)
- fixed bugs in metadata extraction, with @felipehertzer in #213 & #226
- extraction about 10-20% faster, slightly better recall
- partial fixes for memory leaks (#216)
- docs extended and updated (#217, #225)
- prepared deprecation of old `process_record()` function
- more stable processing with updated dependencies
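
A minimal sketch of how the new `html2txt()` function might be called. It assumes the function can be imported from the top-level `trafilatura` package and takes an HTML string. The `fallback_html2txt` helper and the `SAMPLE_HTML` document are hypothetical, added only so the snippet also runs where trafilatura is not installed; the helper is a rough standard-library approximation, not trafilatura's implementation.

```python
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collects the text nodes of a document and ignores all markup."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def fallback_html2txt(html):
    """Hypothetical stand-in: strip tags and normalize whitespace."""
    parser = _TextCollector()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())


try:
    # Assumed import path for the function added in this release.
    from trafilatura import html2txt
except ImportError:
    # trafilatura is not installed: use the stdlib approximation instead.
    html2txt = fallback_html2txt

# Hypothetical sample document, for illustration only.
SAMPLE_HTML = """<html><body>
<h1>Release notes</h1>
<p>Extraction is now faster.</p>
</body></html>"""

text = html2txt(SAMPLE_HTML)
print(text)
```

A function of this kind is mostly useful as a last resort, when the regular extraction returns nothing and any plain text is better than none.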