Popularity
7.5
Stable
Activity
4.6
Growing
3,252
85
490
Programming language: HTML
License: MIT License
Tags:
Web Content Extracting
Latest version: v1.6.3
textract alternatives and similar packages
Based on the "Web Content Extracting" category.
Alternatively, view textract alternatives based on common mentions on social networks and blogs.
-
TWINT
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations. -
newspaper
News, full-text, and article metadata extraction in Python 3. Advanced docs: -
python-goose
Html Content / Article Extractor, web scrapping lib in Python -
sumy
Module for automatic summarization of text documents and HTML pages. -
python-readability
fast python port of arc90's readability tool, updated to match latest readability.js! -
Goose3
A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html -
trafilatura
Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments) -
inscriptis -- HTML to text conversion library, command line client and Web service
1.9 7.3 textract VS inscriptis -- HTML to text conversion library, command line client and Web serviceA python based HTML to text conversion library, command line client and Web service. -
htmldate
Fast and robust date extraction from web pages, with Python or on the command-line -
Data Extractor
Combine XPath, CSS Selectors and JSONPath for Web data extracting.
Less time debugging, more time building
Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.
Promo
scoutapm.com
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.
Do you think we are missing an alternative of textract or a related project?