Goose was originally an article extractor written in Java that has most
recently (Aug2011) been converted to a scala project.
This is a complete rewrite in Python. The aim of the software is to take any news article or article-type web page and not only extract what is the main body of the article but also all meta data and most probable image candidate.
Goose will try to extract the following information:
python-goose alternatives and similar packages
Based on the "Web Content Extracting" category
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest. Visit our partner's website for more details.
Do you think we are missing an alternative of python-goose or a related project?