Popularity: 8.3 (Stable)
Activity: 0.0 (Stable)

Description

Goose was originally an article extractor written in Java that was most recently (Aug 2011) converted to a Scala project.

This is a complete rewrite in Python. The aim of the software is to take any news article or article-type web page and extract not only the main body of the article but also all metadata and the most probable image candidate.
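
To make the idea of pulling metadata and an image candidate out of a page more concrete, here is a minimal sketch that uses only the Python standard library. It is not Goose's algorithm or API: the names `MetadataSketch`, `extract_metadata` and `SAMPLE_HTML` are hypothetical, and treating the `og:image` meta tag as the image candidate is an assumption made purely for illustration.

```python
from html.parser import HTMLParser


class MetadataSketch(HTMLParser):
    """Collects the page title and <meta> tags (illustrative, not Goose's code)."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            # Standard tags use name=..., Open Graph tags use property=...
            key = attrs.get("name") or attrs.get("property")
            if key and attrs.get("content") is not None:
                self.meta[key.lower()] = attrs["content"]

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Accumulate text only while inside <title>...</title>
        if self._in_title:
            self.title += data


def extract_metadata(html):
    """Return a small dict of metadata found in an HTML string."""
    parser = MetadataSketch()
    parser.feed(html)
    parser.close()
    return {
        "title": parser.title.strip(),
        "description": parser.meta.get("description", ""),
        "keywords": parser.meta.get("keywords", ""),
        # Assumption: use the Open Graph image as the image candidate.
        "image_candidate": parser.meta.get("og:image", ""),
    }


# Hypothetical sample page used only to exercise the sketch.
SAMPLE_HTML = """
<html><head>
<title>Example headline</title>
<meta name="description" content="A short summary.">
<meta name="keywords" content="news, example">
<meta property="og:image" content="https://example.com/lead.jpg">
</head><body><p>Body text.</p></body></html>
"""

if __name__ == "__main__":
    for field, value in extract_metadata(SAMPLE_HTML).items():
        print(field, "=", value)
```

A real extractor also has to locate the main article body and rank candidate images found in the page content, which this sketch does not attempt.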

Goose will try to extract the following information:

Programming language: HTML
License: Apache License 2.0
