newspaper v0.1.7 Release Notes

Release Date: 2016-01-30 // about 4 years ago
  • Full Changelog

    Closed issues:

    • ImportError: cannot import name 'Image' #183
    • Won't let me import #182
    • Install on Mac - El Capitan Failed - "Operation not permitted" #181
    • ⬇️ Downgrades to old versions of required packages upon installation #174
    • Handling 404, 500, and other non-200 http response codes to prevent scraping error pages #142
    • ⬇️ Libray downgrading in installation #138

    🔀 Merged pull requests:

    • Don't scrape error pages #190 (yprez)
    • ➕ Added Hebrew stop words for language support #188 (alon7)
    • 🛠 Fix installation and build #187 (yprez)
    • 🛠 Fix installation docs #184 (yprez)
    • 👷 Travis CI integration #180 (yprez)
    • requirements.txt - Use minimal instead of exact versions #179 (yprez)
    • 🖐 Handle lxml raising ValueError on node.itertext() - Python 3 #178 (yprez)
    • 🖐 Handle lxml raising ValueError on node.itertext() #144 (yprez)
    • 📜 Parse byline fix #132 (davecrumbacher)

Previous changes from v0.1.6

  • Full Changelog

    Closed issues:

    • 🚑 Critical leak in newspaper.mthreading.Worker #177
    • 👀 HTMLParseError #165
    • Take local paths to .html files #153
    • Wall Street Journal Full Text is not Correctly Scraped #150
    • Article HTML Returning Null #131
    • No articles #130
    • Loading Pages that use heavy javascript #127
    • Login handling for premium websites #126
    • Installation of nltk is failing #121

    🔀 Merged pull requests: