newspaper v0.1.7 Release Notes
Release Date: 2016-01-30 // about 9 years ago-
Closed issues:
- ImportError: cannot import name 'Image' #183
- Won't let me import #182
- Install on Mac - El Capitan Failed - "Operation not permitted" #181
- ⬇️ Downgrades to old versions of required packages upon installation #174
- Handling 404, 500, and other non-200 http response codes to prevent scraping error pages #142
- ⬇️ Libray downgrading in installation #138
🔀 Merged pull requests:
- Don't scrape error pages #190 (yprez)
- ➕ Added Hebrew stop words for language support #188 (alon7)
- 🛠 Fix installation and build #187 (yprez)
- 🛠 Fix installation docs #184 (yprez)
- 👷 Travis CI integration #180 (yprez)
- requirements.txt - Use minimal instead of exact versions #179 (yprez)
- 🖐 Handle lxml raising ValueError on node.itertext() - Python 3 #178 (yprez)
- 🖐 Handle lxml raising ValueError on node.itertext() #144 (yprez)
- 📜 Parse byline fix #132 (davecrumbacher)
Previous changes from v0.1.6
-
Closed issues:
- 🚑 Critical leak in newspaper.mthreading.Worker #177
- 👀 HTMLParseError #165
- Take local paths to .html files #153
- Wall Street Journal Full Text is not Correctly Scraped #150
- Article HTML Returning Null #131
- No articles #130
- Loading Pages that use heavy javascript #127
- Login handling for premium websites #126
- Installation of nltk is failing #121
🔀 Merged pull requests:
- 👌 Support urls with dots #176 (alexanderlukanin13)
- ⬆️ upgrade beautifulsoup4 to 4.4.1 for python 3.5 #171 (AlJohri)
- ⚡️ Updated requests version #170 (adrienthiery)
- Turkish Language added #169 (muratcorlu)
- ➕ Add macedonian stopwords #166 (dimitrovskif)
- Issue#95 added graceful string concatenation #157 (surajssd)
- 🛠 fix for "jpeg error with PIL, Can't convert 'NoneType' object to str implicitly" #154 (hnykda)
- bugfix in article.py, is_valid_body #149 (ms8r)
- 🛠 Fixed typo #139 (Eleonore9)
- Correct link for the Python 3 branch #136 (jtpio)
- ➕ Add python3-pip install step for Ubuntu #135 (irnc)