Scrapy/CHANGELOG and Scrapy Releases

All Versions

Latest Version

2.4.1

Avg Release Cycle

55 days

Latest Release

1256 days ago

Changelog History

Page 1

v2.4.1 Changes
November 17, 2020
🛠 Fixed feed exports overwrite support
🛠 Fixed the asyncio event loop handling, which could make code hang
🛠 Fixed the IPv6-capable DNS resolver CachingHostnameResolver for download handlers that call reactor.resolve
🛠 Fixed the output of the genspider command showing placeholders instead of the import part of the generated spider module (issue 4874)
v2.4.0 Changes
October 11, 2020
Hihglights:
👍 Python 3.5 support has been dropped.
📄 The file_path method of media pipelines can now access the source item.

This allows you to set a download file path based on item data.
The new item_export_kwargs key of the FEEDS setting allows to define keyword parameters to pass to item exporter classes.
📄 You can now choose whether feed exports overwrite or append to the output file.

📄 For example, when using the crawl or runspider commands, you can use the -O option instead of -o to overwrite the output file.
👍 Zstd-compressed responses are now supported if zstandard is installed.
In settings, where the import path of a class is required, it is now possible to pass a class object instead.

👀 See the full changelog
v2.3.0 Changes
August 04, 2020
Hihglights:
📄 Feed exports now support Google Cloud Storage as a storage backend
The new FEED_EXPORT_BATCH_ITEM_COUNT setting allows to deliver output items in batches of up to the specified number of items.

↪ It also serves as a workaround for delayed file delivery, which causes Scrapy to only start item delivery after the crawl has finished when using certain storage backends (S3, FTP, and now GCS).
🚀 The base implementation of item loaders has been moved into a separate library, itemloaders, allowing usage from outside Scrapy and a separate release schedule

👀 See the full changelog
v2.2.1 Changes
July 17, 2020
📄 The startproject command no longer makes unintended changes to the permissions of files in the destination folder, such as removing execution permissions.
v2.2.0 Changes
June 24, 2020
Highlights:
- Python 3.5.2+ is required now
- 📄 dataclass objects and attrs objects are now valid item types
- 🆕 New TextResponse.json method
- 🚦 New bytes_received signal that allows canceling response download
- 🛠 CookiesMiddleware fixes
👀 See the full changelog
v2.1.0 Changes
April 24, 2020
Highlights:
- 🆕 New FEEDS setting to export to multiple feeds
- ➕ New Response.ip_address attribute
👀 See the full changelog
v2.0.1 Changes
March 18, 2020
- 👍 Response.follow_all now supports an empty URL iterable as input (#4408, #4420)
- ✂ Removed top-level reactor imports to prevent errors about the wrong Twisted reactor being installed when setting a different Twisted reactor using TWISTED_REACTOR (#4401, #4406)
v2.0.0 Changes
March 03, 2020
Highlights:
- 🚚 Python 2 support has been removed
- 👍 Partial coroutine syntax support and experimental asyncio support
- 🆕 New Response.follow_all method
- 👍 FTP support for media pipelines
- 🆕 New Response.certificate attribute
- 👍 IPv6 support through DNS_RESOLVER
👀 See the full changelog
v1.8.0
October 29, 2019
v1.7.4 Changes
October 21, 2019
⏪ Revert the fix for #3804 (#3819), which has a few undesired side effects (#3897, #3976).

Scrapy changelog

Scrapy, a fast high-level web crawling & scraping framework for Python.

Changelog History

Page 1

v2.4.1 Changes

v2.4.0 Changes

v2.3.0 Changes

v2.2.1 Changes

v2.2.0 Changes

v2.1.0 Changes

v2.0.1 Changes

v2.0.0 Changes

v1.8.0

v1.7.4 Changes

Scrapy changelog

Scrapy, a fast high-level web crawling & scraping framework for Python.

Changelog History Page 1

v1.8.0

Changelog History

Page 1