All Versions
8
Latest Version
Avg Release Cycle
-
Latest Release
-
Changelog History
Changelog History
-
v0.3.0 Changes
๐ Does not released yet
- ๐ท
BaseDownloadWorker.get_one(self, request)
->BaseDownloadWorker.process(self, request)
BaseDownloader.get(self, requests)
->BaseDownloader.process(self, crawler, request)
- non
BaseRequest
instances are items BaseHttpResponse.get_request()
instead ofBaseHttpResponse.request
property- ๐ป BaseMiddleware.process_(request|response|exception) now may be awaitable
- BaseCralwer.(process|next_requests|on_processing_done) now may be awaitable
- BaseDownloader.(start|stop) now may be awaitable
BaseDownloader.prepare(self)
renamed toBaseDownloader.start(self,crawler)
- BasePipeline.(start|stop|process) now may be awaitable
- ๐จ process
Future
objects without wrapping it withPlanned
object - asynciotools with async/await syntax
- removed
RUNNING
,CANCELLED_AND_NOTIFIED
,exception
method fromPlanned
class
- ๐ท
-
v0.2.1 Changes
๐ Released at 2016-09-12
- ๐ fix pipe start/stop/process exception processing
- fix:
AioPomp
ensure_future for process_requests in main loop - ๐จ
BaseCrawler.on_processing_done
when request, middlewares, response, extract, pipelines processing was done - count param for
BaseQueue.get_requests
- number allowed concurrent requests for current downloader, may be None
-
v0.2.0 Changes
๐ Released at 2016-03-01
- ๐ middlewares logic moved to engine and BaseMiddleware.process_(request|response|exception) with crawler and downloader params
- Downloader does not have any more middlewares param
- BaseDownloaderMiddleware renamed to BaseMiddleware
- BaseDownloderException renamed to BaseCrawlException
- ๐ AioConcurrentCrawler with asyncio and concurrent futures support
- ๐ pomp.core.item moved to pomp.contrib.item without backward compatibility https://github.com/estin/pomp/issues/6#issuecomment-172342598
- ๐ AioPomp with asyncio support
- queue semaphore to prevent fetching more requests than downloader can process now
- ๐ง bfo and dfo orders now configured via Pomp constructor
- no recursion, internal queue is used
- twsited is not a part of pomp
- defer object replaced to Planned object (like Future object)
-
v0.1.3 Changes
๐ Will not be released
- crawler
extract_items
method can yield next requests - process exception on
extract_items
andnext_requests
by downloader middlewares - ๐ป store result of calling
sys.exc_info
in exception instance
- crawler
-
v0.1.2 Changes
๐ Released at 2015-05-08
- โฑ twistedtools on request timeout can raise ResponseNeverReceived or others (see _newcleint twisted api)...
- utils.DeferredList works with deferred and not-deferred objects
- ๐ fixed Item object behavior (get item field value)
-
v0.1.1 Changes
๐ Released on December 12nd 2013
- urllib and twisted downloader on get method yield result
- ๐ bugfix
depth first
method - concurrenttools change try/finally to generator behavior
- ๐จ processing requests through queue
- ๐ better generator usage
-
v0.1.0 Changes
๐ Released on June 14nd 2013
- ๐ rename SimpleDownloader to UrllibDownloader
- urllib code now in contrib/urllibtools.py
- ENTRY_URL renamed to ENTRY_REQUESTS
- next_url renamed to next_requests
- ๐ async support
- ๐ Twisted support
- ๐ concurrent future support
- pipelines accept crawler on start/stop/process calls
- ๐ป downloader middleware also process exception
-
v0.0.2 Changes
๐ First public preview release.