pyspider v0.3.2 Release Notes
Release Date: 2015-02-11
Scheduler
- The reported size of the task queue is now more accurate, so you can use it to determine whether the scheduler has finished all tasks.
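As a minimal sketch of how a queue size can serve as an all-done signal, the helper below polls a size-reporting callable until it reads zero for several polls in a row. The names `wait_until_idle` and `get_queue_size` and all parameters are hypothetical, chosen for illustration; they are not part of pyspider's API, and how you obtain the queue size from your scheduler is left as an assumption.

```python
import time
from typing import Callable


def wait_until_idle(get_queue_size: Callable[[], int],
                    poll_interval: float = 1.0,
                    stable_polls: int = 3,
                    max_polls: int = 1000) -> bool:
    """Return True once the queue size reads 0 for `stable_polls` polls in a row.

    `get_queue_size` is a hypothetical callable standing in for whatever
    reports the scheduler's task queue size in your deployment.
    """
    zero_streak = 0
    for _ in range(max_polls):
        if get_queue_size() == 0:
            zero_streak += 1
            if zero_streak >= stable_polls:
                return True
        else:
            zero_streak = 0  # new tasks appeared, start counting again
        time.sleep(poll_interval)
    return False  # gave up before the queue stayed empty


if __name__ == "__main__":
    # Simulated readings: the queue drains, refills briefly, then stays empty.
    readings = iter([5, 2, 0, 1, 0, 0, 0])
    print(wait_until_idle(lambda: next(readings), poll_interval=0.0))
```

Requiring several consecutive zero readings guards against a queue that is empty only for a moment while follow-up tasks are still being produced.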
Fetcher
- Fix tornado losing cookies during 30x redirects.
- You can now use cookies and a Cookie header at the same time.
- Fix a bug where the proxy did not work.
- Enable proxy by default.
- Proxy now supports username and password authentication. @soloradish
- ETag and Last-Modified headers are now disabled when the last crawl failed.
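One way to read the last item: the validators saved from a previous response (ETag, Last-Modified) are normally sent back as If-None-Match and If-Modified-Since so the server can answer 304 Not Modified, and skipping them after a failed crawl forces a full fetch. The sketch below illustrates that idea only. `conditional_headers` and its arguments are hypothetical, and this is not pyspider's actual implementation.

```python
from typing import Dict, Optional


def conditional_headers(prev_headers: Optional[Dict[str, str]],
                        prev_ok: bool) -> Dict[str, str]:
    """Build If-None-Match / If-Modified-Since from the previous response.

    Both arguments are hypothetical: `prev_headers` holds the response
    headers saved from the last crawl, `prev_ok` says whether it succeeded.
    """
    if not prev_ok or not prev_headers:
        # Last crawl failed: force a full fetch instead of risking a 304.
        return {}
    # HTTP header names are case-insensitive, so normalize before lookup.
    saved = {name.lower(): value for name, value in prev_headers.items()}
    headers = {}
    if "etag" in saved:
        headers["If-None-Match"] = saved["etag"]
    if "last-modified" in saved:
        headers["If-Modified-Since"] = saved["last-modified"]
    return headers
```

With a successful previous crawl the function returns both conditional headers; with `prev_ok=False` it returns an empty dict, so the next request fetches the page unconditionally.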
Databases
- MySQL: default engine changed to InnoDB. @laapsaap
- MySQL: the result column is now larger, changed to MEDIUMBLOB (up to 16 MB). @laapsaap
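For a sense of what "up to 16M" means in bytes: MySQL's BLOB types hold at most 2^n - 1 bytes, so MEDIUMBLOB tops out at 16,777,215 bytes. The sketch below checks a payload against those limits. `fits_in_column` and the sample payload are hypothetical and not part of pyspider.

```python
# Maximum payload length in bytes for MySQL's BLOB family (2**n - 1).
BLOB_LIMITS = {
    "TINYBLOB": 2**8 - 1,
    "BLOB": 2**16 - 1,
    "MEDIUMBLOB": 2**24 - 1,
    "LONGBLOB": 2**32 - 1,
}


def fits_in_column(payload: bytes, column_type: str = "MEDIUMBLOB") -> bool:
    """Return True if `payload` fits in a column of the given BLOB type."""
    return len(payload) <= BLOB_LIMITS[column_type]


if __name__ == "__main__":
    result = b"x" * 100_000  # a hypothetical ~100 KB serialized result
    print(fits_in_column(result, "BLOB"))
    print(fits_in_column(result, "MEDIUMBLOB"))
```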
WebUI
- WebUI now uses the same arguments as the fetcher, which fixes the bug where the proxy did not work for the WebUI.
- Results are now sorted by updatetime.
One Mode
- Script exception logs are now printed to the screen.
New Command
send_message
You can use the command
pyspider send_message [project] [message]
to send a message to a project from the command line.
Other
- Use locally hosted test web pages.
- Remove the version pin on lxml; you can use apt-get to install any version of lxml.