pyspider v0.3.2 Release Notes

Release Date: 2015-02-11 // about 9 years ago
  • โฑ Scheduler

    • โฑ The size of task queue is more accurate now, you can use it to determine all done status of scheduler.

    Fetcher

    • ๐Ÿ›  Fix tornado loss cookies while doing 30x redirects
    • You can use cookies with cookie header at same time now
    • ๐Ÿ›  Fix proxy not working bug.
    • 0๏ธโƒฃ Enable proxy by default.
    • ๐Ÿ‘ Proxy now support username and password authorization. @soloradish
    • Etag and Last-Modified header will be disabled while last crawl is failed.

    Databases

    • 0๏ธโƒฃ MySQL default engine changed to InnoDB @laapsaap
    • MySQL, larger result column size, changed to MEDIUMBLOB(up to 16M) @laapsaap

    WebUI

    • WebUI will use same arguments as the fetcher, fix proxy not word for webui bug.
    • โšก๏ธ Results will be sorted in the order of updatetime.

    One Mode

    • ๐Ÿ”Š Script exception logs would be printed to screen

    ๐Ÿ†• New Command send_message

    You can use the command pyspider send_message [project] [message] to send a message to project via command-line.

    Other

    • โœ… Using localhosted test web pages
    • โœ‚ Remove version specify of lxml, you can use apt-get to install any version of lxml