Web Crawling packages

Showing projects tagged as Web Crawling

  • Scrapy

    9.9 9.5 L4 Python
    A fast high-level screen scraping and web crawling framework.
  • you-get

    9.8 8.2 L2 Python
    A YouTube/Youku/Niconico video downloader written in Python 3.
  • pyspider

    9.7 9.2 L3 Python
    A powerful spider system.
  • requests-html

    9.3 4.6 HTML
    Pythonic HTML Parsing for Humans™
  • portia

    9.2 3.3 L2 JavaScript
    Visual scraping for Scrapy.
  • RoboBrowser

    7.9 0.0 L4 Python
    A simple, Pythonic library for browsing the web without a standalone web browser.
  • MechanicalSoup

    7.7 8.4 L4 Python
    A Python library for automating interaction with websites.
  • cola

    7.0 0.0 L3 Python
    A distributed crawling framework.
  • gain

    6.7 0.0 Python
    Web crawling framework based on asyncio.
  • Grab

    6.7 5.1 L3 Python
    Site scraping framework.
  • Scrapely

    6.6 3.2 HTML
    A pure-python HTML screen-scraping library
  • PSpider

    6.6 6.8 Python
    A simple web spider frame written by Python, which needs Python3.5+
  • feedparser

    5.3 6.3 L3 Python
    Universal feed parser.
  • Sukhoi

    4.6 0.0 Python
    Minimalist and powerful Web Crawler.
  • MSpider

    4.4 0.0 Python
    Spider
  • Goose3

    3.1 4.6 HTML
    A Python 3 compatible version of goose
  • spidy Web Crawler

    2.9 2.1 Python
    The simple, easy to use command line web crawler.
  • Crawley

    2.7 0.0 Python
    Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.
  • selectolax

    2.7 6.4 Python
    Python bindings to Modest engine (very fast HTML5 parser with CSS selectors).
  • brownant

    2.6 0.0 Python
    Brownant is a web data extracting framework.
  • gazpacho

    2.5 7.7 Python
    🥣 Web scraping with pure python
  • Demiurge

    2.0 0.0 L5 Python
    PyQuery-based scraping micro-framework.
  • Pomp

    1.5 0.0 L5 Python
    Web crawling framework inspired by and similar to Scrapy
  • Atoma

    1.1 6.4 Python
    Atom, RSS and JSON feed parser for Python 3
  • FastImage

    0.8 0.0 L4 Python
    Python library that finds the size / type of an image given its URI by fetching as little as needed
  • Mariner

    0.5 6.8 Python
    Command line torrent searcher.