Selected Tags
Click on a tag to remove itMore Tags
Click on a tag to add it and filter downWWW packages
Showing projects tagged as Text Processing, Markup, and WWW
-
Pattern
8.8 0.0 L2 PythonWeb mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization. -
Python-Markdown
7.8 7.3 PythonA Python implementation of John Gruber’s Markdown with Extension support. -
trafilatura
7.5 7.2 PythonPython & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML -
selectolax
5.0 9.0 CythonPython binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors). -
htmldate
2.3 4.8 PythonFast and robust date extraction from web pages, with Python or on the command-line -
GoBeautifulSoup
0.3 3.3 PythonGoBeautifulSoup is a high-performance HTML/XML parsing library that provides a 100% compatible API with BeautifulSoup4, but powered by Go for dramatically improved performance. It's designed as a drop-in replacement for BeautifulSoup4 with significant speed improvements.
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.