Selected Tags
Click on a tag to remove itMore Tags
Click on a tag to add it and filter down-
HTML
-
Internet
-
HTTP
-
WWW
-
XML
-
HTML Manipulation
-
Markdown
-
Scientific
-
Web Content Extracting
-
Web Crawling
-
Utilities
-
Specific Formats Processing
-
Linguistic
-
Engineering
-
Multimedia
-
Dynamic Content
-
Printing
-
Information Analysis
-
Filters
-
Documentation
-
Indexing
-
Parser
-
Web Scraping
-
General
-
Scraping
-
Education
-
Template Engine
-
Graphics
-
Site Management
Markup packages
Showing projects tagged as Text Processing and Markup
-
Pattern
9.0 0.0 L2 PythonWeb mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization. -
xmltodict
8.0 2.8 L4 PythonPython module that makes working with XML feel like you are working with JSON -
Python-Markdown
7.7 7.5 PythonA Python implementation of John Gruber’s Markdown with Extension support. -
trafilatura
6.7 8.8 PythonPython & command-line tool to gather text on the Web: Crawling & scraping, content extraction, metadata. TXT, Markdown, CSV & XML output. -
aeneas
6.4 0.0 L3 Pythonaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment) -
html5lib
5.2 4.1 L2 PythonStandards-compliant library for parsing and serializing HTML documents and fragments in Python -
selectolax
4.4 7.3 CythonPython binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors). -
htmldate
2.1 7.6 PythonFast and robust date extraction from web pages, with Python or on the command-line
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.