All Versions
12
Latest Version
Avg Release Cycle
-
Latest Release
-
Changelog History
Page 2
Changelog History
Page 2
-
v0.2.0 Changes
- Cleaner and more efficient filtering
- Helper functions to scrub, clean and normalize
- โ Removed two dependencies with more extensive usage of urllib.parse
-
v0.1.0 Changes
- Cleaning and filtering targeting non-spam HTML pages with primarily text
- URL validation
- Sampling by domain name
- Command-line interface (CLI) and Python tool