textacy v0.10.0 Release Notes
Release Date: 2020-03-01
New:
- Added a logo to textacy's documentation and social preview
- Added type hints throughout the code base, for more expressive type indicators in docstrings and for static type checkers used by developers to code more effectively (PR #289)
- Added a preprocessing function to normalize sequences of repeating characters (Issue #275)
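As a rough illustration of what normalizing repeating characters can look like, here is a minimal standard-library sketch. It is not textacy's implementation: the function name `collapse_repeats` and its `chars` and `maxn` parameters are hypothetical, chosen only for this example, since the notes above do not name the new function.

```python
import re


def collapse_repeats(text: str, *, chars: str, maxn: int = 1) -> str:
    """Collapse runs of ``chars`` longer than ``maxn`` down to ``maxn`` repeats.

    Hypothetical helper for illustration; not part of textacy's API.
    """
    # Match maxn + 1 or more back-to-back copies of the (escaped) sequence.
    pattern = re.compile(r"(?:{}){{{},}}".format(re.escape(chars), maxn + 1))
    return pattern.sub(chars * maxn, text)


print(collapse_repeats("Whoa!!!!! So cool", chars="!", maxn=1))
print(collapse_repeats("hahahahaha ok", chars="ha", maxn=2))
```

Runs at or below `maxn` are left alone, so ordinary punctuation such as a single "!" or a deliberate ".." with `maxn=2` passes through unchanged.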
Changed:
- Improved core Corpus functionality using recent additions to spacy (PR #285)
  - Re-implemented Corpus.save() and Corpus.load() using spacy's new DocBin class, which resolved a few bugs/issues (Issue #254)
  - Added n_process arg to Corpus.add() to set the number of parallel processes used when adding many items to a corpus, following spacy's updates to nlp.pipe() (Issue #277)
  - Bumped minimum spaCy version from 2.0.12 => 2.2.0, accordingly
- Added handling for zero-width whitespaces into normalize_whitespace() function (Issue #278)
- Improved a couple rough spots in package administration:
  - Moved package setup information into a declarative configuration file, in an attempt to keep up with evolving best practices for Python packaging
  - Simplified the configuration and interoperability of sphinx + github pages for generating package documentation
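
The zero-width whitespace handling mentioned above can be illustrated with a small standard-library sketch. This is an assumption-laden example, not the body of normalize_whitespace(): the helper name `strip_zero_width` is hypothetical, and the set of characters treated as zero-width (U+200B, U+2060, U+FEFF) is chosen for illustration only.

```python
import re

# Assumed set of zero-width characters: zero-width space, word joiner,
# and zero-width no-break space (also used as a byte-order mark).
ZERO_WIDTH = re.compile("[\u200b\u2060\ufeff]+")
# Runs of spaces, tabs, or non-breaking spaces.
SPACES = re.compile("[ \t\u00a0]+")


def strip_zero_width(text: str) -> str:
    """Drop zero-width characters, then collapse ordinary spacing.

    Hypothetical helper for illustration; not part of textacy's API.
    """
    text = ZERO_WIDTH.sub("", text)
    return SPACES.sub(" ", text).strip()


print(repr(strip_zero_width("Hello,\u200b world!\ufeff  Bye")))
```

Removing the invisible characters before collapsing spaces matters: a zero-width character sitting between two spaces would otherwise keep them from being merged into one.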
Fixed: