Stanza v0.1.2 Release Notes
Release Date: 2019-02-26 // about 5 years ago-
๐ This is a maintenance release of stanfordnlp. This release features:
- ๐ Allowing the tokenizer to treat the incoming document as pretokenized with space separated words in newline separated sentences. Set
tokenize_pretokenized
toTrue
when building the pipeline to skip the neural tokenizer, and run all downstream components with your own tokenized text. (#24, #34) - Speedup in the POS/Feats tagger in evaluation (up to 2 orders of magnitude). (#18)
- ๐ Various minor fixes and documentation improvements
We would also like to thank the following community members for their contribution:
Code improvements: @lwolfsonkin
๐ Documentation improvements: @0xflotus
And thanks to everyone that raised issues and helped improve stanfordnlp! - ๐ Allowing the tokenizer to treat the incoming document as pretokenized with space separated words in newline separated sentences. Set