spaCy v2.1.3 Release Notes

Release Date: 2019-03-23 // about 5 years ago
  • ๐Ÿฑ โœจ New features and improvements

    • ๐Ÿ‘ Allow customizing punctuation characters in sentencizer and make it serializable.
    • โž• Add new "bow" architecture for TextCategorizer, to do faster bag-of-words text classification.

    ๐Ÿฑ ๐Ÿ”ด Bug fixes

    • ๐Ÿ›  Fix issue #3433, #3458: Fix mismatch of classes in parser after serialization.
    • ๐Ÿ›  Fix issue #3464: Fix training loop in train_textcat.py example.
    • Fix issue #3468: Make sentencizer set Token.is_sent_start correctly.
    • ๐Ÿ›  Fix bug in the "ensemble" TextClassifier architecture that prevented the unigram bag-of-words submodel from working properly.

    ๐Ÿ‘ฅ Contributors

    Thanks to @chkoar for the pull request!