textacy v0.4.1 Release Notes

Release Date: 2017-07-27 // almost 7 years ago
  • ๐Ÿ”„ Changes:

    • โž• Added key classes to the top-level textacy imports, for convenience:
      • textacy.text_stats.TextStats => textacy.TextStats
      • textacy.vsm.Vectorizer => textacy.Vectorizer
      • textacy.tm.TopicModel => textacy.TopicModel
    • โž• Added tests for textacy.Doc and updated the README's usage example

    ๐Ÿ›  Bugfixes:

    • โž• Added explicit encoding when opening Wikipedia database files in text mode to
      ๐Ÿ resolve an issue when doing so without encoding on Windows (PR #118)
    • Fixed keyterms.most_discriminating_terms to use the vsm.Vectorizer class
      rather than the vsm.doc_term_matrix function that it replaced (PR #120)
    • Fixed mishandling of a couple optional args in Doc.to_terms_list

    Contributors:

    ๐Ÿ›  Thanks to @minketeer and @Gregory-Howard for the fixes!