gensim v4.2.0 Release Notes

Release Date: 2022-04-29 // over 1 year ago
  • :+1: New features

    • #3188: Add get_sentence_vector() to FastText and get_mean_vector() to KeyedVectors, by @rock420
    • ๐Ÿ‘€ #3194: Added random_seed parameter to make LsiModel reproducible, by @parashardhapola
    • #3247: Sparse2Corpus: update getitem to work on slices, lists and ellipsis, by @PrimozGodec
    • #3264: Detect when a fasttext executable is available in PATH, by @pabs3
    • #3271: Added new ValueError in place of assertion error for no model data provided in lsi model, by @mark-todd
    • #3299: Enable test_word2vec_stand_alone_script by using sys.executable for python, by @pabs3
    • #3317: Added encoding parameter to TextDirectoryCorpus, by @Sandman-Ren
    • #2656: Streamlining most_similar_cosmul and evaluate_word_analogies, by @n3hrox

    ๐Ÿ“„ :books: Tutorials and docs

    • ๐Ÿ— #3227: Fix FastText doc-comment example for build_vocab and train to use correct argument names, by @HLasse
    • ๐Ÿ“„ #3235: Fix TFIDF docs, by @piskvorky
    • ๐Ÿ™‹ #3257: Dictionary doc: ref FAQ entry about filter_extremes corpus migration, by @zacchiro
    • ๐Ÿ“„ #3279: Add the FastSS and Levenshtein modules to docs, by @piskvorky
    • ๐Ÿ“š #3284: Documentation fixes + added CITATION.cff, by @piskvorky
    • โœ๏ธ #3289: Typos, text and code fix in LDA tutorial, by @davebulaval
    • ๐Ÿšš #3301: Remove unused Jupyter screenshots, by @pabs3
    • ๐Ÿ“š #3307: Documentation fixes, by @piskvorky
    • ๐Ÿ“œ #3339: Fix parsing error in FastText docs, by @MattYoon
    • #3251: Apply new convention of delimiting instance params in str function, by @menshikh-iv

    ๐Ÿ›  :red_circle: Bug fixes

    ๐Ÿšš :warning: Removed functionality & deprecations

    โœ… ๐Ÿ”ฎ Testing, CI, housekeeping

Previous changes from v4.1.2

  • ๐Ÿ›  This is a bugfix release that addresses left over compatibility issues with older versions of numpy and MacOS.