Simplemma v0.4.0 Release Notes
-
- 🆕 new languages: Armenian, Greek, Macedonian, Norwegian (Bokmål), and Polish
- language data reviewed for: Dutch, Finnish, German, Hungarian, Latin, Russian, and Swedish
- 🚚 Urdu removed of language list due to issues with the data
- ➕ add support for Python 3.10 and drop support for Python 3.4
- 👌 improved decomposition and tokenization algorithms