PyText v0.2.2 Release Notes

Release Date: 2019-08-15
  • Note: this is the last release with _Deprecated classes. Those classes will be removed in the next release.

    New Features:

    • DeepCNN Representation for word tagging
    • Combine KLDivergenceBCELoss with SoftHardBCELoss and F.cross_entropy() in CrossEntropyLoss (#689)
    • add dense feature support for doc model (#710)
    • add torchscript quantization support in pytext
    • pytext multi-label support (#731)
    • open source transformer representations (#736)
    • open source transformer based models - data, tensorizers and tokenizer (#708)
    • Create AlternatingRandomizedBatchSampler (#737)
    • open source MaskedLM and BERT models (#734)
    • Support bytes input in word tagging model OSS (#745)
    • open source extractive question answering models (#742)
    • torchscriptify for ensemble task
    • enabled lmlstm labels exporting (#767)
    • Enable dense features in ByteTokensDocumentModel (#763)
    • created bilstm dropout condition (#769)
    • enabled lmlstm caffe2 exporting (#766)
    • PolynomialDecayScheduler (#791)
    • removed bilstm dependence on seq_lengths (#776)
    • fp16 optimizer (#782)
    • Add Dense Feature Normalization to FloatListTensorizer and DocModel (#859)
    • Add Sparsifier component to PyText and L0-projection based sparsifier (#860)
    • implemented cnn pooling for doc classification (#872)
    • implemented bottleneck separable convolutions (#855)
    • Add eps to Adam (#858)
    • implemented mobile exporter (#785)
    • support starting training from saved checkpoint (#824)
    • implemented separable convolutions (#830)
    • implemented gelu activations (#829)
    • implemented causal convolutions (#811)
    • implemented dilation for convolutions (#810)
    • created weight norm option (#809)
    • Ordered Neuron LSTM (#854)
    • Add PersonalizedByteDocModel (#816)
    • CNN based language models (#827)
    • improve csv support in TSVDataSource (#777)
    • Change default batch sampler in DisjointMultitaskData to RoundRobinBatchSampler (#802)
    • Support using serialized pretrained embedding file (#797)

    Documentation / Usability / Logging:

    • Fewer out-of-vocab print messages, with some stats (#697)
    • Echo epoch number to console while training (#712)
    • Separate timing for prediction and metric calculation. (#738)
    • multi-label soft metrics (#754)
    • changed lm metric reporting (#765)
    • fix data source tutorial (#762)
    • fix doc sphinx deprecation warning (#775)
    • Add the ability to pass parameter values to gen-default-config (#856)
    • Remove "pytext/" from paths in demo json config (#878)
    • New documentation about hacking pytext and dealing with github. (#862)
    • install_deps supports updates (#863)
    • Reduce number of PEP print (#861)
    • better error message for config with unknown component (#801)
    • Add Recall at Precision Thresholds to Config (#792)
    • implemented perplexity reductions for lm score reporting (#799)
    • adapt prediction workflow to new design (#746)

    Bug fixes:

    • block sharded tsv eval/test fix (#698)
    • Fix BoundaryPooling tracing (#713)
    • fixes LMLSTM weight tying bug (#704)
    • Fix duplicate entries in vocab (#721)
    • Bugfix for trainer not reporting eval results (#740)
    • Reintroduce metrics export in new task (#748)
    • fix open source tests (#750)
    • Fix missing init_tensorizers arg (#893)
    • Fix intent slot metric reporter not working with byte offset (#883)
    • Fix issue with some tensorizers still re-initializing vocab when loaded from saved state (#848)
    • fixed overflow error in lm reporting (#831)
    • fix BlockShardedTSVDataSource (#832)

    v0.2.1
    (skipped because of packaging issues)