Bytewax v0.9.0 Release Notes

    • ➕ Adds bytewax.AdvanceTo and bytewax.Emit to control when processing happens.

    • ➕ Adds bytewax.run_main() as a way to test input and output builders without starting a cluster.

    • ➕ Adds a bytewax.testing module with helpers for testing.

    • bytewax.run_cluster() and bytewax.spawn_cluster() now take an mp_ctx argument that lets you change the multiprocessing behavior, e.g. from "fork" to "spawn". The default is now "spawn".

    • ➕ Adds dataflow recovery capabilities. See bytewax.recovery.

    • Stateful operators bytewax.Dataflow.reduce() and bytewax.Dataflow.stateful_map() now require a step_id argument to handle recovery.

    • 🔧 Execution entry points now take configuration arguments as kwargs.
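
For the mp_ctx argument mentioned above, the following is a minimal sketch of how a "spawn" multiprocessing context can be obtained from the Python standard library. It assumes that mp_ctx accepts a context object of the kind returned by multiprocessing.get_context(); these notes do not spell out the expected type, so treat the commented-out call at the end as illustrative rather than as the confirmed signature.

```python
import multiprocessing

# Standard-library call: returns a context object bound to one start method.
# "spawn" is available on every platform; "fork" is not available on Windows.
spawn_ctx = multiprocessing.get_context("spawn")

# The context reports which start method it is bound to.
start_method = spawn_ctx.get_start_method()
print(start_method)

# Assumption (not confirmed by these notes): a context object like spawn_ctx
# is what the mp_ctx argument expects, along the lines of
#   bytewax.run_cluster(..., mp_ctx=spawn_ctx)
# The remaining arguments are elided because the notes do not list them.
```

With "spawn", each child process starts a fresh interpreter rather than inheriting the parent's memory, so anything handed to the child processes generally has to be picklable and importable from a module.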