ZODB v3.9.0 Release Notes

Release Date: 2009-09-08 // over 13 years ago
  • ๐Ÿ†• New Features (in more or less reverse chronological order)

    • The Database class now has an xrefs keyword argument and a corresponding allow-implicit-cross-references configuration option. which default to true. When set to false, cross-database references are disallowed.

    • โž• Added support for RelStorage.

    • As a convenience, the connection root method for returning the root object can now also be used as an object with attributes mapped to the root-object keys.

    • Databases have a new method, transaction, that can be used with the Python (2.5 and later) with statement::

      db = ZODB.DB(...) with db.transaction() as conn: # ... do stuff with conn

    This uses a private transaction manager for the connection. If control exits the block without an error, the transaction is committed, otherwise, it is aborted.

    • Convenience functions ZODB.connection and ZEO.connection provide a convenient way to open a connection to a database. They open a database and return a connection to it. When the connection is closed, the database is closed as well.

    • ๐Ÿ‘ The ZODB.config databaseFrom... methods now support multi-databases. If multiple zodb sections are used to define multiple databases, the databases are connected in a multi-database arrangement and the first of the defined databases is returned.

    • The zeopack script has gotten a number of improvements:

      • Simplified command-line interface. (The old interface is still supported, except that support for ZEO version 1 servers has been dropped.)
      • Multiple storages can be packed in sequence.
      • This simplifies pack scheduling on servers serving multiple databases.
      • All storages are packed to the same time.
      • You can now specify a time of day to pack to.
      • The script will now time out if it can't connect to s storage in 60 seconds.
    • The connection now estimates the object size based on its pickle size and informs the cache about size changes.

    The database got additional configurations options (cache-size-bytes and historical-cache-size-bytes) to limit the cache size based on the estimated total size of cached objects. The default values are 0 which has the interpretation "do not limit based on the total estimated size". There are corresponding methods to read and set the new configuration parameters.

    • Connections now have a public opened attribute that is true when the connection is open, and false otherwise. When true, it is the seconds since the epoch (time.time()) when the connection was opened. This is a renaming of the previous _opened private variable.

    • ๐Ÿ‘ FileStorage now supports blobs directly.

    • You can now control whether FileStorages keep .old files when packing.

    • POSKeyErrors are no longer logged by ZEO servers, because they are really client errors.

    • ๐Ÿ‘ A new storage interface, IExternalGC, to support external garbage collection, http://wiki.zope.org/ZODB/ExternalGC, has been defined and implemented for FileStorage and ClientStorage.

    • โœ… As a small convenience (mainly for tests), you can now specify initial data as a string argument to the Blob constructor.

    • ZEO Servers now provide an option, invalidation-age, that allows quick verification of ZEO clients have been disconnected for less than a given time even if the number of transactions the client hasn't seen exceeds the invalidation queue size. This is only recommended if the storage being served supports efficient iteration from a point near the end of the transaction history.

    • ๐Ÿ‘ The FileStorage iterator now handles large files better. When iterating from a starting transaction near the end of the file, the iterator will scan backward from the end of the file to find the starting point. This enhancement makes it practical to take advantage of the new storage server invalidation-age option.

    • Previously, database connections were managed as a stack. This tended to cause the same connection(s) to be used over and over. For example, the most used connection would typically be the only connection used. In some rare situations, extra connections could be opened and end up on the top of the stack, causing extreme memory wastage. Now, when connections are placed on the stack, they sink below existing connections that have more active objects.

    • ๐Ÿ”ง There is a new pool-timeout database configuration option to specify that connections unused after the given time interval should be garbage collection. This will provide a means of dealing with extra connections that are created in rare circumstances and that would consume an unreasonable amount of memory.

    • ๐Ÿ‘ The Blob open method now supports a new mode, 'c', to open committed data for reading as an ordinary file, rather than as a blob file. The ordinary file may be used outside the current transaction and even after the blob's database connection has been closed.

    • ClientStorage now provides blob cache management. When using non-shared blob directories, you can set a target cache size and the cache will periodically be reduced try to keep it below the target size.

    The client blob directory layout has changed. If you have existing non-shared blob directories, you will have to remove them.

    • ZODB 3.9 ZEO clients can connect to ZODB 3.8 servers. ZODB ZEO clients from ZODB 3.2 on can connect to ZODB 3.9 servers.

    • When a ZEO cache is stale and would need verification, a ZEO.interfaces.StaleCache event is published (to zope.event). Applications may handle this event and take action such as exiting the application without verifying the cache or starting cold.

    • There's a new convenience function, ZEO.DB, for creating databases using ZEO Client Storages. Just call ZEO.DB with the same arguments you would otherwise pass to ZEO.ClientStorage.ClientStorage::

      import ZEO db = ZEO.DB(('some_host', 8200))

    • Object saves are a little faster

    • ๐Ÿ”ง When configuring storages in a storage server, the storage name now defaults to "1". In the overwhelmingly common case that a single storage, the name can now be omitted.

    • FileStorage now provides optional garbage collection. A 'gc' keyword option can be passed to the pack method. A false value prevents garbage collection.

    • The FileStorage constructor now provides a boolean pack_gc option, which defaults to True, to control whether garbage collection is performed when packing by default. This can be overridden with the gc option to the pack method.

    The ZConfig configuration for FileStorage now includes a pack-gc option, corresponding to the pack_gc constructor argument.

    • The FileStorage constructor now has a packer keyword argument that allows an alternative packer to be supplied.

    The ZConfig configuration for FileStorage now includes a packer option, corresponding to the packer constructor argument.

    • ๐Ÿ‘ MappingStorage now supports multi-version concurrency control and iteration and provides a better storage implementation example.

    • DemoStorage has a number of new features:

      • The ability to use a separate storage, such as a file storage to store changes
      • Blob support
      • Multi-version concurrency control and iteration
      • Explicit support for demo-storage stacking via push and pop methods.
    • Wen calling ZODB.DB to create a database, you can now pass a file name, rather than a storage to use a file storage.

    • โž• Added support for copying and recovery of blob storages:

      • Added a helper function, ZODB.blob.is_blob_record for testing whether a data record is for a blob. This can be used when iterating over a storage to detect blob records so that blob data can be copied.

      In the future, we may want to build this into a blob-aware iteration interface, so that records get blob file attributes automatically.

      • Added the IBlobStorageRestoreable interfaces for blob storages that support recovery via a restoreBlob method.
      • Updated ZODB.blob.BlobStorage to implement IBlobStorageRestoreable and to have a copyTransactionsFrom method that also copies blob data.
    • New ClientStorage configuration option drop_cache_rather_verify. If this option is true then the ZEO client cache is dropped instead of the long (unoptimized) verification. For large caches, setting this option can avoid effective down times in the order of hours when the connection to the ZEO server was interrupted for a longer time.

    • Cleaned-up the storage iteration API and provided an iterator implementation for ZEO.

    • ๐Ÿ”– Versions are no-longer supported.

    • ๐Ÿ‘€ Document conflict resolution (see ZODB/ConflictResolution.txt).

    • ๐Ÿ‘Œ Support multi-database references in conflict resolution.

    • ๐Ÿ‘‰ Make it possible to examine oid and (in some situations) database name of persistent object references during conflict resolution.

    • ๐Ÿšš Moved the 'transaction' module out of ZODB. ZODB depends upon this module, but it must be installed separately.

    • ZODB installation now requires setuptools.

    • โž• Added offset information to output of fstail script. Added test harness for this script.

    • โž• Added support for read-only, historical connections based on datetimes or serials (TIDs). See src/ZODB/historical_connections.txt.

    • โœ‚ Removed the ThreadedAsync module.

    • Now depend on zc.lockfile

    ๐Ÿ› Bugs Fixed

    • ๐Ÿ›  CVE-2009-2701: Fixed a vulnerability in ZEO storage servers when blobs are available. Someone with write access to a ZEO server configured to support blobs could read any file on the system readable by the server process and remove any file removable by the server process.

    • BTrees (and TreeSets) kept references to internal keys. https://bugs.launchpad.net/zope3/+bug/294788

    • ๐Ÿ‘ BTree Sets and TreeSets don't support the standard set add method. (Now either add or the original insert method can be used to add an object to a BTree-based set.)

    • ๐Ÿ”ง The runzeo script didn't work without a configuration file. (https://bugs.launchpad.net/zodb/+bug/410571)

    • ๐Ÿ—„ Officially deprecated PersistentDict (https://bugs.launchpad.net/zodb/+bug/400775)

    • Calling setstate on a persistent object could under certain uncommon cause the process to crash. (https://bugs.launchpad.net/zodb/+bug/262158)

    • When committing transactions involving blobs to ClientStorages with non-shared blob directories, a failure could occur in tpc_finish if there was insufficient disk space to copy the blob file or if the file wasn't available. https://bugs.launchpad.net/zodb/+bug/224169

    • ๐Ÿ’พ Savepoint blob data wasn't properly isolated. If multiple simultaneous savepoints in separate transactions modified the same blob, data from one savepoint would overwrite data for another.

    • ๐Ÿ’พ Savepoint blob data wasn't cleaned up after a transaction abort. https://bugs.launchpad.net/zodb/+bug/323067

    • Opening a blob with modes 'r+' or 'a' would fail when the blob had no committed changes.

    • PersistentList's sort method did not allow passing of keyword parameters. Changed its sort parameter list to match that of its (Python 2.4+) UserList base class.

    • Certain ZEO server errors could cause a client to get into a state where it couldn't commit transactions. https://bugs.launchpad.net/zodb/+bug/374737

    • ๐Ÿ›  Fixed vulnerabilities in the ZEO network protocol that allow:

      • CVE-2009-0668 Arbitrary Python code execution in ZODB ZEO storage servers
      • CVE-2009-0669 Authentication bypass in ZODB ZEO storage servers

    The vulnerabilities only apply if you are using ZEO to share a database among multiple applications or application instances and if untrusted clients are able to connect to your ZEO servers.

    • ๐Ÿ›  Fixed the setup test command. It previously depended on private functions in zope.testing.testrunner that don't exist any more.

    • ZEO client threads were unnamed, making it hard to debug thread management.

    • ๐Ÿ‘ ZEO protocol 2 support was broken. This caused very old clients to be unable to use new servers.

    • 0๏ธโƒฃ zeopack was less flexible than it was before. -h should default to local host.

    • 0๏ธโƒฃ The "lawn" layout was being selected by default if the root of the blob directory happened to contain a hidden file or directory such as ".svn". Now hidden files and directories are ignored when choosing the default layout.

    • BlobStorage was not compatible with MVCC storages because the wrappers were being removed by each database connection. Fixed.

    • Saving indexes for large file storages failed (with the error: RuntimeError: maximum recursion depth exceeded). This can cause a FileStorage to fail to start because it gets an error trying to save its index.

    • Sizes of new objects weren't added to the object cache size estimation, causing the object-cache size limiting feature to let the cache grow too large when many objects were added.

    • โœ‚ Deleted records weren't removed when packing file storages.

    • ๐Ÿ›  Fixed analyze.py and added test.

    • ๐Ÿ›  fixed Python 2.6 compatibility issue with ZEO/zeoserverlog.py

    • ๐Ÿ—„ using hashlib.sha1 if available in order to avoid DeprecationWarning under Python 2.6

    • made runzeo -h work

    • The monitor server didn't correctly report the actual number of clients.

    • Packing could return spurious errors due to errors notifying disconnected clients of new database size statistics.

    • ๐Ÿ”ง Undo sometimes failed for FileStorages configured to support blobs.

    • Starting ClientStorages sometimes failed with non-new but empty cache files.

    • The history method on ZEO clients failed.

    • ๐Ÿ›  Fix for bug #251037: Make packing of blob storages non-blocking.

    • ๐Ÿ›  Fix for bug #220856: Completed implementation of ZEO authentication.

    • ๐Ÿ›  Fix for bug #184057: Make initialisation of small ZEO client file cache sizes not fail.

    • ๐Ÿ›  Fix for bug #184054: MappingStorage used to raise a KeyError during load instead of a POSKeyError.

    • ๐Ÿ›  Fixed bug in Connection.TmpStore: load() would not defer to the backend storage for loading blobs.

    • ๐Ÿ›  Fix for bug #181712: Make ClientStorage update lastTransaction directly after connecting to a server, even when no cache verification is necessary.

    • ๐Ÿ›  Fixed bug in blob filesystem helper: the isSecure check was inverted.

    • ๐Ÿ›  Fixed bug in transaction buffer: a tuple was unpacked incorrectly in clear.

    • ๐Ÿ›  Bugfix the situation in which comparing persistent objects (for instance, as members in BTree set or keys of BTree) might cause data inconsistency during conflict resolution.

    • ๐Ÿ›  Fixed bug 153316: persistent and BTrees were using int for memory sizes which caused errors on x86_64 Intel Xeon machines (using 64-bit Linux).

    • ๐Ÿ›  Fixed small bug that the Connection.isReadOnly method didn't work after a savepoint.

    • ๐Ÿ› Bug #98275: Made ZEO cache more tolerant when invalidating current versions of objects.

    • ๐Ÿ›  Fixed a serious bug that could cause client I/O to stop (hang). This was accompanied by a critical log message along the lines of: "RuntimeError: dictionary changed size during iteration".

    • ๐Ÿ›  Fixed bug #127182: Blobs were subclassable which was not desired.

    • ๐Ÿ›  Fixed bug #126007: tpc_abort had untested code path that was broken.

    • ๐Ÿ›  Fixed bug #129921: getSize() function in BlobStorage could not deal with garbage files

    • ๐Ÿ›  Fixed bug in which MVCC would not work for blobs.

    • ๐Ÿ›  Fixed bug in ClientCache that occurred with objects larger than the total cache size.

    • ๐Ÿ”’ When an error occured attempting to lock a file and logging of said error was enabled.

    • FileStorages previously saved indexes after a certain number of writes. This was done during the last phase of two-phase commit, which made this critical phase more subject to errors than it should have been. Also, for large databases, saves were done so infrequently as to be useless. The feature was removed to reduce the chance for errors during the last phase of two-phase commit.

    • File storages previously kept an internal object id to transaction id mapping as an optimization. This mapping caused excessive memory usage and failures during the last phase of two-phase commit. This optimization has been removed.

    • ๐Ÿ”จ Refactored handling of invalidations on ZEO clients to fix a possible ordering problem for invalidation messages.

    • On many systems, it was impossible to create more than 32K blobs. Added a new blob-directory layout to work around this limitation.

    • ๐Ÿ›  Fixed bug that could lead to memory errors due to the use of a Python dictionary for a mapping that can grow large.

    • ๐Ÿ›  Fixed bug #251037: Made packing of blob storages non-blocking.

    • ๐Ÿ›  Fixed a bug that could cause InvalidObjectReference errors for objects that were explicitly added to a database if the object was modified after a savepoint that added the object.

    • ๐Ÿ›  Fixed several bugs that caused ZEO cache corruption when connecting to servers. These bugs affected both persistent and non-persistent caches.

    • ๐Ÿ‘Œ Improved the the ZEO client shutdown support to try to avoid spurious errors on exit, especially for scripts, such as zeopack.

    • Packing failed for databases containing cross-database references.

    • Cross-database references to databases with empty names weren't constructed properly.

    • The zeo client cache used an excessive amount of memory, causing applications with large caches to exhaust available memory.

    • ๐Ÿ›  Fixed a number of bugs in the handling of persistent ZEO caches:

      • Cache records are written in several steps. If a process exits after writing begins and before it is finishes, the cache will be corrupt on restart. The way records are written was changed to make cache record updates atomic.
      • There was no lock file to prevent opening a cache multiple times at once, which would lead to corruption. Persistent caches now use lock files, in the same way that file storages do.
      • A bug in the cache-opening logic led to cache failure in the unlikely event that a cache has no free blocks.
    • When using ZEO Client Storages, Errors occured when trying to store objects too big to fit in the ZEO cache file.

    • ๐Ÿ›  Fixed bug in blob filesystem helper: the isSecure check was inverted.

    • ๐Ÿ›  Fixed bug in transaction buffer: a tuple was unpacked incorrectly in clear.

    • ๐Ÿ›  Fixed bug in Connection.TmpStore: load() would not defer to the back-end storage for loading blobs.

    • ๐Ÿ›  Fixed bug #190884: Wrong reference to POSKeyError caused NameError.

    • ๐Ÿ›  Completed implementation of ZEO authentication. This fixes issue 220856.

    What's new in ZODB 3.8.0


    • ๐Ÿš€ (unreleased) Fixed setup.py use of setuptools vs distutils, so .c and .h files are included in the bdist_egg.

    • The ZODB Storage APIs have been documented and cleaned up.

    • ๐Ÿ—„ ZODB versions are now officially deprecated and support for them will be removed in ZODB 3.9. (They have been widely recognized as deprecated for quite a while.)

    • ๐Ÿ”„ Changed the automatic garbage collection when opening a connection to only apply the garbage collections on those connections in the pool that are closed. (This fixed issue 113923.)


    • (3.8a1) ZEO's strategoes for avoiding client cache verification were improved in the case that servers are restarted. Before, if transactions were committed after the restart, clients that were up to date or nearly up to date at the time of the restart and then connected had to verify their caches. Now, it is far more likely that a client that reconnects soon after a server restart won't have to verify its cache.

    • ๐Ÿ›  (3.8a1) Fixed a serious bug that could cause clients that disconnect from and reconnect to a server to get bad invalidation data if the server serves multiple storages with active writes.

    • (3.8a1) It is now theoretically possible to use a ClientStorage in a storage server. This might make it possible to offload read load from a storage server at the cost of increasing write latency. This should increase write throughput by offloading reads from the final storage server. This feature is somewhat experimental. It has tests, but hasn't been used in production.


    • (3.8a1) Add a doom() and isDoomed() interface to the transaction module.

    First step towards the resolution of http://www.zope.org/Collectors/Zope3-dev/655

    A doomed transaction behaves exactly the same way as an active transaction but raises an error on any attempt to commit it, thus forcing an abort.

    Doom is useful in places where abort is unsafe and an exception cannot be raised. This occurs when the programmer wants the code following the doom to run but not commit. It is unsafe to abort in these circumstances as a following get() may implicitly open a new transaction.

    Any attempt to commit a doomed transaction will raise a DoomedTransaction exception.

    • (3.8a1) Clean up the ZODB imports in transaction.

    Clean up weird import dance with ZODB. This is unnecessary since the transaction module stopped being imported in ZODB/init.py in rev 39622.

    • ๐Ÿšš (3.8a1) Support for subtransactions has been removed in favor of save points.


    • โšก๏ธ (3.8b1) Updated the Blob implementation in a number of ways. Some of these are backward incompatible with 3.8a1:

    o The Blob class now lives in ZODB.blob

    o The blob openDetached method has been replaced by the committed method.

    • ๐Ÿ‘€ (3.8a1) Added new blob feature. See the ZODB/Blobs directory for documentation.

    ZODB now handles (reasonably) large binary objects efficiently. Useful to use from a few kilobytes to at least multiple hundred megabytes.


    • ๐Ÿ‘ (3.8a1) Added support for 64-bit integer BTrees as separate types.

    (For now, we're retaining compile-time support for making the regular integer BTrees 64-bit.)

    • (3.8a1) Normalize names in modules so that BTrees, Buckets, Sets, and TreeSets can all be accessed with those names in the modules (e.g., BTrees.IOBTree.BTree). This is in addition to the older names (e.g., BTrees.IOBTree.IOBTree). This allows easier drop-in replacement, which can especially be simplify code for packages that want to support both 32-bit and 64-bit BTrees.

    • (3.8a1) Describe the interfaces for each module and actually declare the interfaces for each.

    • (3.8a1) Fix module references so klass.module points to the Python wrapper module, not the C extension.

    • (3.8a1) introduce module families, to group all 32-bit and all 64-bit modules.