AWS Data Wrangler v1.1.0 Release Notes

Release Date: 2020-05-05
  • New Functionalities

    • Support for nested arrays and structs on wr.s3.to_parquet() #206
    • Support for reading Parquet/Athena/Redshift in chunks of a given number of rows #192
    • Add custom_classifications to wr.emr.create_cluster() #193
    • Support for Docker on EMR #193
    • Add kms_key_id, max_file_size, region arguments to wr.db.unload_redshift() #197
    • Add catalog_versioning argument to wr.s3.to_csv() and wr.s3.to_parquet() #198
    • Add keep_files and ctas_temp_table_name arguments to wr.athena.read_sql_*() #203
    • Add replace_filenames argument to wr.s3.copy_objects() #215
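
As an illustration of what "chunked by number of rows" (#192) means for a reader, here is a minimal sketch in plain Python. It is not the library's implementation, and the names iter_row_chunks and chunk_rows are hypothetical, chosen only for this example; see the API reference for the actual reader arguments.

```python
from itertools import islice
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def iter_row_chunks(rows: Iterable[T], chunk_rows: int) -> Iterator[List[T]]:
    """Yield consecutive lists holding at most `chunk_rows` rows each.

    Hypothetical helper for illustration only (not part of awswrangler).
    """
    if chunk_rows <= 0:
        raise ValueError("chunk_rows must be a positive integer")
    iterator = iter(rows)
    while True:
        # Pull up to `chunk_rows` rows without materializing the whole input.
        chunk = list(islice(iterator, chunk_rows))
        if not chunk:
            return
        yield chunk


if __name__ == "__main__":
    # Seven rows read three at a time: two full chunks and one partial chunk.
    for chunk in iter_row_chunks(range(7), 3):
        print(chunk)
```

Only one chunk is held in memory at a time, which is the point of reading a large table or query result by number of rows instead of loading it whole.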

    Enhancements

    • wr.s3.to_csv() and wr.s3.to_parquet() no longer need the delete-table permission to overwrite a catalog table #198
    • Added support for UUID on wr.db.read_sql_query() (PostgreSQL) #200
    • Refactoring of Athena encryption and workgroup support #212

    Bug Fix

    • Support for reading fully NULL columns from PostgreSQL, MySQL, and Redshift #218

    Thanks

    We thank the following contributors/users for their work on this release:

    @robkano, @luigift, @parasml, @OElesin, @jar-no1, @keatmin, @pmleveque, @sapientderek, @jadayn, @igorborgest.


    P.S. Lambda Layer's zip-file and Glue's wheel/egg are available below. Just upload them and run!

    P.P.S. AWS Data Wrangler relies on compiled dependencies (C/C++), so Glue PySpark is not supported for now (only Glue Python Shell).