Skip to content

Modin 0.23.0rc0

Pre-release
Pre-release
Compare
Choose a tag to compare
@anmyachev anmyachev released this 17 Jun 01:23
· 404 commits to master since this release
0.23.0rc0
e1d4241

This release includes support for pandas 2.0, '.corr' speed-up, new features and bug fixes.

Note: this is a release candidate. If everything goes well, we'll release Modin 0.23.0 in two weeks.

Key Features and Updates Since 0.22.0

  • Stability and Bugfixes
    • FIX-#3371: Remove pandas patch level pin (#6211)
    • FIX-#4954: Defaults to pandas in read_json in case of rows having different columns (#5946)
    • FIX-#6215: Process '.corr(numeric_only=False)' parameter at the qc level (#6242)
    • FIX-#6218: Fix read_excel and unpin openpyxl (#6247)
    • FIX-#6232: Support DataFrame.cov(numeric_only=False) without fallback to pandas (#6262)
    • FIX-#6237: Log errors only from deepest modin layer (#6238)
    • FIX-#6245: Support datetime64 with different resolutions types for HDK (#6255)
    • FIX-#6246: Fix 'groupby(..., as_index=False).agg(...)' case (#6263)
    • FIX-#6258: Fix series to_dict (#6260)
    • FIX-#6259: Fix astype("category") causing read-only buffer error (#6267)
    • FIX-#6273: Fix DataFrame.min/max/mean/median/skew/kurt with axis=None (#6275)
  • Performance enhancements
    • PERF-#4560: Implement '.corr()' method using MapReduce pattern (#6193)
  • New Features
    • FEAT-#5759: Implement lazy Arrow execution for the HDK engine (#6251)
    • FEAT-#5936: Support pandas 2.0.2 (#5995)
    • FEAT-#6048: Add wait method for Dask/Ray/Unidist wrappers (#6049)
    • FEAT-#6253: Add 'dtype_backend' parameter support for read_parquet/read_feather (#6264)
    • FEAT-#6256: HDK: Add support for DataFrameGroupBy.head/tail() (#6257)

Contributors

@AndreyPavlenko
@YarShev
@anmyachev
@dchigarev
@mvashishtha
@vnlitvinov