Modin 0.23.0rc0
Pre-release
Pre-release
This release includes support for pandas 2.0, '.corr' speed-up, new features and bug fixes.
Note: this is a release candidate. If everything goes well, we'll release Modin 0.23.0 in two weeks.
Key Features and Updates Since 0.22.0
- Stability and Bugfixes
- FIX-#3371: Remove pandas patch level pin (#6211)
- FIX-#4954: Defaults to pandas in
read_json
in case of rows having different columns (#5946) - FIX-#6215: Process '.corr(numeric_only=False)' parameter at the qc level (#6242)
- FIX-#6218: Fix
read_excel
and unpinopenpyxl
(#6247) - FIX-#6232: Support DataFrame.cov(numeric_only=False) without fallback to pandas (#6262)
- FIX-#6237: Log errors only from deepest modin layer (#6238)
- FIX-#6245: Support datetime64 with different resolutions types for HDK (#6255)
- FIX-#6246: Fix 'groupby(..., as_index=False).agg(...)' case (#6263)
- FIX-#6258: Fix series to_dict (#6260)
- FIX-#6259: Fix astype("category") causing read-only buffer error (#6267)
- FIX-#6273: Fix DataFrame.min/max/mean/median/skew/kurt with axis=None (#6275)
- Performance enhancements
- New Features
- FEAT-#5759: Implement lazy Arrow execution for the HDK engine (#6251)
- FEAT-#5936: Support pandas 2.0.2 (#5995)
- FEAT-#6048: Add
wait
method for Dask/Ray/Unidist wrappers (#6049) - FEAT-#6253: Add 'dtype_backend' parameter support for read_parquet/read_feather (#6264)
- FEAT-#6256: HDK: Add support for DataFrameGroupBy.head/tail() (#6257)
Contributors
@AndreyPavlenko
@YarShev
@anmyachev
@dchigarev
@mvashishtha
@vnlitvinov