Modin 0.5.0
devin-petersohn
released this
06 May 04:51
·
2538 commits
to master
since this release
Modin 0.5.0 release notes
This release includes many major new features and updates.
Bugfixes + Pandas Concordance (🐛 + 🐼)
- Fix loc with MultiIndex (#508)
- #516 Fix duplicated index in concat (#521)
- read_excel(sheet_name=None) not working (#512) (#532)
- Change how describe excludes columns (#535)
- Add "options" support #291 (#538)
- Correct behavior for
read_table
whensep=False
(#547) - Fix
read_csv
whenparse_dates
andindex_col
are the same (#548) - Fix issue where
repr
was not correct after mapreduce operation (#552) - Fix
reset_index
whenname
field of the index is set (#553) - Support for arguments not explicity in the signature for
read_fwf
(#561) - Add
datetime
to top level API. issue: #542 (#564) - Allow
concat
to accept non-subscriptable objects askeys
parameter (#568) - Fix support for
level
parameter in groupby (#575) - Fix numeric_only parameter (#578)
- Set series to dataframe (#545)
- Fix
astype
with"category"
as the type passed (#587)
User experience 👤
- Remove typing dependency (#571)
- Add warning when using the constructor of DataFrame and Series (#572)
- Fix compatibility for Python2 (#606)
New functionality ⭐️
- Add Gandiva as a partition engine for the Ray backend (#489)
- parallel read_sql() using limit and offset (#499)
- Integrate pyarrow's CSV reader into modin (#511)
- Added read_csv support for S3 (#505, #543)
- Distributed Series 🎉 (#522)
- Add parallelism parameter to read_sql() #455 (#594)
Backend enhancements + Performance 🚀
- Add fastrack for empty mask computation (#565)
- Change
QueryCompiler.view
to use index-based lookup (#566)
Dependencies 🔗
- Move
sqlalchemy
import statement in experimental io (#498) - pin pytables version in tests to avoid dependency mismatches (#500)
- Update numpy version to 1.16 (#506)
- Bump pandas version to 0.24.2 (#509)
- Update Ray version to 0.6.6 (#567)
Testing and Code Quality (📈 + 💯)
- Refactor and rename files to be more descriptive (#496)
- Add stress tests for modin (#481)
- Refactor to move all ci related files to ci/ (#479)
- Refactor the QueryCompiler module to separate backends (#510)
- Formatting with black (#527)
- Fix Travis incompatibility (#534)
- Dtype cleanup (#570)
Regressions ↩️
- Fix single-column DataFrame index on MapReduce operations (#580)
- Fix drop after transpose (#582)
- Add support for concat with empty DataFrames and new Series (#584)
- Fix Series.getitem for bool indexers and slices (#591)
- Fix binary operation after transpose (#589)
- Fix indexing on empty_series (#596)
- Correctly compute
reindex
after a transpose (#600) - Correctly apply Series functions element-wise for correct cases (#598)
- Fix regression in constructor for lists/dicts of Series (#602)
- Fix dtype checking if other is a scalar (#604)
Contributors this release
The following users contributed code to Modin since the last release.
@ipacheco-uy (First time contributor) 🌟
@pcmoritz (First time contributor) 🌟
@wuisawesome (First time contributor) 🌟
@williamma12 (Committer)
@eavidan (Committer)
@devin-petersohn (Admin)
🎉🎉 Thank you! 🎉🎉