
Modin 0.5.0

@devin-petersohn released this 06 May 04:51
09ff0c2

Modin 0.5.0 release notes

This release includes many major new features and updates.

Bugfixes + Pandas Concordance (🐛 + 🐼)

  • Fix loc with MultiIndex (#508)
  • #516 Fix duplicated index in concat (#521)
  • Fix read_excel(sheet_name=None) not working (#512) (#532)
  • Change how describe excludes columns (#535)
  • Add "options" support #291 (#538)
  • Correct behavior for read_table when sep=False (#547)
  • Fix read_csv when parse_dates and index_col are the same (#548)
  • Fix issue where repr was not correct after mapreduce operation (#552)
  • Fix reset_index when name field of the index is set (#553)
  • Support for arguments not explicitly in the signature for read_fwf (#561)
  • Add datetime to top-level API (issue #542) (#564)
  • Allow concat to accept non-subscriptable objects as keys parameter (#568)
  • Fix support for level parameter in groupby (#575)
  • Fix numeric_only parameter (#578)
  • Set series to dataframe (#545)
  • Fix astype with "category" as the type passed (#587)

User experience 👤

  • Remove typing dependency (#571)
  • Add warning when using the constructor of DataFrame and Series (#572)
  • Fix compatibility for Python2 (#606)

New functionality ⭐️

  • Add Gandiva as a partition engine for the Ray backend (#489)
  • Parallel read_sql() using limit and offset (#499)
  • Integrate pyarrow's CSV reader into modin (#511)
  • Added read_csv support for S3 (#505, #543)
  • Distributed Series 🎉 (#522)
  • Add parallelism parameter to read_sql() #455 (#594)
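
The limit/offset approach named in the read_sql() item above can be illustrated with a small standalone sketch. This is not Modin's implementation: it uses only the standard-library sqlite3 module, and the helper name partition_queries, the table t, and the partition count are hypothetical choices made for the example. It shows how one query might be split into LIMIT/OFFSET sub-queries that could each be handed to a separate worker.

```python
import sqlite3


def partition_queries(query, total_rows, num_partitions):
    """Split `query` into LIMIT/OFFSET sub-queries that together cover
    `total_rows` rows. Hypothetical helper, for illustration only."""
    if total_rows <= 0:
        return []
    chunk = -(-total_rows // num_partitions)  # ceiling division
    return [
        f"SELECT * FROM ({query}) LIMIT {chunk} OFFSET {offset}"
        for offset in range(0, total_rows, chunk)
    ]


# Small in-memory table to run the sub-queries against.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, val TEXT)")
conn.executemany(
    "INSERT INTO t VALUES (?, ?)", [(i, f"row{i}") for i in range(10)]
)

# A stable ORDER BY keeps the chunks from overlapping or skipping rows.
base = "SELECT id, val FROM t ORDER BY id"
total = conn.execute(f"SELECT COUNT(*) FROM ({base})").fetchone()[0]

queries = partition_queries(base, total, 4)

# Here the chunks run sequentially; a parallel reader would dispatch each
# sub-query to its own worker and concatenate the partial results.
rows = [row for q in queries for row in conn.execute(q).fetchall()]
full = conn.execute(base).fetchall()
```

Each sub-query reads a disjoint slice of the result, so the slices can be fetched independently and combined afterwards. The number of slices would presumably correspond to the parallelism setting mentioned in #594, though the exact parameter name and semantics are not stated in these notes.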

Backend enhancements + Performance 🚀

  • Add fast track for empty mask computation (#565)
  • Change QueryCompiler.view to use index-based lookup (#566)

Dependencies 🔗

  • Move sqlalchemy import statement in experimental io (#498)
  • Pin pytables version in tests to avoid dependency mismatches (#500)
  • Update numpy version to 1.16 (#506)
  • Bump pandas version to 0.24.2 (#509)
  • Update Ray version to 0.6.6 (#567)

Testing and Code Quality (📈 + 💯)

  • Refactor and rename files to be more descriptive (#496)
  • Add stress tests for modin (#481)
  • Refactor to move all CI-related files to ci/ (#479)
  • Refactor the QueryCompiler module to separate backends (#510)
  • Formatting with black (#527)
  • Fix Travis incompatibility (#534)
  • Dtype cleanup (#570)

Regressions ↩️

  • Fix single-column DataFrame index on MapReduce operations (#580)
  • Fix drop after transpose (#582)
  • Add support for concat with empty DataFrames and new Series (#584)
  • Fix Series.getitem for bool indexers and slices (#591)
  • Fix binary operation after transpose (#589)
  • Fix indexing on empty_series (#596)
  • Correctly compute reindex after a transpose (#600)
  • Correctly apply Series functions element-wise for correct cases (#598)
  • Fix regression in constructor for lists/dicts of Series (#602)
  • Fix dtype checking if other is a scalar (#604)

Contributors this release

The following users contributed code to Modin since the last release.

@ipacheco-uy (First time contributor) 🌟
@pcmoritz (First time contributor) 🌟
@wuisawesome (First time contributor) 🌟
@williamma12 (Committer)
@eavidan (Committer)
@devin-petersohn (Admin)

🎉🎉 Thank you! 🎉🎉