v23.02.00
github-actions
released this
08 Mar 16:32
·
130 commits
to refs/heads/main
since this release
What’s Changed
⚠ Breaking Changes
- Remove use of
is_list
/is_ragged
and replace with setting shapes @karlhigley (#215) - Add a new
shape
field toColumnSchema
@karlhigley (#195)
🐜 Bug Fixes
- Save schema with consistent dtype when
dtypes
is used @oliverholworthy (#182)
🚀 Features
- Update HAS_GPU variable to account for
CUDA_VISIBLE_DEVICES
@oliverholworthy (#221) - Clean up of make_df function @jperez999 (#205)
- separate cupy import from rapids @jperez999 (#211)
- Support partially specified value_count when used with
is_ragged=False
@oliverholworthy (#213) - Fix for updated versions of cudf to parquet @jperez999 (#204)
- Create standard Merlin dtypes in the
merlin.dtypes
module @karlhigley (#170)
🔧 Maintenance
- Remove use of
is_list
/is_ragged
and replace with setting shapes @karlhigley (#215) - Reduce the overhead of using
LocalExecutor
(esp. dtype validation) @karlhigley (#219) - Clean up of make_df function @jperez999 (#205)
- Add util functions for un/grouping column values/offsets in dicts @karlhigley (#216)
- Fill in some missing docstrings @karlhigley (#217)
- Serialize shapes to and from Merlin schema files @karlhigley (#214)
- Fix for updated versions of cudf to parquet @jperez999 (#204)
- add gcp label to jenkinsfile @AyodeAwe (#181)
- Add a new
shape
field toColumnSchema
@karlhigley (#195) - Increase upper bound of
pandas
version from 1.4 to 1.6 @oliverholworthy (#210) - Update pre-commit config with latest versions of repos @oliverholworthy (#208)
- Install latest version of NVTabular/dataloader with systems tests @oliverholworthy (#209)
- Add note on why we're using
device_get_count
instead ofcuda.gpus
@oliverholworthy (#207) - Add Formatter (Prettier) for YAML and Markdown files @karlhigley (#199)
- Change the name of the package building action @karlhigley (#198)
- Split CPU tests and building packages for release into separate actions @karlhigley (#197)
- Simplify
ColumnSchema.with
methods usingdataclasses.replace()
@karlhigley (#194) - Handle executor transform case when parent node provides no new columns @oliverholworthy (#226)
- Update Models/NVTabular test config @oliverholworthy (#185)
- skip notebook tests in models test @edknv (#193)
- add a build pandas column api for easier multihot column creation @jperez999 (#183)
- Use pre-commit for linting in GitHub Actions Workflow @oliverholworthy (#184)
- Convert to cudf.Series in create_multihot_col @oliverholworthy (#187)
- adding workflow for GPU CI on gha @jperez999 (#191)