What's Changed
- Added optional support for two new gradient boosting ML libraries: XGBoost and LightGBM. You can read more about these libraries and how to install them with their dependencies in the docs here. PR #165
- Added a new `hlink.linking.transformers.RenameVectorAttributes` transformer which can rename the attributes or "slots" of Spark vector columns. Hlink uses this to support LightGBM, which disallows certain characters in its feature names. PR #165
- Documented comparisons, which are not the same as comparison features. Previously the documentation was misleading and seemed to indicate that these were the same thing. PR #159
- Fixed a bug in the substitution file documentation. The documentation had the meaning of the substitution file columns flip-flopped, which was confusing. PR #166
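As a rough illustration of why attribute renaming helps with LightGBM, the sketch below replaces characters in feature names before they would reach the model. It is plain Python and does not use hlink or Spark. The function name `sanitize_feature_names`, the `DISALLOWED_CHARS` set, and the sample feature names are all assumptions made for this example; they are not hlink's API or LightGBM's documented rules.

```python
# Hypothetical illustration only: this is not hlink's implementation or API.
# The characters below are an assumed example set, not LightGBM's official list.
DISALLOWED_CHARS = '",:[]{}'


def sanitize_feature_names(names, replacement="_"):
    """Return copies of the names with each disallowed character replaced."""
    table = str.maketrans({ch: replacement for ch in DISALLOWED_CHARS})
    return [name.translate(table) for name in names]


# Made-up feature names, similar in spirit to the slot names of a vector column.
names = ["namelast_jw", "region[1]", "interaction:age,sex"]
print(sanitize_feature_names(names))
```

Names without any disallowed characters pass through unchanged, so a step like this can be applied to every feature name without special-casing.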
Developer-Facing Changes
- Updated Sphinx to 8.1.3 and fixed two Sphinx build warnings. PR #159
- Updated CI/CD to automatically run only on PRs and on pushes to main. You can also now manually trigger a CI/CD run from the Actions tab in GitHub. Also removed the custom "quickcheck" pytest marker in favor of using `pytest -k`, and removed flake8 from CI/CD because it kept causing more trouble than it was worth. PR #164
Full Changelog: v3.7.0...v3.8.0