Distribution data check #21
Left some comments; the docstrings seem to be generally incorrect. Also, we should run this data check on the overall data, not just the target values.
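As a rough illustration of checking every column rather than only the target, here is a minimal standalone sketch. It is not the checkmates implementation: the helper names `skewness` and `skewed_columns`, the dict-of-lists table, and the threshold of 1.0 are all assumptions made for the example.

```python
def skewness(values):
    """Population (Fisher-Pearson) skewness: m3 / m2**1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    if m2 == 0:
        return 0.0  # constant column: treat as not skewed
    return m3 / m2 ** 1.5


def skewed_columns(table, threshold=1.0):
    """Return {column name: skewness} for columns whose |skewness| exceeds threshold.

    `table` is a hypothetical stand-in for the full dataset (features and
    target together); `threshold` is an arbitrary cutoff chosen for this sketch.
    """
    flagged = {}
    for name, column in table.items():
        s = skewness(column)
        if abs(s) > threshold:
            flagged[name] = s
    return flagged


# Hypothetical data: one symmetric feature, one right-skewed feature, one target.
table = {
    "feature_a": [1, 2, 3, 4, 5, 6, 7, 8],
    "feature_b": [0, 1, 1, 2, 2, 3, 10, 50],
    "target": [3, 4, 5, 6, 3, 4, 5, 6],
}
flagged = skewed_columns(table)
print(sorted(flagged))
```

Looping over all columns means a skewed feature is reported even when the target itself looks symmetric.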
checkmates/data_checks/datacheck_meta/data_check_message_code.py
Might be best to add some starter tests here to make sure that the data check and transformer work as expected for simple cases. Right now, some of the logic that ties the pieces together is missing.
Left a few comments. Are we planning to add tests as part of this MR, or will that come later?
Couple of comments but almost there!
distribution_data_check to screen for positive and negative skews as well as bimodal distributions
simple_normalizer transformer to transform data using the Yeo-Johnson method
Resolves #20
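
For readers unfamiliar with the Yeo-Johnson method, the sketch below applies the standard transform for a fixed, caller-supplied lambda. It is not the `simple_normalizer` code from this PR; the function name `yeo_johnson` and the sample data are assumptions for illustration. Practical implementations such as `scipy.stats.yeojohnson` or scikit-learn's `PowerTransformer(method="yeo-johnson")` also estimate lambda from the data, which this sketch does not attempt.

```python
import math


def yeo_johnson(values, lmbda):
    """Apply the Yeo-Johnson transform with a fixed lambda to each value."""
    out = []
    for y in values:
        if y >= 0:
            if lmbda == 0:
                out.append(math.log1p(y))  # log(y + 1)
            else:
                out.append(((y + 1) ** lmbda - 1) / lmbda)
        else:
            if lmbda == 2:
                out.append(-math.log1p(-y))  # -log(1 - y)
            else:
                out.append(-(((1 - y) ** (2 - lmbda)) - 1) / (2 - lmbda))
    return out


# Hypothetical sample with a negative value and a long right tail.
data = [-2.0, 0.0, 1.0, 3.0, 50.0]

unchanged = yeo_johnson(data, 1.0)   # lambda = 1 leaves the data as-is
compressed = yeo_johnson(data, 0.0)  # lambda = 0 pulls in the right tail
print(unchanged)
print(compressed)
```

Unlike Box-Cox, the transform is defined for zero and negative inputs, which is why it suits a general-purpose normalizer.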