Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why are the pickle results files so big? #39

Closed
mattwigway opened this issue Jul 28, 2020 · 2 comments
Closed

Why are the pickle results files so big? #39

mattwigway opened this issue Jul 28, 2020 · 2 comments

Comments

@mattwigway
Copy link
Owner

For the final model I'm submitting to the St Anselm conference, the Pickle file is 10.1 GB and the numpy file is only 55.4 MB. Now the pickle file does contain the houshold_housing_attributes which are big, but the numpy file contains a bunch of arrays that are just as big, although they do not have string index. In any case, moving to an npz-only format (#37) may solve.

@mattwigway
Copy link
Owner Author

I believe this is due to some functions getting serialized which seems to serialiaze self along with them.

@mattwigway
Copy link
Owner Author

This is fixed in 0.3.7.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant