Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update of numpy to version 1.21.0 causing conflict with pyarrow version #106

Open
AlexanderMeldonWhite opened this issue Mar 29, 2022 · 3 comments
Labels
bug Something isn't working

Comments

@AlexanderMeldonWhite
Copy link

Hi, I believe the recent upgrade of numpy to version 1.21.0 has caused the below error to occur when converting a dataframe to parquet format (df.to_parquet(f)).

pyarrow.lib.ArrowTypeError: ('Did not pass numpy.dtype object', 'Conversion failed for column xxxxxxxx with type int64')

This was fixed locally by upgrading pyarrow to version 3.0.

Would it be possible to upgrade the pyarrow version in the image?

Many thanks.

@yiyi-zz
Copy link

yiyi-zz commented May 2, 2022

Have the same issue... It would be great if the image gets fixed

@mabunday
Copy link
Contributor

mabunday commented Jun 4, 2022

I'm investigating this but don't have an ETA yet for the fix. The main difficulty is the dependency on MLIO - bumping the PyArrow version causes some integration failures.

To confirm, this only started occurring when the numpy version was bumped, right? If so, I may investigate bumping the numpy version instead as an easier path forward.

@mabunday mabunday added the bug Something isn't working label Jun 4, 2022
@AlexanderMeldonWhite
Copy link
Author

Hi, yes, that's correct, the issue only occurred after the upgrade of numpy.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants