Add eNATL recipe #75
pre-commit.ci autofix |
for more information, see https://pre-commit.ci
@cisaacstern this failed with the same error I encountered earlier. Could you take a look at this? |
Trying out pangeo-forge/deploy-recipe-action#27 |
OK, I was able to deploy this using the super hacky changes made in pangeo-forge/deploy-recipe-action#27. I will check later whether the Dataflow job ran successfully. But maybe more important, we need to wait and see how the discussion over at pangeo-forge/deploy-recipe-action#27 goes. Sorry for the delay. |
pre-commit.ci autofix |
Yay, this worked! @auraoupa do you have access to the leap hub? You can inspect the dataset with the following snippet:
import xarray as xr
path = 'gs://leap-persistent-ro/data-library/enatl60-blbt02-595733423-7175544257-1/eNATL60_BLBT02.zarr'
ds = xr.open_dataset(path, engine='zarr', chunks={})
ds
Two things I noticed:
|
Finally we should think about the chunking of the final product. These are all things we can/should discuss before we have the deployment figured out. |
Thank you @jbusecke for advancing so quickly on this!
|
Thanks for the quick response @auraoupa. Let's deal with the most challenging issue first:
I think ideally each file would contain the masks as coordinates. Then we could apply the masking on each file and also retain the masks in the final output (this might be very important for budget analysis etc.). I have raised pangeo-forge/pangeo-forge-recipes#663 to discuss this more broadly. Just as a heads up, this will probably not move before next week at the earliest, since folks are at AGU.
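A minimal sketch of the masking idea described above, using a toy in-memory dataset rather than the actual eNATL60 files. The variable name `temp`, the mask name `tmask`, and the convention that 1 means ocean and 0 means land are all assumptions made for illustration; they are not taken from the recipe.

```python
import numpy as np
import xarray as xr

# Hypothetical toy dataset standing in for a single input file.
ds = xr.Dataset(
    {"temp": (("y", "x"), np.array([[1.0, 2.0], [3.0, 4.0]]))},
)

# Hypothetical land/sea mask (assumed convention: 1 = ocean, 0 = land).
mask = xr.DataArray(np.array([[1, 0], [1, 1]]), dims=("y", "x"))

# Attach the mask as a non-index coordinate so it travels with the data.
ds = ds.assign_coords(tmask=mask)

# Set land points to NaN in the data variables; the mask coordinate is kept.
masked = ds.where(ds["tmask"] == 1)
```

Because the mask is stored as a coordinate rather than a data variable, `where` leaves it untouched, so it would still be available in the output for later analysis.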
I already renamed to
That seems fairly small to me. I would aim for chunk sizes in the 100-200 MB range, but this is a detail we can discuss at the end. |
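To make the 100-200 MB target concrete, here is a rough back-of-the-envelope sketch for picking the number of time steps per chunk. The grid shape `(1000, 2000)`, the `float32` item size, and the helper names are hypothetical and only illustrate the arithmetic; the sizes are uncompressed and use 1 MB = 10^6 bytes.

```python
import math


def chunk_nbytes(chunk_shape, itemsize=4):
    """Uncompressed size in bytes of one chunk (itemsize=4 assumes float32)."""
    return math.prod(chunk_shape) * itemsize


def time_steps_for_target(spatial_shape, target_mb=150, itemsize=4):
    """Largest number of time steps per chunk that stays at or below target_mb."""
    bytes_per_step = math.prod(spatial_shape) * itemsize
    return max(1, (target_mb * 10**6) // bytes_per_step)


# Hypothetical horizontal grid, not the real eNATL60 dimensions.
spatial_shape = (1000, 2000)
n_time = time_steps_for_target(spatial_shape, target_mb=150)
size_mb = chunk_nbytes((n_time, *spatial_shape)) / 10**6
```

With these assumed numbers each time step is 8 MB, so 18 steps per chunk gives 144 MB, inside the suggested range.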
Hi @jbusecke, I hope you had a nice end of 2023, and I wish you the best for 2024! |
Thanks for doing this @auraoupa. This might unblock us here. I will keep track of it over at pgf. |
* Update eNATL60.py new name and zenodo records for homogeneously masked data
* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Seems like we are getting an error that |
@auraoupa should |
pre-commit.ci autofix |
for more information, see https://pre-commit.ci
Well, that's a new one (cc @cisaacstern):
Let me change the target_chunks to see if this goes away |
OK, now I am getting yet another error that I cannot quite grok:
@cisaacstern could we dig into this in the coming days? Sorry this will still be blocked for now @auraoupa. |
Yes forgot that, it is time now.
No, it is indeed not dependent on time, sorry I missed it. Thanks and good luck with the unusual errors ... |
Fixing:
|
@SammyAgrawal can you move any further discussion to https://github.com/leap-stc/eNATL_feedstock and close this? |
Towards #73
DO NOT MERGE AS IS. HIGHLY EXPERIMENTAL