Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Start uploading production data to tape storage at NESE #254

Closed
landreev opened this issue Mar 21, 2024 · 7 comments
Closed

Start uploading production data to tape storage at NESE #254

landreev opened this issue Mar 21, 2024 · 7 comments
Assignees
Labels
Size: 10 A percentage of a sprint.

Comments

@landreev
Copy link
Collaborator

We have the first batch of data to upload there: https://fly.cs.umb.edu/omama/
We will use it to resolve and iron out any technical kinks in the workflow, and to develop service policies for future use and for offering it to clients.

@cmbz
Copy link
Collaborator

cmbz commented Mar 21, 2024

2024/03/21

@landreev landreev added the Size: 10 A percentage of a sprint. label Mar 22, 2024
@landreev
Copy link
Collaborator Author

I put a size 10 on it for now. I am actively working/focusing on this, so probably needs to be in progress.

@landreev landreev moved this from SPRINT- NEEDS SIZING to In Progress 💻 in IQSS Dataverse Project Mar 24, 2024
@landreev landreev self-assigned this Mar 24, 2024
@landreev
Copy link
Collaborator Author

landreev commented Apr 2, 2024

Got the new (prod.) NESE end point set up and the new service account and the app client given the right permissions on their end. Everything is seemingly configured identically to how the DEMO tape was set up. Tried the first upload in prod. - it bombed. Trying to figure out what's going on and what is different from that demo configuration that is known to work.
Reached out to Jim and Victoria, in case they can help me diagnosing it. But at least I have something to work with now. Will try to resolve it asap.

@landreev
Copy link
Collaborator Author

landreev commented Apr 2, 2024

Got the globus app to work.
Can now upload data to the production tape. Everything is working like a charm when I'm doing it from my own Dataverse instance. With our actual prod. instance, something is failing in the very last step, when Dataverse just needs to register the file in its own database.
Working on figuring this part out. We are very close. 🥲

@landreev
Copy link
Collaborator Author

landreev commented Apr 3, 2024

OK, we are in business of uploading serious prod. data.
These are the first bundles of 2D images from the Omama collection:

Screen Shot 2024-04-02 at 7 59 24 PM

@landreev
Copy link
Collaborator Author

landreev commented Apr 4, 2024

One remaining task (other than uploading the second OMAMA dataset-worth of data which is pending on local delivery) is to get the globus app to work properly in prod. for redirecting _down_load requests for NESE-stored objects.

@landreev
Copy link
Collaborator Author

landreev commented Apr 8, 2024

The OMAMA dataset mentioned above: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/KXJCIU&version=DRAFT
Thinking of closing the issue since the tasks outlined above have been completed. Will be opening new issues for next steps and phases of the effort. The most important one being developing a doc. guide for future customers of this big data-to-tape storage service.

@landreev landreev closed this as completed Apr 8, 2024
@cmbz cmbz moved this from In Progress 💻 to Done 🧹 in IQSS Dataverse Project Apr 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Size: 10 A percentage of a sprint.
Projects
None yet
Development

No branches or pull requests

2 participants