Bulk upload #667

Open
ockentap opened this issue Nov 29, 2024 · 2 comments

Comments

@ockentap

Hello,

There is no bulk upload feature in the ia tool. When I try to upload 300 GB, the connection drops at times, which means I have to start from the beginning, because the tool doesn't keep track of what is already on the server compared to what exists locally. This is a waste of bandwidth. I have made a tool (a script) that keeps track of what has been uploaded, so that it is easier to resume after an interruption. I wish this would be added as a feature of the ia tool you are developing.

https://github.com/ockentap/Internet-Archive-CLI-Bulk-Upload-Script

@Dobatymo
Contributor

Dobatymo commented Dec 2, 2024

You can skip files based on checksum (--checksum). Is that not enough?

@ockentap
Author

ockentap commented Dec 2, 2024

Maybe for most use cases, but checksumming several hundred GB of files on every upload can take hours at best and days at worst, speaking from my own case. I am still uploading a 300 GB archive with a Raspberry Pi 3, and every time my connection drops I have to restart from the beginning, meaning it has to rehash all the files both locally and on IA. The SQLite database keeps track of which files have been uploaded, so the script can continue where it left off and only checks the MD5 once all files are uploaded.
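
To illustrate the idea, here is a minimal sketch of SQLite-backed resume tracking. It is not the code from the linked script; the table layout and the names `open_state`, `resume_upload`, and `upload_fn` are assumptions made up for this example. Each file is recorded only after its upload returns without error, so a dropped connection loses at most the file in flight and nothing has to be rehashed on restart.

```python
import sqlite3
from pathlib import Path
from typing import Callable, Iterable, List


def open_state(db_path: str) -> sqlite3.Connection:
    """Open (or create) the local database that records finished uploads."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS uploaded ("
        "path TEXT PRIMARY KEY, size INTEGER NOT NULL)"
    )
    conn.commit()
    return conn


def resume_upload(
    conn: sqlite3.Connection,
    files: Iterable[Path],
    upload_fn: Callable[[Path], None],  # hypothetical: wraps the real upload call
) -> List[Path]:
    """Upload every file not yet recorded; return the files sent in this run."""
    sent = []
    for path in files:
        size = path.stat().st_size
        row = conn.execute(
            "SELECT size FROM uploaded WHERE path = ?", (str(path),)
        ).fetchone()
        if row is not None and row[0] == size:
            continue  # recorded in an earlier run with the same size: skip, no hashing
        upload_fn(path)  # may raise if the connection drops
        # Record the file only after the upload succeeded, and commit at once
        # so the progress survives a crash or interruption.
        conn.execute(
            "INSERT OR REPLACE INTO uploaded (path, size) VALUES (?, ?)",
            (str(path), size),
        )
        conn.commit()
        sent.append(path)
    return sent
```

In practice `upload_fn` could wrap whatever performs the transfer, for example a call into the internetarchive Python library. The sketch compares only path and size, which is cheap but weaker than a checksum, and it leaves out the final MD5 verification pass described above.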


2 participants