Use cases:
S3 with CESNET must be configured in clarin-dspace.cfg
S3 CESNET configuration must be documented
Upload a regular file
Upload a file of tens of GB (6GB; 5GB is the CESNET limit) - the bitstream is added to the DB after upload
Download as a normal user
Download as an admin
Download as an anonymous user
Downloading - pause and continue
Delete bitstream -> run cleanup -> the file should be removed from S3
Create a new version of the Item and check the storeNumber; if the bitstream was added to S3, try downloading a bitstream from the new version
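For the large-file use case, here is a minimal sketch of how a file above the 5GB limit could be split into parts for a multipart upload. The 512 MiB part size, the reading of the limit as 5 GiB, and the names `plan_parts` and `needs_multipart` are assumptions for illustration only; they are not taken from DSpace or CESNET.

```python
from typing import List, Tuple

MIB = 1024 ** 2
GIB = 1024 ** 3

# Assumption: the "5GB" limit is read as 5 GiB per single upload request.
SINGLE_UPLOAD_LIMIT = 5 * GIB


def needs_multipart(file_size: int) -> bool:
    """Return True when the file is larger than the assumed single-upload limit."""
    return file_size > SINGLE_UPLOAD_LIMIT


def plan_parts(file_size: int, part_size: int = 512 * MIB) -> List[Tuple[int, int]]:
    """Return (offset, length) pairs that cover file_size bytes in order."""
    if file_size < 0 or part_size <= 0:
        raise ValueError("file_size must be >= 0 and part_size must be > 0")
    parts = []
    offset = 0
    while offset < file_size:
        # The last part may be shorter than part_size.
        length = min(part_size, file_size - offset)
        parts.append((offset, length))
        offset += length
    return parts


if __name__ == "__main__":
    size = 6 * GIB  # the 6GB test file from the use case
    print(needs_multipart(size), len(plan_parts(size)))
```

Each (offset, length) pair could then be read from the temp file and sent as one part, so a test with a 6GB file exercises the multipart path rather than a single request.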
Questions:
0. Are any S3 issues fixed in DSpace 7.6.1?
1. Is the bitstream first stored locally and synchronized to S3 at some later point?
2. If a user uploads a 40GB file does the system need the storage capacity to hold that temporarily?
3. Can frequently accessed bitstreams be cached on the repository system or is a download not going through the repository system at all?
4. Is the bitstream’s checksum computed by the repository system or fetched from S3 metadata?
5. Is there an option to use both a local assetstore and an S3 assetstore?
6. What happens when the software can’t connect to S3, or when the connection fails during an upload/download? Does the user notice? Is it possible to resume?
7. When a curation task runs, is the bitstream first downloaded locally?
8. When is the checksum created?
Answers:
1. Yes, the file is first copied into a local (Tomcat) temp file and then uploaded to S3 as a multipart file.
2. Yes, the system needs 40GB of capacity because the file is copied into the tomcat/temp folder.
3. Bitstreams are not cached. Every download is served as a stream.
4. The object is fetched from S3 together with its checksum, and the checksum is then read from that object. The checksum is computed before upload.
5. No.
6. I started an upload and then unplugged the Ethernet cable. The user sees an "Upload failed" message and the upload must be restarted from the beginning.
7. Which curation task?
8. S3: the checksum is computed before upload. The checksum value is fetched from S3.
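To illustrate "the checksum is computed before upload", here is a minimal sketch that hashes the temp file in chunks, so a 40GB file never has to fit in memory. MD5, the 1 MiB chunk size, and the function name `stream_checksum` are assumptions for the example; this is not the actual S3BitStoreService code.

```python
import hashlib
import io
from typing import BinaryIO


def stream_checksum(stream: BinaryIO, algorithm: str = "md5",
                    chunk_size: int = 1024 * 1024) -> str:
    """Hash a binary stream chunk by chunk and return the hex digest."""
    digest = hashlib.new(algorithm)
    # read() returns b"" at end of stream, which stops the iteration.
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # Stand-in for the file in tomcat/temp; a real run would use open(path, "rb").
    local = stream_checksum(io.BytesIO(b"example bitstream content"))
    print(local)
```

The value computed locally could then be compared with the checksum returned with the S3 object to detect a corrupted or incomplete upload.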
TODO:
Configure S3 with CESNET
Try to upload/download/delete a file on CESNET
Extend S3BitStoreService and update the store and remove methods to add/delete data in the local assetstore
Create tests
Admin UI - bitstream
Create synchronization checker between local and S3 assetstore
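A rough sketch of what the synchronization checker could compare, assuming each assetstore can be listed as a mapping from bitstream key to checksum. The `SyncReport` type, the `compare_stores` function, and the dict inputs are hypothetical; how the listings are obtained from the local assetstore and from S3 is left out.

```python
from typing import Dict, List, NamedTuple


class SyncReport(NamedTuple):
    missing_in_s3: List[str]      # present locally, absent in S3
    missing_locally: List[str]    # present in S3, absent locally
    checksum_mismatch: List[str]  # present in both, checksums differ


def compare_stores(local: Dict[str, str], s3: Dict[str, str]) -> SyncReport:
    """Compare two key -> checksum listings and report the differences."""
    missing_in_s3 = sorted(key for key in local if key not in s3)
    missing_locally = sorted(key for key in s3 if key not in local)
    mismatch = sorted(
        key for key in local if key in s3 and local[key] != s3[key]
    )
    return SyncReport(missing_in_s3, missing_locally, mismatch)


if __name__ == "__main__":
    report = compare_stores(
        {"a": "111", "b": "222", "c": "333"},
        {"b": "222", "c": "xxx", "d": "444"},
    )
    print(report)
```

An empty report in all three fields would mean the two assetstores agree for every listed bitstream.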
Original issue: ufal#1065
NOTE:
most_recent_checksum result is changed to CHECKSUM_NO_MATCH
S3 CESNET limits:
Wiki: https://github.com/dataquest-dev/DSpace/wiki/S3-%E2%80%90-CESNET