-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix mddatasrc for all studyforrest datasets #62
Comments
Here's a log of me doing it for
|
Thanks @adswa for the great instructions! I have changed the I get the same |
Yes, this message is unrelated to the special remote 👍 |
This is what I get for https://github.com/psychoinformatics-de/studyforrest-data-aggregate. Two git-annex remotes, nothing about
|
Thx! In this case, the remote is not called |
I'm investigating 👍 |
enabled the new remote with the correct location, but still getting errors when retrieving file content:
with
|
Thx, I will investigate and report back what I found! 👍 |
EDIT: The problem is that the dataset on https://datapub.fz-juelich.de/studyforrest/studyforrest/templatetransforms/.git/ contains an old version, with the last commit from 2016. The dataset on GitHub has more recent commits. They seem to originate from juseless, but other than the commits, these changes were not published. If we push this dataset from data1:/data/project/studyforrest/superds/derivative/image_space_transformations to datapub, this should get fixed. I don't have permissions to do this. For https://github.com/psychoinformatics-de/studyforrest-data-templatetransforms I also need some help, so I'm tagging @mih and @bpoldrack: There are files that can't be retrieved, e.g., This is the availability information registered for those files (exemplary for one, matches all of them) - the important bit is that the enabled
However, the files are available at I already did an A side question is whether those files are left unregistered on purpose, e.g., because of data privacy. |
As for https://github.com/psychoinformatics-de/studyforrest-data-phase2-denoised , we don't have this data, all sources are with OpenNeuro as far as I can see. Edit: The dataset here on github is outdated. The problem is that the data was updated upstream, and the content from the now unavailable files was moved to *_decomposition.json in commit de145f67a3da26f1d39187403340d7380d928cf2 tag 1.3.0. |
A quick overview of a TODO for @mih:
|
That is strange. It should update availability if there was a change and it would be the way to go. |
See my edit in that post, and most recent comment to @mih with a fix, @bpoldrack: The files in question differ in version between GitHub and datapub. |
Another TODO for @mih: I lack the permissions to do so, and this dataset is superfluous as I have forked the openneuro dataset as discussed in the chat as a maintained alternative to https://github.com/psychoinformatics-de/studyforrest-data-phase2-denoised_openneuro TODO for me:
|
At the moment, the Studyforrest datasets hosted here on GitHub are all broken. The reason for this is a faulty special remote
mddatasrc
pointing topsydata.ovgu.de
, which used to redirect todatapub.fz-juelich.de
(where the data was migrated to), but was taken down recently. The first user issue that brought this problem to light is psychoinformatics-de/studyforrest-data-visualrois#6.Although I've only probed a handful of repositories/subdatasets in this repo, I believe they all have a now broken
mddatasrc
special remote registered. I suggest we put in a coordinated effort to fixing this with as many people as possible. @bpoldrack outlined a fix for this issue in psychoinformatics-de/studyforrest-data-visualrois#6. Here's my translation for the general procedure that anyone can follow:mddatasrc
during cloning. If not, nevertheless try to retrieve data to make sure it all works. If everything works, move to the next dataset; if not, move to 3.remote.log
and make sure there is only onemddatasrc
special remote (git cat-file -p git-annex:remote.log
is the command to do it). If there are two, leave a note, and move to the next dataset for now.mddatasrc
special remote inremote.log
/.git
mddatasrc
usinggit remote remove mddatasrc
mddatasrc
using its UUID as an identifier, and the URL you constructed from datapub.fz-juelich.de (see example below) to fix the location information:datalad get
to confirm that this fix worked, and retrieval frommddatasrc
is possible againdatalad push
the changes back to GitHub. There is no need (or possibility) to do a pull request. Make sure that the git-annex branch gets successfully pushed. If you run into permission errors, seek help in the chat.List of repositories:
mddatasrc
error)mddatasrc
error)(nofixed!mddatasrc
error; connection errors duringclone
andget
)mddatasrc
error)mddatasrc
, leave for later!datalad get sub-01
)mddatasrc
error; getting.gz
files works, but.txt
files look like availability was never pushed. Seesub-05/ses-movie/func/sub-05_ses-movie_task-movie_run-1_desc-MELODICSm5_componentLabels.txt
) -> this dataset was an external contribution, and has been replaced by a fork of the corresponding maintained openneuro dataset (https://github.com/psychoinformatics-de/studyforrest-data-phase2-denoised_openneuro)The text was updated successfully, but these errors were encountered: