-
-
Notifications
You must be signed in to change notification settings - Fork 989
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature Request] KemonoParty: Preventing duplicates with revisions #6096
Comments
Remove the title from the file name. Download the post's unique files only. Save the metadata of the post from Kemono and use it to sort the files with symbolic or hard links to not waste any storage space. Replace creator_id and post_id in the URL with the correct ID of the creator and post. Trying to sort files from Kemono without wasting space will only lead to frustration. https://kemono.su/api/v1/service/user/creator_id/post/post_id
|
Thanks, but
Yes, I'm already saving unique files using their hash. However, your suggestion still uses |
The suggestion above does not use |
I was referring to the directory option ( |
@mikf Bumping since an admin announced that Kemono is shutting down on November 22nd. Since this issue never got a label, is it considered a won't do/out of scope? |
You might as well consider this "won't fix" then, as there is a good chance the next release will be after 2024.11.22.
Each revision has 4 metadata fields:
The earliest revision entry has a Using conditional file/directory names, you could do do something like "directory": {
"revision_index == 1" : ["{username}", "{service}", "[{id}]", "earliest revision: {title}"],
"revision_index == revision_count": ["{username}", "{service}", "[{id}]", "latest revision: {title}"],
"": ["{username}", "{service}", "[{id}]", "{revision_id}: {title}"]
} |
Thanks for the suggestion, though that still creates duplicate files. I think what I may end up having to do is to only use the post ID as the folder name, then write a script to rename them properly after downloading. |
It won't if you use an archive with |
I might try that instead, now that I think about it. The majority of content shouldn't have that many revisions, so fixing the few duplicate folders might be easier. In regards to the comment I just left, #6415 (comment), would this still be a viable feature to add? |
I'm currently downloading all post revisions and organizing them with the following directory structure:
However, both the post's
date
andtitle
could change between revisions. I can send example URLs privately for both situations, if needed. This creates a duplicate folder, and could potentially eat up space with all the content being redownloaded. I have also tried using{published[:10]}
instead of date, but in later revisions it can be null, so it duplicates using "None". Though, that still wouldn't address title changes.If there currently isn't a way to solve this (aside from obviously not using date/title), could we get some extra options to use with the format strings, such as
{earliest_revision_date}
,{latest_revision_date}
,{earliest_revision_title}
, and{latest_revision_title}
?The text was updated successfully, but these errors were encountered: