-
Notifications
You must be signed in to change notification settings - Fork 438
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Basic Duplicate Detection #2749
Basic Duplicate Detection #2749
Conversation
ab46576
to
73e9ae0
Compare
...y-dspace-result-list-element/pool-search-result/pool-search-result-list-element.component.ts
Outdated
Show resolved
Hide resolved
src/app/submission/sections/duplicates/section-duplicates.component.html
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Works as intended also using the new search endpoint.
src/app/shared/object-list/duplicate-data/duplicate.resource-type.ts
Outdated
Show resolved
Hide resolved
...-result-list-element/claimed-search-result/claimed-search-result-list-element.component.html
Show resolved
Hide resolved
...dspace-result-list-element/pool-search-result/pool-search-result-list-element.component.html
Show resolved
Hide resolved
@MW3000 thank you for the further review. I have fixed the issues identified. |
Hi @kshepherd, |
3a34a16
to
a362c6b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @kshepherd for this PR.
I've reviewed this PR and found out this is not ported as is from implemented on DSpace-CRIS.
I agree to keep it simple than the most comprehensive functionality that is present on DSpace-CRIS, but i've couple of suggestions i'd like to be ported here in this PR:
- in my opinion is not necessary to show the submission step when no duplicates were detected because I don't think it gives any added value to the submission process
We hide mandatory section without section data here on DSpace-CRIS https://github.com/4Science/dspace-angular/blob/e4e919307924a2f912bdcc53cf00fd20fc18e8c0/src/app/submission/objects/submission-objects.effects.ts#L87C25-L88C70 - when a duplicate is detected i'd highlight the message with a warning alert, something like
In this case i'd also change the section status to warning
Moreover in console there is a repetitive error which occurs when the submission is deposited due to the detect duplicate step
@atarix83 thank you for the review! Yes, this isn‘t a port anymore. As the port didn‘t got reviewed for several DSpace versions, we reimplemented this to get much smaller PRs. We hoped to get it reviewed this time by reducing the PR to roughly 20% of the original size. |
@atarix83 we addressed your feedback. Can you please confirm? |
@atarix83 thanks for your review, I have addressed your feedback as below:
However, if the angular env config
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i'm sorry i didn't realize that the rest part is fully changed and my thought was that only the Angular interface has been simplified.
also based on what @abollini pointed out in this comment DSpace/RestContract#252 (review) I'd not approve this PR.
There are any changes the old porting PR is resumed?
i18n fix alignment fix comment fix
A new config property allows the user to force the duplicate section to be displayed even if there are no duplicates as sometimes this is useful information to a reviewer or submitter
force pushed a large rebase (lots of conflicts with modules now that other submission PRs are pushed and this one has so many commits), and includes changes to use searchBy endpoint again as per last reversion/change in backend PR |
5f1e082
to
1206d61
Compare
@tdonohue although code cov looked ok after i made my last changes, i also added new tests for the data service and the object reducer / actions, to keep things in line with general good practice and get coverage up a bit more, hope that works well |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 Thank you @kshepherd ! I retested this today and it works perfectly from the Submission form & the Workflow tasks page. I've also verified that errors no longer appear in DevTools when this feature is disabled. The new specs/tests also look good & bring this up to our CodeCov requirements
Flagging as |
Removing "needs documentation" flag as docs now exist in https://wiki.lyrasis.org/pages/viewpage.action?pageId=328958055 |
References
The initial work on this feature was supported by TU Berlin. This work was supported by FHNW and ZHAW.
Description
This feature introduces basic Duplicate Detection into DSpace submission and item links, using Solr's ability to search by levenshtein distance.
A new solr field
item_signature
is indexed for in-progress and archived items, with a normalised version of the title to allow it to be treated as a single term using the~n
operator, wheren
is a configurable edit distance.The potential duplicates detected by this search will be displayed to submitters or reviewers as a submission step, for their information (they could then choose to ignore the report if they know this is a new different item, or cancel their submission if there is a legitimate duplicate already present).
If a potential duplicate is in workspace, only the submitter of an item can see this information.
Workflow reviewers are given a small hint indicating how many potential duplicates are detected for a pooled or claimed task, and directing them to the 'Edit' (form) where the step will show details. If the potential duplicate is in workflow, only a reviewer or admin will be able to see this information.
If versioning is used, only the latest version is included in comparison.
Instructions for Reviewers
duplicate
stepdspace/config/modules/duplicate-detection.cfg
and configure as desiredChecklist
This checklist provides a reminder of what we are going to look for when reviewing your PR. You need not complete this checklist prior to creating your PR (draft PRs are always welcome). If you are unsure about an item in the checklist, don't hesitate to ask. We're here to help!
yarn lint
yarn check-circ-deps
)