-
Notifications
You must be signed in to change notification settings - Fork 493
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature Request/Idea: Make OAI-PMH harvesting more configurable #10677
Comments
@pdurbin and I were having a chat about this issue on Zulip, but found out it would be good to share the conversation in a public channel, so I've pasted it below: Philip Durbin:
Philipp Conzett:
Philipp Conzett:
Philip Durbin:
Philipp Conzett:
Philip Durbin:
Philipp Conzett:
Philip Durbin:
|
@philippconzett we are also interested in this feature. Could we implement something similar to the Exporter SPI, i.e. add custom modules (Importers) responsible of the transformation of a harvested metadata format into corresponding metadatablocks? |
Per dataverse/src/main/java/edu/harvard/iq/dataverse/harvest/server/web/servlet/OAIServlet.java Line 161 in 54767b9
We don't yet have the equivalent of the exporter spi to make importers, but if the idea here is just to let non-Dataverse catalogs harvest DV content, and it's XML, I think you just have to create/install the exporter you want. |
Thanks for the feedback! I'm not sure if I understand the technical details. Could we schedule a call with Jim and/or Phil and those interested? |
@qqmyers: That's great, I wasn't aware of this feature! I thought we were talking about the other way round, collecting more metadata from other repositories... @philippconzett That would be nice, we or at least one of us (@vera, @julian-schneider, @johannes-darms ) would like to join. |
Great! I've created a when2meet calendar to help us schedule a call. I'll be on and off in vacation mode from today, but maybe Thursday or Friday next week could work for most of us? It would be good if someone knowing the details of metadata export could join. I see that @poikilotherm, @qqmyers, and @pdurbin have contributed to the GDCC dataverse-exporters GitHub repo. |
Just to make sure I'm on the right track: The functionality @qqmyers refers to above, is the one described in section Metadata Export Formats in the Developer Guide? |
Yes indeed! |
@qqmyers Thanks for filling in the when2meet calendar! Pinging @vera, @julian-schneider, @johannes-darms, @DS-INRA, @gwendoux I've created a collaborative notes doc. It currently contains a brief description of the DataverseNO-SIOS use case and how we could approach it to make the requested feature useful for other, similar use cases in the Dataverse community. Please feel free to contribute! Thanks! |
Leaving a note here that I shamelessly made use of my admin rights and #2721 to the initial description. Another note: I've been talking about creating an XML-RDF exporter for a long time now. That's the way to go when you want to expose all metadata in XML without much need for configuration. Not sure if we'd prefer some standalone thing specialised in XML stuff or if we want to look into using sth like https://github.com/gdcc/exporter-transformer. Also, not sure if these issues are related with regards to technical implementation: #10042, #9344, #10000 |
Thanks all for indicating your availability. I've sent you a calendar invite. Please let me know if you haven't got it. |
Overview of the Feature Request
The idea is to make it possible to make OAI-PMH metadata harvesting more configurable, so that 1) the metadata about the datasets included in a given harvesting set can come from any selection of fields from any metadata schema defined in a Dataverse installation, and 2) the metadata can be based on other standards than Dublin Core (DC). See discussion in Dataverse Users Community Google Group.
What kind of user is the feature intended for?
API User, Superuser, Sysadmin
What inspired the request?
DataverseNO would like to implement interoperability support for data to be made searchable and reusable through the Svalbard Integrated Arctic Earth Observing System (SIOS), which is an international observing system for long-term measurements in and around the Norwegian archipelago of Svalbard addressing Earth System Science questions. There is a growing community in Europe and beyond who makes or is interested in making their data reusable through SIOS. Currently, SIOS only supports harvesting of discovery metadata using OAI-PMH.
What existing behavior do you want changed?
Currently, Dataverse supports OAI-PMH harvesting using a DC representation of (some of?) the metadata in the Citation Metadata block.
Any brand new behavior you want to add to Dataverse?
Yes, the requested feature would extend the possibility of how to configure OAI-PMH metadata harvesting.
Any open or closed issues related to this feature request?
Some of the issues below might be related:
IQSS/dataverse:
IQSS/dataverse-pm:
The text was updated successfully, but these errors were encountered: