Error on indexing #24
Comments
I guess all I needed to do was write the issue out to find the problem. In this example, I had a mistake in the file name I'd used for the nt file. Once I fixed that, I was able to complete the indexing and start the back-end service. I could then proxy that port through nginx and query the data through the QLever UI. I'm still having trouble with TTL files, but I can deal with writing to N-Triples for now.
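Since the root cause here was a mistyped input file name, a quick pre-flight check before running 'qlever index' may save some time. The sketch below uses only the Python standard library. The function name `matching_input_files` and the example pattern `*.nt` are hypothetical, and the script does not read the Qleverfile itself, so the pattern has to be copied over by hand.

```python
import glob
import os


def matching_input_files(pattern, directory="."):
    """Return the sorted list of regular files in `directory` matching `pattern`."""
    paths = glob.glob(os.path.join(directory, pattern))
    return sorted(p for p in paths if os.path.isfile(p))


if __name__ == "__main__":
    # Hypothetical pattern; use whatever your Qleverfile lists as input files.
    files = matching_input_files("*.nt")
    if not files:
        print("No input files match the pattern; check the file name.")
    else:
        for path in files:
            print(f"{path}: {os.path.getsize(path)} bytes")
```

An empty result points at a file name or pattern problem rather than at the content of the data.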
@skybristol Which problems are you having with your TTL files? If you provide a link, we will be happy to try it out on our side.
Cool! Thank you. The main set of content I need to pull into an index is the Geoscience Ontology. This is the most comprehensive encoding I know of that organizes things like rock types and mineral species within the Basic Formal Ontology and Common Core Ontologies. I know there are a few information issues here and there in the modules, but the TTL files seem to check out structurally when parsed with rdflib. I tried setting up a Qleverfile using the get-data methods I saw you all use elsewhere to pull from the GitHub source and then index. It's probably something simple that I'm missing.
@hannahbast - Thanks for engaging on this. I'm excited to get into all the things QLever should enable us to do. I am working through some of my other use cases in terms of content that we'll need to index, and I have narrowed down a problematic piece seen in the following error:
This is an ORCID record for a person retrieved via content negotiation for text/turtle. I tried pulling just a small handful of what will be about 6000 records we need to build into an index, to see where we might run into problems. Once I remove the gn:Feature triple from that record, I'm able to index the batch of 20 that I cached and start/test the QLever backend successfully. Here is a curl request for that document, where I seem to be tripping a parsing error related to brackets. I've also seen this error elsewhere, including in a whole set of triples I tried to index from our Wikibase instance.
@skybristol I just found this issue again (stuff tends to get overlooked here on https://github.com/ad-freiburg/qlever-control/issues, the main issue tracker is over at https://github.com/ad-freiburg/qlever/issues) and tried to download https://zenodo.org/records/4743221/files/Loop3D/GKM-1.0.1.zip . This contains many |
I'm having trouble indexing anything other than something from the examples. It seems like I must be missing some critical step. I keep getting the following error lines upon running 'qlever index':
My latest attempt is a small ntriples file built from CrossRef metadata on a collection of 50 DOIs. I need to pull these records into a graph that I can combine with other data in queries that refer to the DOIs as identifiers. I have not yet done anything specific with the get-data part of the process, but I will eventually make this a call to a Python script that fetches the data and caches files for indexing. I've also tried this on a number of sources serialized as TTL, but I'm trying to narrow down the problem and figure out why indexing is not working for me.
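The planned fetch-and-cache step could look roughly like the sketch below. It is an outline under stated assumptions, not the script used in this issue: the helper names (`build_request`, `cache_path`, `fetch_to_cache`), the `cache` directory, and the choice of `text/turtle` as the `Accept` type are all hypothetical, and it assumes the resolver at `https://doi.org/` honours content negotiation for the DOIs in question.

```python
import urllib.parse
import urllib.request
from pathlib import Path


def build_request(doi, accept="text/turtle"):
    """Build a content-negotiation request for a DOI (no network access yet)."""
    url = "https://doi.org/" + urllib.parse.quote(doi, safe="/")
    return urllib.request.Request(url, headers={"Accept": accept})


def cache_path(doi, cache_dir="cache", suffix=".ttl"):
    """Map a DOI to a file name that is safe on common file systems."""
    safe_name = urllib.parse.quote(doi, safe="")
    return Path(cache_dir) / (safe_name + suffix)


def fetch_to_cache(doi, cache_dir="cache"):
    """Download the record once; reuse the cached copy on later calls."""
    target = cache_path(doi, cache_dir)
    if target.exists():
        return target
    target.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(build_request(doi), timeout=30) as response:
        target.write_bytes(response.read())
    return target
```

Caching one file per DOI keeps repeated runs from re-fetching all records and makes it easier to isolate a single record that fails to index.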
The indexing process does run up to a point, producing a number of files in my index folder for this collection but then errors out. When I try to run 'qlever start' I get the following error:
I'm happy to provide files on what I'm doing, but here is the relevant snippet from the Qleverfile along with the top of my nt file.
Start of NT file built from calling CrossRef content negotiation and serializing with rdflib in Python.