-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cannot parse dicom_ontology.owl #14
Comments
Hi @tgbugs -
Also, I have uploaded a similar python dict file that includes the tag value and the definition/notes text from the DICOM XML docbook. Try running your units-detecting code on that file and see how it goes. I kept the utf-8 encoding so that things like the mu and degrees symbols were still intact. This is not true in the current owl file in which I substituted u's for mu's. |
|
Hi, I would like to confirm that I could not parse the OWL file either trying several different tools (e.g. OWLGrEd, WebVOWL) |
I reviewed my .bash_history file to see what I did, and unfortunately it looks like I made all the changes using vim's ex mode ( |
@khelm I was trying to parse dicom_ontology.owl with my usual suite of ttl parsers (rapper among others gives pretty good debug info) and noticed that there are multiple cases where the descriptions are malformed. I think all that needs to be changed is to add an additional cleaning and proper escaping rules after
dicom-ontology/create_dicom_ttl.0.4.py
Line 336 in e3d7e47
rdflib.Literal
orjson.dumps
may be sufficient.Some examples of issues.
<200b>
char at the end of every definition (I have to open it in vim to see this).\
is not escaped so parsers try to interpret things like-1\-1
as an escape sequence and fail.An incomplete set of fixes with the examples (as a patch).
ontdiff.txt
The text was updated successfully, but these errors were encountered: