SBOL 3->2 needs to remap sequence encodings and component types #16
Orientations also appear to be failing in the same manner.
I took a look at this issue today, and it looks like we should move the remapping you made into sbolgraph. The workaround @jakebeal wrote can be found here: https://github.com/iGEM-Engineering/iGEM-distribution/blob/a697bfcb9da4db38da07e19b379968013f284a35/scripts/scriptutils/conversions.py#L121 The encoding constants can be found here and here. Should we move these constants to sbolgraph, or will most of them be unnecessary? bioterms doesn't have the encoding constants either. Last but not least, I cloned the sbolgraph repo and ran [email protected] test
bash test.sh
🔄 Converting file: SBOLTestSuite/GenBank/EF587312.gb
...and that's it. I think it would be useful to be able to run and create some tests for this issue, and also to know whether it's necessary to move the constants to sbolgraph.
@isaacguerreiros I believe that the conversion tests in sbol-utilities will be a good set of test cases to use here. The same conversions should hold, given that this issue is essentially asking for the corrective RDF changes in that library to be brought upstream into this library. With regard to the constants: anything that appears in the SBOL specification is, I think, fine to encode in the library. If you disagree, @udp, please comment. @isaacguerreiros: do you need any other information in order to proceed?
I analyzed some of the code, and apparently the SBOL specification constants in pySBOL3 and bioterms are different. bioterms has the same URI for encoding in SBOL2 and SBOL3 (see permalinks for the exact lines), while pySBOL3 uses different identifiers from identifiers.org. To me, it looks like if we make the bioterms and pySBOL3 encoding identifiers equal, it will no longer be necessary to remap sequence encodings. My understanding is this: because bioterms uses the same encoding URIs for SBOL3 and SBOL2 as pySBOL2 does, the conversion currently needs this remapping. But if bioterms and pySBOL3 agree with the specification for encoding, the remapping step may become unnecessary. My pull request in bioterms is my attempt to resolve this. It would also be good to start discussing how I could test this change #19
Last, but not least: I could not find the SMILES encoding in bioterms or sbolgraph. Is this a concern? Judging by the remapping you made, @jakebeal, this could be a problem.
@isaacguerreir The pySBOL3 constants follow the SBOL 3.0.1 specification. If I'm understanding the constants file here correctly, it looks like the terms you identified just haven't been updated to their new values yet. I also agree that the SMILES term just isn't there; I don't see it anywhere in the library with a search.
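To make the encoding mismatch concrete, here is a minimal sketch of the kind of lookup an SBOL 3 -> SBOL 2 conversion would need. It is not the sbol-utilities or sbolgraph code: the names `SBOL3_TO_SBOL2_ENCODING` and `remap_encoding` are hypothetical, and the URIs are written out as I understand them from the SBOL 2 and SBOL 3 specifications, so they should be checked against the specs before being relied on.

```python
# Hypothetical lookup table: SBOL 3 encoding URI -> SBOL 2 encoding URI.
# URIs are assumptions based on my reading of the two specifications.
SBOL3_TO_SBOL2_ENCODING = {
    # IUPAC nucleic acid (DNA/RNA)
    "https://identifiers.org/edam:format_1207": "http://www.chem.qmul.ac.uk/iubmb/misc/naseq.html",
    # IUPAC protein
    "https://identifiers.org/edam:format_1208": "http://www.chem.qmul.ac.uk/iupac/AminoAcid/",
    # SMILES
    "https://identifiers.org/edam:format_1196": "http://www.opensmiles.org/opensmiles.html",
}


def remap_encoding(uri: str) -> str:
    """Return the SBOL 2 URI for a known SBOL 3 encoding; pass anything else through unchanged."""
    return SBOL3_TO_SBOL2_ENCODING.get(uri, uri)


if __name__ == "__main__":
    print(remap_encoding("https://identifiers.org/edam:format_1207"))
```

Passing unknown URIs through unchanged is a deliberate choice in this sketch, so that custom or unrecognized encodings survive the conversion rather than being dropped.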
Perfect. So the bioterms pull request could resolve the first part of the problem.
I took a look, and the same problem occurs with the SBOL3 specification for types. I added similar changes to the PR to correct the problem with type remapping.
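For the component types, the analogous remapping might look like the sketch below. Again, this is only an illustration under stated assumptions: `SBOL3_TO_SBOL2_TYPE` and `remap_types` are hypothetical names, and the SBO and BioPAX URIs reflect my understanding of the SBOL 3 and SBOL 2 specifications rather than anything copied from bioterms or sbolgraph.

```python
from typing import Iterable, List

# SBOL 2 component types use BioPAX terms; SBOL 3 uses SBO terms.
# All URIs below are assumptions to be verified against the specifications.
BIOPAX = "http://www.biopax.org/release/biopax-level3.owl#"

SBOL3_TO_SBOL2_TYPE = {
    "https://identifiers.org/SBO:0000251": BIOPAX + "DnaRegion",      # DNA
    "https://identifiers.org/SBO:0000250": BIOPAX + "RnaRegion",      # RNA
    "https://identifiers.org/SBO:0000252": BIOPAX + "Protein",        # protein
    "https://identifiers.org/SBO:0000247": BIOPAX + "SmallMolecule",  # simple chemical
    "https://identifiers.org/SBO:0000253": BIOPAX + "Complex",        # non-covalent complex
}


def remap_types(types: Iterable[str]) -> List[str]:
    """Map each known SBOL 3 type URI to its SBOL 2 equivalent, keeping order and unknown URIs."""
    return [SBOL3_TO_SBOL2_TYPE.get(t, t) for t in types]


if __name__ == "__main__":
    print(remap_types(["https://identifiers.org/SBO:0000251"]))
```

Types that are not in the table (for example topology terms) are left untouched here; whether they also need remapping is a separate question this sketch does not answer.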
My current workaround in Python: