-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Triggering Run Basic Checks #14
Comments
Unfortunately, all the checking code is currently in the Paratext executable (which is not public). |
I wonder if maybe something could be added to the API to indicate that you want the USFM tokens, but only if/when the checks have been run and passed cleanly. |
@GeoDirk, I'm not sure how clean you need your data or if it would work for what you need, but you could try get the USX first using strict=true to make sure that the data is clean before reading in the tokens. |
Would using USX and Basically I need to produce the equivalent of:
Which is why bad verse tags are problematic. Obtaining the verse text without the extra attributes has been surprisingly easy with parsing through your USFM tokens. We are using this data to send it off to NLP for processing and looking for alignments hence why we need the precision. |
Yeah, it probably won't work to use |
For our purposes, when parsing through the USFM tokens, we are coming across projects that have a bunch of weird things with their verse tags:
I'm sure that we will be finding more and more of these types of USFM errors as we go along. We usually can detect these in our plugin and report them back to the user. However, ideally it would be fantastic if we could trigger the "Run Basic Checks" function and make the user clean up the mess. For what we've encountered thus far, the basic checks would have caught all of these issues and then get back a report on what is bad. This feature would be a new enhancement to the API.
Alternatively, you all probably have a standard library out there that could look at the USFM and do the checks. Anything like that out there in your public libraries?
The text was updated successfully, but these errors were encountered: