-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cleaning rdf:type range in the enriched STO #1
Comments
To meet this requirement, there should be something to compare to. The ontology could be one and maybe instances could be compared if they are correct instantiations of a given class in the ontology. In case that this occurs, these classes should be removed from the full chain. |
The only solution that came to my mind was to narrow down the classes for each standard. That is - to exclude all super classes and leave only those which are at the bottom level of the "DBpedia class tree". Such an approach was implemented here. Applying it to the OPC_UA leads to the following.
after:
Applying it to the enriched ontology yields into this. Such a process removes 429 triples overall. In addition, some of the class chains, like |
The problem, in this case, is that we may be removing facts that are true. E.g., OPC UA can be considered as a |
The reason why I excluded such superclasses as
Some of the classes were removed because their "top-level" super class was This solution might be not the best because it excludes some of the classes which are true, but at least it narrows down to those classes which are easy to check and to unerstand where they come from. |
Can you evaluate what would be the precision only of this example, with and without removing? - Check this |
For the From the perspective of DBpedia, as a system, all of these classes, i.e. |
In the DBpedia, in our area of interest, range of the property
rdf:type
sometimes consists of irrelevant data.Example can be
sto:BBF_TR-069 -- rdf:type -- dbpcy:Rule106652242
.In this case, the unrelated object
dbpcy:Rule106652242
is just a result of implementing the predicaterdfs:subClassOf
todbpcy:Protocol106665108
.Thus, we have the full chain
dbpcy:Protocol106665108 < dbpcy:Rule106652242 < dbpcy:Direction106786629 < dbpcy:Message106598915 < dbpcy:Communication100033020 < dbpcy:Abstraction100002137
in the list of ranges forrdf:type
of thesto:BBF_TR-069
.The question is - should anything from such chains be removed from the enriched ontology?
Another example is
sto:SCOR -- rdf:type -- dbpcy:Person100007846
.This case is easier because such a concept is simply wrong and we can exclude the whole chain with
dbpcy:Person100007846
in it from the enriched ontology.The text was updated successfully, but these errors were encountered: