Skip to content

Dataset openlinksw com lod cloud cache ns prov

Tim L edited this page Mar 9, 2015 · 27 revisions

What is first

What we will cover

We set up a dataset source/openlinksw-com/lod-cloud-cache-ns-prov to look for occurrences of the PROV ontology in http://lod.openlinksw.com. This page discusses the creation of our PROV ontology search dataset.

Retrieval

Summary of what version/retrieve.sh leaves behind:

$ cat manual/prov.csv 
http://www.w3.org/ns/prov#Accept
http://www.w3.org/ns/prov#Activity
...

$ cat manual/class_Activity.rq 
select count(?s) where {?s a <http://www.w3.org/ns/prov#Activity>}

$ cat source/class_Activity.rq.csv
"callret-0"
214

$ cat source/class_Activity.rq.csv.prov.ttl
...
<class_Activity.rq.csv>
   a prov:Entity;
   prov:wasQuotedFrom <http://lod.openlinksw.com/sparql?query=select%20count%28%3fs%29%20where%20%7b%3fs%20a%20%3chttp%3a%2f%2fwww.w3.org%2fns%2fprov%23Activity%3e%7d&output=csv>;
...

$ cat manual/property_wasDerivedFrom.rq 
select count(distinct ?s) where {?s <http://www.w3.org/ns/prov#wasDerivedFrom> []}

$ cat source/property_wasDerivedFrom.rq.csv
"callret-0"
24975410

Preparation

We want to model it something like the following. Although we could use the summary modeling of SPO Balance, we'll just fall in line with VoID.

<http://purl.org/twc/lodcloud/source/openlinksw-com/dataset/lod-cloud-cache-ns-prov/version/2014-Feb-20>
   a void:Dataset;
   prov:specializationOf <http://datahub.io/dataset/openlink-lod-cache>;
   void:classPartition [
        void:class <http://www.w3.org/ns/prov#Activity>;
        void:entities 214;
        dcterms:created "2014-02-20T17:27:45+00:00"^^xsd:dateTime;
        prov:wasDerivedFrom <http://lod.openlinksw.com/sparql?query=select%20count%28%3fs%29%20where%20%7b%3fs%20a%20%3chttp%3a%2f%2fwww.w3.org%2fns%2fprov%23Activity%3e%7d&output=csv>;
   ];
   void:propertyPartition [ 
        void:property <http://www.w3.org/ns/prov#wasDerivedFrom>;
        void:distinctSubjects 24975410;
        dcterms:created "2014-02-20T17:27:45+00:00"^^xsd:dateTime;
        prov:wasDerivedFrom <http://lod.openlinksw.com/sparql?query=select%20count%28distinct%20%3fs%29%20where%20%7b%3fs%20%3chttp%3a%2f%2fwww.w3.org%2fns%2fprov%23wasDerivedFrom%3e%20%5b%5d%7d&output=csv>;
   ];
.

version/prepare.sh results cached on 2014-Feb-20:

Conversion