Section 6.6.1 Automatically deriving datatypes is underspecified #91

chrdebru · 2024-02-16T08:52:10Z

Section "6.6.1 Automatically deriving datatypes" is underspecified. The test cases assume that all values are string literals. The spec does mention automatically deriving datatypes for SQL with no "conversion tables" as specified in R2RML.

Proposal 1:

all values are string literals when they are returned.
only allow data type derivation when using rml-io.
do we return values as they appear or return a string representation of normalized values (JSON allows 3.4e-5 and 3.4e-5, for example)?

Proposal 2:

"copy/paste" R2RML's SQL derivations
plain CSV --> all strings
JSON --> string, integer, double (because of exponents)
XML --> based on XSD if available
And the rest defined in rml-io?

Proposal 2 would then allow 6.6.1 be rewritten as

"rml-core does not support the automatic derivation of data types and mappings should explicitly include data type mappings if one wishes to generate literals other than xsd:string. The generation of derived data types is supported and specified by the rml-io specification."

pmaria · 2024-02-16T09:24:33Z

Strong preference for proposal 2.
My preference would be to introduce separate notes for each reference formulation wherein these details can be described.

chrdebru · 2024-02-16T12:35:29Z

Then that means that some test cases need to be adapted (e.g., some JSON values have integer values that should be transformed as such). And somebody creating those notes, of course.

dachafra · 2024-02-20T07:52:24Z

Strong preference for proposal 2.
My preference would be to introduce separate notes for each reference formulation wherein these details can be described.

+1

DylanVanAssche · 2024-02-20T07:56:34Z

+1 for proposal 2

bjdmeest · 2024-02-22T08:36:14Z

+1 for proposal 2, but in line with the respective specs, JSON has following primitive types:

strings -> xsd string
numbers -> xsd double (or float? I would not suggest to specify integer IF parses as integer OR double IF parses as double, you can override the datatype in RML)
booleans --> xsd bool
null: NULL

for XML: why not take over the datatype as specified in XML (sure it'll be XSD in most cases, but all other cases should also be covered no?)

chrdebru · 2024-02-22T12:58:35Z

The datatype for XML without a schema would be a string and one of the schemas if it can be looked up. XML data types only "exist" in the schema. XML DTDs, another XML schema language, only has character data which are strings. The problem, however, is that you then have two cases:

the XSD is referred to AND available (for download)
basic case when no schema is available

I believe core should support the basic case, and IO could handle both

dachafra · 2024-07-05T07:39:21Z

I feel this issue is already being addressed in the rml-io-registry repository, right? Or do we want a default/basic behaviour in the rml-core? @pmaria

DylanVanAssche assigned andimou and pmaria Feb 29, 2024

pmaria linked a pull request Nov 20, 2024 that will close this issue

add logical iterable and describe natural mappings #144

Open

pmaria mentioned this issue Mar 27, 2024

datatype inference kg-construct/rml-io-registry#5

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Section 6.6.1 Automatically deriving datatypes is underspecified #91

Section 6.6.1 Automatically deriving datatypes is underspecified #91

chrdebru commented Feb 16, 2024

pmaria commented Feb 16, 2024 •

edited

Loading

chrdebru commented Feb 16, 2024 •

edited

Loading

dachafra commented Feb 20, 2024

DylanVanAssche commented Feb 20, 2024

bjdmeest commented Feb 22, 2024

chrdebru commented Feb 22, 2024

dachafra commented Jul 5, 2024

Section 6.6.1 Automatically deriving datatypes is underspecified #91

Section 6.6.1 Automatically deriving datatypes is underspecified #91

Comments

chrdebru commented Feb 16, 2024

pmaria commented Feb 16, 2024 • edited Loading

chrdebru commented Feb 16, 2024 • edited Loading

dachafra commented Feb 20, 2024

DylanVanAssche commented Feb 20, 2024

bjdmeest commented Feb 22, 2024

chrdebru commented Feb 22, 2024

dachafra commented Jul 5, 2024

pmaria commented Feb 16, 2024 •

edited

Loading

chrdebru commented Feb 16, 2024 •

edited

Loading