From 4cc9feb7bb22da086ba56b29fcf1725199f24ef1 Mon Sep 17 00:00:00 2001 From: Thomas Ubensee <34603111+tomuben@users.noreply.github.com> Date: Fri, 15 Nov 2024 08:24:02 -0300 Subject: [PATCH] #991: Added Requirements Doc for Script Options parser (#469) related to exasol/script-languages-release#991 --------- Co-authored-by: Torsten Kilias --- .../docs/script_options_requirments.md | 375 ++++++++++++++++++ 1 file changed, 375 insertions(+) create mode 100644 exaudfclient/docs/script_options_requirments.md diff --git a/exaudfclient/docs/script_options_requirments.md b/exaudfclient/docs/script_options_requirments.md new file mode 100644 index 00000000..a4a59bc0 --- /dev/null +++ b/exaudfclient/docs/script_options_requirments.md @@ -0,0 +1,375 @@ +# System Requirement Specification + +This document outlines the user-centric requirements for the new Exasol UDF Client Script Options parser. It identifies the key features, high-level requirements, and user roles to ensure clarity and alignment with user needs. + +## Roles + +This section lists the roles that will interact with or benefit from the parser system. + +### Database Administrator +Database Administrators manage the database environment and ensure the efficient execution of UDFs within Exasol, configuring and overseeing script execution. + +### Data Scientist +Data Scientists develop and deploy UDFs in languages such as Java, Python, or R to process and analyze data within Exasol. + +### Developer +Developers who integrates the Script Options parser library into other software. + +## Features + +This section lists the key features of the new UDF Client Script Options parser which you would highlight in a product leaflet. + +### General Script Options Parsing +`feat~general-script-options-parsing~1` + +Script Options must be parsed according to a given syntax definition. +Developers can add additional Options in an easy and consistent way. + +Needs: req + +### Java-specific Script Options +`feat~java-specific-script-options~1` + +The parser must process all Java-specific options correctly. + +Needs: req + +## High-level Requirements + +This section details the high-level requirements for the new parser system, linked to the features listed above. + +### General Script Options Parsing +`req~general-script-options-parsing~1` + +The parser must correctly identify and handle Script Options with the syntax `%;`. + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` + +### White Spaces +`req~white-spaces~1` + +The parser must treat the following list of white spaces as token separator: +|======================================================= +| Name | C syntax | ASCII Dec | ASCII Hex | +| tabulator | '\t' | 9 | 0x09 | +| vertical tab | '\v' | 11 | 0x0b | +| form feed | '\f' | 12 | 0x0c | +| space | ' ' | 30 | 0x20 | +|======================================================= + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` + + +### Leading White Spaces Options Parsing +`req~leading-white-spaces-script-options-parsing~1` + +The parser must accept Script Options for lines starting with any number of white space characters before the Script Options. + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` + +Depends: + - `req~white-spaces~1` + +### Ignore anything which is not a Script Option +`req~ignore-none-script-options~1` + +If there is any character in front of a Script Option which is not a white space, the parser must ignore the option(s). + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` + +Depends: + - `req~white-spaces~1` + +### Multiple Line Script Options Parsing +`req~multiple-lines-script-options-parsing~1` + +The parser must recognize Script Options at any line in the given script code. + +Rationale: This is especially important because the `%import` option (see requirement `req~java-import-option-replace-referenced-scripts~1`) might replace options with Java code in the final script code. + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` + +### White Spaces Options Parsing V1 +`req~white-spaces-script-options-parsing-v1~1` + +All white spaces between the option key and option value are to be interpreted as separator. +White spaces between the option value and the terminating ";" are to be removed from the option value. + +Needs: dsn + +Tags: V1 + +Covers: +- `feat~general-script-options-parsing~1` + +Depends: + - `req~white-spaces~1` + +### White Spaces Options Parsing V2 +`req~white-spaces-script-options-parsing-v2~1` + +All white spaces between the option key and option value are to be ignored. The following rules for escape sequences at **the start** of a script option value are to be applied: +- '\ ' => space character +- '\t' => character +- '\f' =>
character +- '\v' => character + +White spaces in the middle of the option value and between the option value and the terminating ";" shall be interpreted as part of the value. + +Rationale: The new version of the parser should be as much as possible backwards compatible to V1, because it will simplify migration of existing UDF's. + +Needs: dsn + +Tags: V2 + +Covers: +- `feat~general-script-options-parsing~1` + +Depends: + - `req~white-spaces~1` + +### Multiple Options +`req~multiple-options-management~1` + +The parser must collect multiple Script Options with the same key. + +Comment: +"Collecting" in this context means that a merging strategy must be applied. We don't want multiple options with the same key to result in simply overwriting or discarding values. Please note that the specific handling depends on the individual option handler. + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` + +### Duplicate Options Management +`req~multiple-options-management~1` + +The parser must collect multiple Script Options with the same key and same/different value. Note: the specific handling depends on the option handler. + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` + +### Script Option Removal +`req~script-option-removal~1` + +The parser handler must remove found known Script Options from the original script code. + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` +- `feat~java-specific-script-options~1` + +### Script Option Unknown Options Behavior V1 +`req~script-option-unknown-options-behvaior-v1~1` + +Unknown script options must remain untouched in the script code. The Java compiler is supposed to throw an error message during the compilation. + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` +- `feat~java-specific-script-options~1` + +Tags: V1 + +### Script Option Unknown Options Behavior V2 +`req~script-option-unknown-options-behvaior-v2~1` + +The parser handler must detect unknown script options and throw an exception if such an options is found. + +Needs: dsn + +Covers: +- `feat~general-script-options-parsing~1` +- `feat~java-specific-script-options~1` + +Tags: V2 + +### Escape Sequence Script Options Parsing +`req~escape-sequence-script-options-parsing~1` + +The following rules for escape sequences at any place within a script option value are to be applied: +- '\n' => character +- '\r' => character +- '\;' => ';' character + +Needs: dsn + +Tags: V2 + +Covers: +- `feat~general-script-options-parsing~1` + + +### Java %scriptclass Option Handling V1 +`req~java-scriptclass-option-handling-v1~1` + +The Java parser handler must correctly identify the first `%scriptclass` option and remove only this single instance from the script code. Any further occurrences of `%scriptclass` option shall stay in the source script code. +The value should be handled according to the [Java specification for identifies](https://docs.oracle.com/javase/specs/jls/se7/html/jls-3.html#jls-3.8). + +Needs: dsn + +Tags: V1 + +Covers: +- `feat~java-specific-script-options~1` + + +### Java %scriptclass Option Handling V2 +`req~java-scriptclass-option-handling-v2~1` + +The Java parser handler must correctly identify the first `%scriptclass` option and remove any additional occurrences of this option within the script code. +The value should be handled according to the [Java specification for identifies](https://docs.oracle.com/javase/specs/jls/se7/html/jls-3.html#jls-3.8). + +Needs: dsn + +Tags: V2 + +Covers: +- `feat~java-specific-script-options~1` + +### Java %jar Option Handling Multiple Options +`req~java-jar-option-handling-multiple-options~1` + +The Java parser handler must find multiple `%jar` options. The values are to be interpreted as the Java CLASSPATH: `::...:`. The Java parser handler shall split the entries by the colon character. +Compare [OpenJdk implementation](https://github.com/AdoptOpenJDK/openjdk-jdk11/blob/19fb8f93c59dfd791f62d41f332db9e306bc1422/src/java.base/share/classes/jdk/internal/loader/URLClassPath.java#L174) of parsing the classpath. + +Covers: +- `feat~java-specific-script-options~1` + +### Java %jar Option Handling V1 +`req~java-jar-option-handling-v1~1` + +The Java parser handler shall unify duplicated files and order the result of all `%jar` options alphabetically. + +Needs: dsn + +Tags: V1 + +Covers: +- `feat~java-specific-script-options~1` + +Depends: +- `req~java-jar-option-handling-multiple-options~1` + +### Java %jar Option Handling V2 +`req~java-jar-option-handling-v2~1` + +The Java parser handler must keep duplicates. The order of the entries must not change. + +Needs: dsn + +Tags: V2 + +Covers: +- `feat~java-specific-script-options~1` + +Depends: +- `req~java-jar-option-handling-multiple-options~1` + +### Java %jar Option Trailing White Space Handling +`req~java-jar-option-trailing-white-space-handling~1` + +The Java parser handler must remove trailing white spaces for `%jar` option values if they are not part of the escape sequence '\ '. Escape sequences at the end of a found `%jar` option of the form `\ ` must be replaced with ' '. +This approach provides backwards compatibility for most existing UDF's from customers. + +Needs: dsn + +Tags: V2 + +Covers: +- `feat~java-specific-script-options~1` + +Depends: +- `req~white-spaces-script-options-parsing-v2~1` + +### Java %jvmoption Handling +`req~java-jvmoption-handling~1` + +The Java parser handler must find multiple `%jvmoption` options, allowing duplicates and maintaining order. + +Needs: dsn + +Covers: +- `feat~java-specific-script-options~1` + +### Java %import Option Replace Referenced Sripts +`req~java-import-option-replace-referenced-scripts~1` + +For each found `%import` option, the Java parser handler must request and replace the referenced scripts recursively. This means, if the referenced scripts contain also `%import` options, the implementation must replace those, too. + +Needs: dsn + +Covers: +- `feat~java-specific-script-options~1` + +### Java %import Option Referenced Sripts Name +`req~java-import-option-referenced-scripts-name~1` + +The referenced script name should be handled according to the [Exasol SQL identifier specification](https://docs.exasol.com/db/latest/sql_references/basiclanguageelements.htm#SQLidentifier). + +Needs: dsn + +Covers: +- `feat~java-specific-script-options~1` + +### Java %import Option Handling V1 +`req~java-import-option-handling-v1~1` + +For each found `%import` option, the Java parser handler must handle nested Script Options appropriately: +1. `%scriptclass` option must be ignored. +2. All other options must be handled as if they were part of the source script. +3. Already imported scripts must not be imported again, but the `%import` statement must be removed + +Needs: dsn + +Tags: V1 + +Covers: +- `feat~java-specific-script-options~1` + +### Java %import Option Handling V2 +`req~java-import-option-handling-v2~1` + +For each found `%import` option, the Java parser handler must handle nested Script Options appropriately: +1. `%scriptclass` option must be ignored, but removed from the script code. +2. All other options must be handled as if they were part of the source script. +3. Already imported scripts must not be imported again, but the `%import` statement must be removed + +Needs: dsn + +Tags: V2 + +Covers: +- `feat~java-specific-script-options~1` + +### Existing Parser Library License +`req~existing-parser-library-license~1` + +Developers must be able to use the parser in open source and closed source projects. + +Needs: dsn + +Tags: V2 + +Covers: +- `feat~general-script-options-parsing~1` +