Evaluate assert/discriminator expressions after groupContent: WIP #987

jadams-tresys · 2023-03-13T19:23:54Z

According to section 9.5 of the DFDL spec, asserts and discriminators with expressions should be processed after the content of their enclosing sequence, group, or choice. Previously, these expressions were always processed before the content.

This commit also moves the setVariable expression evaluation to the correct place, which is before the enclosing group.

DAFFODIL-1971, DAFFODIL-1590
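As a rough illustration of the ordering described above, here is a toy model. It is not Daffodil code: `StatementOrderSketch`, `parseGroup`, and `trace` are hypothetical names used only to show setVariable running first, then the group content, then the expression asserts and discriminators.

```scala
// Toy model only; none of these names exist in Daffodil.
object StatementOrderSketch {

  // Runs the three phases in the order this PR aims for.
  def parseGroup(
    setVariables: () => Unit,
    groupContent: () => Unit,
    expressionStatements: () => Unit
  ): Unit = {
    setVariables()          // setVariable expressions: before the group
    groupContent()          // the sequence/choice/group content itself
    expressionStatements()  // expression asserts and discriminators: after
  }

  // Records the order in which the phases ran.
  def trace(): Vector[String] = {
    val log = scala.collection.mutable.ArrayBuffer.empty[String]
    def record(phase: String): Unit = { log += phase; () }
    parseGroup(
      () => record("setVariable"),
      () => record("group content"),
      () => record("assert/discriminator expressions")
    )
    log.toVector
  }
}
```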

jadams-tresys · 2023-03-13T19:25:02Z

This is primarily WIP as it needs more test coverage and I have been informed that I need to hold off on working on this for a little while.

It is fully functional; it just needs more tests for edge cases surrounding evaluating expressions after the content of sequences, groups, and choices.

Also need to add tests for the setVariable change.


Tests for the following:
- Sequence body succeeds but discriminator fails
- Sequence body fails and discriminator fails
- Sequence body fails but discriminator succeeds and references an element in the partial sequence body infoset
- Sequence body fails and discriminator fails and references an element in the partial sequence body infoset

stevedlawrence

I think the same assert could be evaluated multiple times?

Also, have you run this against the regression suite? I wonder if there are any schemas that rely on the current behavior and could mess up how they use discriminators?

stevedlawrence · 2025-01-06T15:08:34Z

daffodil-core/src/main/scala/org/apache/daffodil/core/dsom/DFDLStatementMixin.scala

@@ -217,7 +217,7 @@ trait ProvidesDFDLStatementMixin extends ThrowsSDE with HasTermCheck {
  final lazy val patternStatements: Seq[DFDLStatement] = patternAsserts ++ patternDiscrims

  final lazy val lowPriorityStatements: Seq[DFDLStatement] =


Any idea where "low priority" comes from? I'm not familiar with that as a DFDL term. Wondering if something like "expressionStatements" would be more clear?

stevedlawrence · 2025-01-06T15:13:19Z

daffodil-core/src/main/scala/org/apache/daffodil/core/grammar/Grammar.scala

+      new SeqCompParser(
+        context.runtimeData,
+        parserChildren.toVector,
+        assertExpressionChildren.toVector,


I believe the parserChildren vector contains the assertExpressionChildren parsers, which I think means the assert parsers will be evaluated twice: once when all the parserChildren are evaluated, and again after the parserChildren finish, when we evaluate the assert parsers. Seems like parserChildren should not contain the assert children?
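One way to picture the separation suggested here is a single split of the children, so that no parser lands in both vectors. This is only a sketch with hypothetical stand-in types (`ToyParser`, `ContentParser`, `AssertExprParser`), not the real Daffodil parser classes:

```scala
// Toy model only; these types stand in for real parsers and are hypothetical.
object PartitionSketch {
  sealed trait ToyParser { def name: String }
  final case class ContentParser(name: String) extends ToyParser
  final case class AssertExprParser(name: String) extends ToyParser

  // Splits the children once and returns (contentParsers, assertParsers).
  // Relative order within each vector is preserved.
  def split(children: Vector[ToyParser]): (Vector[ToyParser], Vector[ToyParser]) = {
    val (asserts, content) = children.partition(_.isInstanceOf[AssertExprParser])
    (content, asserts)
  }
}
```

With the two vectors disjoint, the content parsers can run first and the assert parsers exactly once afterwards.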

stevedlawrence · 2025-01-06T15:20:09Z

daffodil-runtime1/src/main/scala/org/apache/daffodil/runtime1/processors/parsers/Parser.scala

+          testAssert.foreach { ap =>
+            pstate.withTempSuccess(ap.parse1)
+          }
+        }


Ah, I guess I now see why it's okay for childrenParsers to also include the assertion parsers. If all the non-assert parsers succeed, then we don't evaluate the assertion parsers a second time.

However, I think there is still an issue. Say all the non-assertion parsers succeed, then we start evaluating the assertion parsers, and assume one of them fails. The pstate.processorStatus ne Success check will then be true, and we'll evaluate all the assertion parsers, including ones we've already evaluated.

So I think this approach still feels like it needs a tweak. Almost feels like the assert parsers need to be completely separate from the other parsers or something.

stevedlawrence · 2025-01-06T15:29:28Z

daffodil-runtime1/src/main/scala/org/apache/daffodil/runtime1/processors/parsers/PState.scala

+    setSuccess()
+    func(this)
+    if (processorStatus eq Success)
+      _processorStatus = priorProcessorStatus


Conditionally resetting back to the previous status feels like it might be confusing. If this is temporarily ignoring the status, it should always reset back to whatever the status was before. Maybe the way this wants to work is that it returns the status of func, and the caller can choose to do with it whatever they want? Something more like:

    def withTempSuccess(func: (PState) => Unit): ProcessorResult = {
      val priorStatus = processorStatus
      setSuccess()
      func(this)
      val funcStatus = processorStatus
      _processorStatus = priorStatus
      funcStatus
    }

Also, is there any value in passing in a lambda? An alternative would be to just pass in a Parser, and then instead of calling func(this), it could call parser.parse1(this). That ensures that this is always called with a Parser instead of a function that just happens to accept a PState, which is probably safer?
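To make the suggestion concrete, here is a self-contained toy version that takes a parser rather than a lambda and always restores the prior status. `ToyState`, `ToyParser`, `Status`, and `Failure` are hypothetical stand-ins, not the real PState, Parser, or ProcessorResult types:

```scala
// Toy model only; these types are simplified stand-ins and are hypothetical.
object TempSuccessSketch {
  sealed trait Status
  case object Success extends Status
  final case class Failure(reason: String) extends Status

  trait ToyParser { def parse1(state: ToyState): Unit }

  final class ToyState(initial: Status) {
    private var _status: Status = initial
    def status: Status = _status
    def setFailed(reason: String): Unit = _status = Failure(reason)

    // Runs the parser as if the state were successful, then unconditionally
    // restores the prior status and returns the parser's own result.
    def withTempSuccess(parser: ToyParser): Status = {
      val prior = _status
      _status = Success
      parser.parse1(this)
      val result = _status
      _status = prior
      result
    }
  }

  // Two trivial parsers for trying the method out.
  val passing: ToyParser = new ToyParser { def parse1(s: ToyState): Unit = () }
  val failing: ToyParser = new ToyParser {
    def parse1(s: ToyState): Unit = s.setFailed("assert failed")
  }
}
```

In this shape the caller decides what to do with the returned status, and the state itself is left exactly as it was before the call.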

stevedlawrence · 2025-01-06T15:37:49Z

daffodil-runtime1/src/main/scala/org/apache/daffodil/runtime1/processors/parsers/Parser.scala

-      if (pstate.processorStatus ne Success)
+      if (pstate.processorStatus ne Success) {
+        if (testAssert.nonEmpty) {
+          testAssert.foreach { ap =>


Just had another thought about this: this will evaluate all assertions even if one fails. Is that the correct behavior, or should it stop after the first assertion failure? Seems like if one assertion fails it shouldn't continue to evaluate other assertions?
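If stopping at the first failure turns out to be the right behavior, a lazy traversal would do it. The sketch below is a toy model: `ToyAssert` is a hypothetical stand-in that returns None on success or Some(message) on failure, and `demo` exists only to count how many assertions actually ran.

```scala
// Toy model only; ToyAssert is a hypothetical stand-in for an assert parser.
object FirstFailureSketch {
  type ToyAssert = () => Option[String]

  // The iterator is lazy, so evaluation stops at the first failure found.
  def firstFailure(asserts: Seq[ToyAssert]): Option[String] =
    asserts.iterator.map(a => a()).collectFirst { case Some(msg) => msg }

  // Returns the first failure and the number of assertions that were run.
  def demo(): (Option[String], Int) = {
    var calls = 0
    val ok: ToyAssert = () => { calls += 1; None }
    val bad: ToyAssert = () => { calls += 1; Some("assertion failed") }
    val result = firstFailure(Seq(ok, bad, ok))
    (result, calls)
  }
}
```

By contrast, a plain foreach over the same sequence would run all three assertions.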

stevedlawrence · 2025-01-06T15:47:33Z

daffodil-core/src/main/scala/org/apache/daffodil/core/grammar/Grammar.scala

@@ -89,10 +90,19 @@ class SeqComp private (context: SchemaComponent, children: Seq[Gram])
      _.isInstanceOf[NadaParser]
    }

+  lazy val assertExpressionChildren = parserChildren.filter {
+    _.isInstanceOf[AssertExpressionEvaluationParser]


Reading the spec some more, it says:

an attempt to evaluate a discriminator MUST be made even if preceding statements or the parse of the schema component ended in a Processing Error.

I think this implies asserts should not be evaluated if the preceding statements fail? Which kind of makes sense, since an assert only causes backtracking and does not discriminate points of uncertainty.

So it feels like the logic needs to differentiate between asserts and discriminators? Both should be evaluated at the end, but only discriminators should be evaluated if the prior parses fail?
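Under that reading of the spec sentence (which is an interpretation, not something confirmed here), the selection logic might look like the toy sketch below. `Stmt`, `Kind`, and `toAttempt` are hypothetical names, not Daffodil APIs:

```scala
// Toy model only; encodes one possible reading of the quoted spec sentence.
object AssertVsDiscriminatorSketch {
  sealed trait Kind
  case object Assert extends Kind
  case object Discriminator extends Kind
  final case class Stmt(kind: Kind, name: String)

  // After the content has been parsed: if it succeeded, attempt every
  // expression statement; if it failed, attempt only the discriminators.
  def toAttempt(stmts: Seq[Stmt], contentSucceeded: Boolean): Seq[Stmt] =
    if (contentSucceeded) stmts
    else stmts.filter(_.kind == Discriminator)
}
```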

Another question: are discriminators always evaluated before asserts? Or are they evaluated depending on the order defined in the DFDL schema? For example, if you have:

    <xs:appinfo source="http://www.ogf.org/dfdl/">
      <dfdl:assert test="..." />
      <dfdl:discriminator test="..." />
      <dfdl:assert test="..." />
    </xs:appinfo>

Should that evaluate the first assert, then the discriminator, then the second assert? I don't know if the spec clarifies that or if it's implied, but I think we always evaluate asserts before discrims regardless of how they are defined in the schema?

stevedlawrence marked this pull request as draft September 18, 2024 11:27

Josh Adams added 3 commits January 3, 2025 10:28

Fix formatting

9f0b750

jadams-tresys force-pushed the DAFFODIL-1971 branch from 94fe3ae to ef3ec79 Compare January 6, 2025 14:51

jadams-tresys mentioned this pull request Jan 6, 2025

Reorder DFDL statement processing to match spec #1369

Open

jadams-tresys requested review from mbeckerle and stevedlawrence January 6, 2025 14:55

stevedlawrence requested changes Jan 6, 2025


stevedlawrence reviewed Jan 6, 2025
