Parser microoptimizations #2246

kyri-petrou · 2024-05-23T15:36:44Z

I was implementing a parser using fastparse today at $WORK and found out that CharsWhileIn("xx", minChars) is much more efficient than using .repX. With these changes we get a ~5-10% performance increase in our ParserBenchmark.

Note that I also changed the fullIntrospectionQuery val to strip the margins of the query. I realised that the benchmark was weighted too heavily towards the whitespace parser.
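To illustrate the CharsWhileIn point above, here is a minimal sketch and not the code from this PR. It assumes fastparse is on the classpath; the object name and the two digit parsers are hypothetical. Both parsers accept the same input; the difference is that the first runs a one-character parser in a loop while the second scans the whole run in one step.

```scala
import fastparse._

object CharsWhileInSketch {
  // Per-character repetition: CharIn matches a single character and
  // repX(1) repeats it at least once, without skipping whitespace.
  def digitsRep(implicit ctx: P[Any]): P[String] =
    P(CharIn("0-9").repX(1).!)

  // Bulk scan: CharsWhileIn consumes the whole run of matching characters
  // in one parser and requires at least 1 of them.
  def digitsBulk(implicit ctx: P[Any]): P[String] =
    P(CharsWhileIn("0-9", 1).!)

  def main(args: Array[String]): Unit = {
    val input = "12345abc"
    // Both parsers should capture the same leading run of digits.
    val Parsed.Success(a, _) = parse(input, digitsRep(_))
    val Parsed.Success(b, _) = parse(input, digitsBulk(_))
    assert(a == b)
    println(a)
  }
}
```

Any speed difference between the two depends on the grammar and the input, so it is worth measuring in a benchmark rather than assuming.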

ghostdogpr · 2024-05-24T00:08:13Z

core/src/main/scala/caliban/parsing/parsers/Parsers.scala

@@ -348,7 +348,7 @@ object Parsers extends SelectionParsers {
    schemaExtension | typeExtension

  def definition(implicit ev: P[Any]): P[Definition] =
-    executableDefinition | typeSystemDefinition | typeSystemExtension
+    typeSystemDefinition | typeSystemExtension


This doesn't match the spec definition (keeping the code close to the spec is good for maintainability). How about changing document instead?

I reverted the change. Tbh, it only really makes a difference for invalid queries.

But how about changing document to (Start ~ definition.rep ~ End)? I agree with the fix, just wanted to do it somewhere else 😄

document is currently this:

((Start ~ executableDefinition.rep ~ End) | (Start ~ definition.rep ~ End)).map(seq => ParsedDocument(seq.toList))

The reason for having Start ~ executableDefinition.rep ~ End separately is that when we reach the end of the file after executableDefinition has successfully parsed (which is the most common thing we parse), we shortcut to the end and don't try typeSystemDefinition prior to exiting. I'm not sure if it's even possible, but if there is a typeSystem definition in the query, we'll fall back to the full parser.

I actually realised that the "fix" I did earlier wouldn't work for cases where there is a mix of them, although I don't know if that's a valid query 🤔. If so, I should add a test for it.
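The fast-path-then-fallback shape of document described above can be sketched with a toy grammar. This is an illustration under stated assumptions, not Caliban's parser: the single-character rules "q" and "t" are hypothetical stand-ins for executable and type system definitions, and only the structure of document mirrors the line quoted above.

```scala
import fastparse._, NoWhitespace._

object DocumentShortcutSketch {
  // Hypothetical stand-ins for the real rules: one character per definition.
  def executableDefinition(implicit ctx: P[Any]): P[String] = P(CharIn("q").!)
  def typeSystemDefinition(implicit ctx: P[Any]): P[String] = P(CharIn("t").!)

  // The full rule still lists every alternative, as in the spec.
  def definition(implicit ctx: P[Any]): P[String] =
    P(executableDefinition | typeSystemDefinition)

  // Fast path first: for executable-only input, End is reached without ever
  // trying typeSystemDefinition. If the fast path fails, the alternative
  // backtracks to the start and retries with the full definition rule.
  def document(implicit ctx: P[Any]): P[List[String]] =
    P((Start ~ executableDefinition.rep ~ End) | (Start ~ definition.rep ~ End))
      .map(_.toList)

  def main(args: Array[String]): Unit = {
    // Executable-only input is handled by the first branch.
    println(parse("qq", document(_)))
    // Mixed input fails the first branch at "t" and is re-parsed by the second.
    println(parse("qtq", document(_)))
  }
}
```

In this toy version a mixed input still parses, because the fallback branch uses definition, which includes executableDefinition. The cost is that the leading executable definitions are parsed twice.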

Ah, didn't know we were trying to be clever 🤣

benchmarks/src/main/scala/caliban/ParserBenchmark.scala

kyri-petrou added 3 commits May 23, 2024 17:21

Parser micro-optimizations

c86d2aa

Use flatMap instead of positive-lookahead

2d24ffb

Add back nowarn annotations

9a0bea5

ghostdogpr reviewed May 24, 2024

Revert change to definition

33a6512

plokhotnyuk reviewed May 24, 2024

benchmarks/src/main/scala/caliban/ParserBenchmark.scala (outdated)

Remove blackhole usage from parser benchmarks

fbc3536

ghostdogpr approved these changes May 24, 2024

kyri-petrou merged commit c849e0e into series/2.x May 24, 2024
11 checks passed

kyri-petrou deleted the parser-microoptimizations branch May 24, 2024 09:24
