
avoid copy when encoding request body #2349

Merged: 2 commits into PostgREST:main on Sep 19, 2022

Conversation

@robx (Contributor) commented on Jun 24, 2022

(Updated after the merge of #2467)

This updates hasql and switches to the new lazy JSON bytes encoder, saving one copy of the request body.
After the merge of the libpq change in #2467, this is now essentially what #2333 was.
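Roughly, the change amounts to swapping the request-body parameter encoder. The following is a non-runnable sketch, not the PR's actual diff: `jsonBytesLazy` is the encoder variant this PR refers to, `param`/`nonNullable` are from hasql's `Hasql.Encoders` module, and the exact wiring is an assumption.

```haskell
import qualified Data.ByteString.Lazy as BL
import qualified Hasql.Encoders as HE

-- Before (conceptually): the lazy request body was flattened into a
-- strict ByteString (one full copy) and sent as a text-format
-- "unknown" parameter, which also had to be zero-terminated.
--
-- After: the body stays a lazy ByteString and is passed through the
-- binary-format "json" encoder chunk by chunk.
bodyParam :: HE.Params BL.ByteString
bodyParam = HE.param (HE.nonNullable HE.jsonBytesLazy)
```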

@robx (Contributor, Author) commented on Jun 24, 2022

Memory test diff (`<` main, `>` #2333, `+` this PR):

```
< ok 1 - POST /rpc/leak?columns=blob: with a json key of 1M the memory usage(15,605,584 bytes) is less than 16M
> ok 1 - POST /rpc/leak?columns=blob: with a json key of 1M the memory usage(14,662,232 bytes) is less than 16M
+ ok 1 - POST /rpc/leak?columns=blob: with a json key of 1M the memory usage(11,594,920 bytes) is less than 16M
< ok 2 - POST /leak?columns=blob: with a json key of 1M the memory usage(15,538,336 bytes) is less than 16M
> ok 2 - POST /leak?columns=blob: with a json key of 1M the memory usage(14,541,408 bytes) is less than 16M
+ ok 2 - POST /leak?columns=blob: with a json key of 1M the memory usage(11,509,712 bytes) is less than 16M
< ok 3 - PATCH /leak?id=eq.1&columns=blob: with a json key of 1M the memory usage(15,585,384 bytes) is less than 16M
> ok 3 - PATCH /leak?id=eq.1&columns=blob: with a json key of 1M the memory usage(14,601,496 bytes) is less than 16M
+ ok 3 - PATCH /leak?id=eq.1&columns=blob: with a json key of 1M the memory usage(11,561,088 bytes) is less than 16M
< ok 4 - POST /rpc/leak?columns=blob: with a json key of 10M the memory usage(42,927,272 bytes) is less than 44M
> ok 4 - POST /rpc/leak?columns=blob: with a json key of 10M the memory usage(32,960,632 bytes) is less than 44M
+ ok 4 - POST /rpc/leak?columns=blob: with a json key of 10M the memory usage(20,980,360 bytes) is less than 44M
< ok 5 - POST /leak?columns=blob: with a json key of 10M the memory usage(42,847,232 bytes) is less than 44M
> ok 5 - POST /leak?columns=blob: with a json key of 10M the memory usage(32,893,504 bytes) is less than 44M
+ ok 5 - POST /leak?columns=blob: with a json key of 10M the memory usage(20,897,896 bytes) is less than 44M
< ok 6 - PATCH /leak?id=eq.1&columns=blob: with a json key of 10M the memory usage(42,900,040 bytes) is less than 44M
> ok 6 - PATCH /leak?id=eq.1&columns=blob: with a json key of 10M the memory usage(32,947,056 bytes) is less than 44M
+ ok 6 - PATCH /leak?id=eq.1&columns=blob: with a json key of 10M the memory usage(20,933,576 bytes) is less than 44M
< ok 7 - POST /rpc/leak?columns=blob: with a json key of 50M the memory usage(164,339,792 bytes) is less than 172M
> ok 7 - POST /rpc/leak?columns=blob: with a json key of 50M the memory usage(114,496,360 bytes) is less than 172M
+ ok 7 - POST /rpc/leak?columns=blob: with a json key of 50M the memory usage(62,643,264 bytes) is less than 172M
< ok 8 - POST /leak?columns=blob: with a json key of 50M the memory usage(164,227,224 bytes) is less than 172M
> ok 8 - POST /leak?columns=blob: with a json key of 50M the memory usage(114,421,840 bytes) is less than 172M
+ ok 8 - POST /leak?columns=blob: with a json key of 50M the memory usage(62,568,984 bytes) is less than 172M
< ok 9 - PATCH /leak?id=eq.1&columns=blob: with a json key of 50M the memory usage(164,282,832 bytes) is less than 172M
> ok 9 - PATCH /leak?id=eq.1&columns=blob: with a json key of 50M the memory usage(114,479,848 bytes) is less than 172M
+ ok 9 - PATCH /leak?id=eq.1&columns=blob: with a json key of 50M the memory usage(62,614,928 bytes) is less than 172M
< ok 10 - POST /perf_articles?columns=id,body: with a json payload of 32K that has 1000 array values the memory usage(12,605,744 bytes) is less than 14M
> ok 10 - POST /perf_articles?columns=id,body: with a json payload of 32K that has 1000 array values the memory usage(12,574,616 bytes) is less than 14M
+ ok 10 - POST /perf_articles?columns=id,body: with a json payload of 32K that has 1000 array values the memory usage(10,510,256 bytes) is less than 14M
< ok 11 - POST /perf_articles?columns=id,body: with a json payload of 329K that has 10000 array values the memory usage(13,508,120 bytes) is less than 14M
> ok 11 - POST /perf_articles?columns=id,body: with a json payload of 329K that has 10000 array values the memory usage(13,182,160 bytes) is less than 14M
+ ok 11 - POST /perf_articles?columns=id,body: with a json payload of 329K that has 10000 array values the memory usage(10,820,976 bytes) is less than 14M
< ok 12 - POST /perf_articles?columns=id,body: with a json payload of 3.4M that has 100000 array values the memory usage(22,804,512 bytes) is less than 24M
> ok 12 - POST /perf_articles?columns=id,body: with a json payload of 3.4M that has 100000 array values the memory usage(19,428,296 bytes) is less than 24M
+ ok 12 - POST /perf_articles?columns=id,body: with a json payload of 3.4M that has 100000 array values the memory usage(14,019,096 bytes) is less than 24M
```

@robx (Contributor, Author) commented on Jun 24, 2022

Once again, the load test is not affected in any obvious way.

@steve-chavez (Member) commented on Jun 24, 2022

> < ok 7 - POST /rpc/leak?columns=blob: with a json key of 50M the memory usage(164,339,792 bytes) is less than 172M
> + ok 7 - POST /rpc/leak?columns=blob: with a json key of 50M the memory usage(62,643,264 bytes) is less than 172M

Whoa, that is awesome, a 61% decrease! It's getting close to the payload size itself!

> avoids copies for all the other small parameters we pass on (which should affect more than POST requests)

🔥 Will also affect other HTTP methods.

> Once again, the load test is not affected in any obvious way.

I think this is because we don't include a bulk insert in the load tests. We only have a single-object POST:

```
GET http://postgrest/
Prefer: tx=commit
HEAD http://postgrest/actors?actor=eq.1
Prefer: tx=commit
GET http://postgrest/actors?select=*,roles(*,films(*))
Prefer: tx=commit
POST http://postgrest/films?columns=title
Prefer: tx=rollback
@post.json
PUT http://postgrest/actors?actor=eq.1&columns=name
Prefer: tx=rollback
@put.json
PATCH http://postgrest/actors?actor=eq.1
Prefer: tx=rollback
@patch.json
DELETE http://postgrest/roles
Prefer: tx=rollback
GET http://postgrest/rpc/call_me?name=John
POST http://postgrest/rpc/call_me
@rpc.json
OPTIONS http://postgrest/actors
```

With `post.json` being:

```json
{
  "title": "Workers Leaving The Lumière Factory In Lyon"
}
```

If we include a bulk insert, this change should show improved numbers.

@robx (Contributor, Author) commented on Jun 29, 2022

> Whoa, that is awesome, a 61% decrease! It's getting close to the payload size itself!

Being close to the payload size almost seems a bit too good to be true... By my understanding, we should at least be strictly reading the body into a lazy bytestring, and then copying it to a strict bytestring. But that should mean we need at least 2x the payload size, right?
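That reasoning can be illustrated with a self-contained sketch (only the bytestring library; the ~1M body below is made up):

```haskell
import qualified Data.ByteString as BS
import qualified Data.ByteString.Lazy as BL

-- A ~1M "request body" arriving as many small chunks, the way a WAI
-- server hands it to the application.
body :: BL.ByteString
body = BL.fromChunks (replicate 1024 (BS.replicate 1024 0x7b))

main :: IO ()
main = do
  -- Walking the lazy chunks needs no second full-size buffer:
  print (BL.length body)                 -- 1048576
  -- toStrict allocates one contiguous buffer for the whole body and
  -- copies every chunk into it, so the payload exists twice at once:
  print (BS.length (BL.toStrict body))   -- 1048576
```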

@robx (Contributor, Author) commented on Jun 30, 2022

I pushed an updated version with a milder dependency footprint. Memory results seem similar.

In this version:

- switch from text unknown to json binary encoding for the request body (this means we don't have to deal with zero-termination of the CString)
- use a jsonBytesLazy encoder variant

@steve-chavez (Member) commented

> switch from text unknown to json binary encoding for the request body

Just for my own sanity, the json function doesn't use Aeson at all, right?
(Aeson was horrible for memory usage in previous benchmarks)

Tracing it through: `jsonBytesLazy` -> `PTI.json`, and `jsonBytesLazy` -> `bytea_lazy` -> `lazyBytes`.

Nope, it uses LibPQ.Binary. Nice!

@steve-chavez (Member) commented

> Once again, the load test is not affected in any obvious way.
>
> If we include a bulk insert, this change should show improved numbers.

Just added a bulk insert loadtest on #2358; if you rebase, then I think we should see a change in the numbers.

@robx force-pushed the optimize-unsafe branch from e510a1c to 8e29a43 on July 8, 2022
@robx force-pushed the optimize-unsafe branch 2 times, most recently from fba5b1a to 9f01aa4 on July 11, 2022
@steve-chavez (Member) commented

> Just added a bulk insert loadtest on #2358; if you rebase, then I think we should see a change in the numbers.

Hm, even with the bulk insert addition it seems we're not getting better results; not sure if this is real or if our loadtest setup is at fault.

One idea for later could be hosting an instance on AWS (like we do for our aarch64 builds), using NixOps to deploy the built static binary, and then running a more realistic load test there.

NixOps has been getting some improvements (like using S3 instead of SQLite for its state) and it seems 2.0 is becoming more usable nowadays (ref).

@wolfgangwalther (Member) commented

>> Just added a bulk insert loadtest on #2358; if you rebase, then I think we should see a change in the numbers.
>
> Hm, even with the bulk insert addition it seems we're not getting better results; not sure if this is real or if our loadtest setup is at fault.

I don't think the loadtest is "at fault". It's just not designed for this: it runs a single request at a time, not concurrent access; it isn't constrained on memory; and its data still isn't analyzed separately per request.

Improvements in memory usage are just not expected to improve the loadtest's results.

A full-system load test, with many concurrent requests carrying bigger request bodies at the same time, could show some improvement from this change, so running something like that automatically could indeed help a lot.

@robx changed the title from "experiment: avoid copies when passing parameters to libpq" to "experiment: avoid copies when encoding request body" on Sep 6, 2022
@robx changed the title from "experiment: avoid copies when encoding request body" to "avoid copy when encoding request body" on Sep 8, 2022
@robx marked this pull request as ready for review on September 19, 2022
- switch from an "unknown" parameter in text format to a "json" parameter in
  binary format (no dependency update required)
- use a lazy bytestring "json" encoder (via updated hasql)
@robx mentioned this pull request on Sep 19, 2022
@steve-chavez (Member) left a comment

Excited to try this one 🎉

@robx merged commit 38ecaa4 into PostgREST:main on Sep 19, 2022
@robx deleted the optimize-unsafe branch on September 19, 2022