Nesting/unnesting many/catListTable crashes? #168

tomjaguarpaw · 2022-02-18T08:13:33Z

Consider the following Rel8 program. It produces SQL which crashes.

Query Error: error: could not identify column "f1" in record data type

Is this known/expected?

The ultimate problem is that .f1, .f2, etc. for accessing fields of ROWs don't really work "through" SELECTs.

*Rel8 Data.Int Prelude> putStr $ showQuery $ do { q1 <- many (many (values [1 , 2 :: Expr Int16])); q2 <- catListTable q1; catListTable q2 }
SELECT
CAST("unnest0_9" AS int2) as "anon"
FROM (SELECT
      UNNEST("unnest0_7") as "unnest0_9",
      *
      FROM (SELECT
            (UNNEST(CASE WHEN ("rebind0_5") IS NULL THEN CAST(ARRAY[] AS record[]) ELSE "result0_4" END)).f1 as "unnest0_7",
            *
            FROM (SELECT *
                  FROM
                  (SELECT
                   0) as "T1"
                  LEFT OUTER JOIN
                  (SELECT
                   TRUE as "rebind0_5",
                   *
                   FROM (SELECT
                         *
                         FROM (SELECT
                               ARRAY_AGG("inner0_4") as "result0_4"
                               FROM (SELECT
                                     ROW(CASE WHEN ("rebind0_3") IS NULL THEN CAST(ARRAY[] AS int2[]) ELSE "result0_2" END) as "inner0_4",
                                     *
                                     FROM (SELECT *
                                           FROM
                                           (SELECT
                                            0) as "T1"
                                           LEFT OUTER JOIN
                                           (SELECT
                                            TRUE as "rebind0_3",
                                            *
                                            FROM (SELECT
                                                  *
                                                  FROM (SELECT
                                                        ARRAY_AGG("inner0_2") as "result0_2"
                                                        FROM (SELECT
                                                              "values0_1" as "inner0_2",
                                                              *
                                                              FROM (SELECT
                                                                    *
                                                                    FROM (SELECT "column1" as "values0_1"
                                                                          FROM
                                                                          (VALUES
                                                                           (CAST(1 AS int2)),
                                                                           (CAST(2 AS int2))) as "V") as "T1") as "T1") as "T1"
                                                        GROUP BY COALESCE(0)) as "T1") as "T1") as "T2"
                                           ON
                                           TRUE) as "T1") as "T1"
                               GROUP BY COALESCE(0)) as "T1") as "T1") as "T2"
                  ON
                  TRUE) as "T1") as "T1") as "T1"

ocharles · 2022-02-18T09:41:46Z

What version of PostgreSQL are you on? We somewhat know about this, and afaik it works on newer PostgreSQL versions. On older ones, the fix is to use castTable with many. Can you share the Haskell that produced this crashing query?

tomjaguarpaw · 2022-02-18T09:59:51Z

It fails in every version of Postgres on DB Fiddle. In fact it seems that .f1 syntax for extracting fields of anonymous rows was first supported in v13, yet v13 doesn't support this particular usage (which is a flaw of Postgres I think).

The Haskell is in my post above:

do { q1 <- many (many (values [1 , 2 :: Expr Int16])); q2 <- catListTable q1; catListTable q2 }

ocharles · 2022-02-18T10:03:20Z

Thanks, I missed that this was in GHCi. Can you try changing some `many x` to `many <$> x`? I'll have a play soon myself.

tomjaguarpaw · 2022-02-18T10:03:53Z

Here is a full program that demonstrates the problem (requiring the hasql and tmp-postgres packages). castTable doesn't seem to help, but I'm not sure I'm using it right.

import Rel8
import Data.Int
import Hasql.Statement
import Hasql.Session
import Hasql.Connection
import Database.Postgres.Temp
import Data.Text (Text)

main = Database.Postgres.Temp.with $ \db -> do
  Right conn <- acquire (toConnectionString db)

  flip run conn $ statement () $ select $ do
    q1 <- castTable <$> many (castTable <$> many (values [1 , 2 :: Expr Int16]))
    q2 <- catListTable q1
    catListTable q2

tomjaguarpaw · 2022-02-18T10:04:26Z

> Can you try changing some `many x` to `many <$> x`?

That doesn't seem to type check.

ocharles · 2022-02-18T11:37:02Z

Sorry, I meant changing `many x` to `many $ castTable <$> x`.

tomjaguarpaw · 2022-02-18T11:42:35Z

I made that change, but it still crashes with the same error:

module Main where

import Rel8
import Data.Int
import Hasql.Statement
import Hasql.Session
import Hasql.Connection
import Database.Postgres.Temp

main = Database.Postgres.Temp.with $ \db -> do
  Right conn <- acquire (toConnectionString db)

  flip run conn $ statement () $ select $ do
    q1 <- many (castTable <$> (many (castTable <$> (values [1 , 2 :: Expr Int16]))))
    q2 <- catListTable q1
    catListTable q2

ilyakooo0 · 2023-04-09T11:25:41Z

Is this possibly related to #219?

This is one possible "fix" to #168. With this we can `catListTable` arbitrarily deep trees of `ListTable`s. It comes at a relatively high cost, however. Currently we represent nested arrays with anonymous records. This works reasonably well, except that we can't extract the field from the anonymous record when we need it (PostgreSQL [theoretically](https://www.postgresql.org/docs/13/release-13.html#id-1.11.6.16.5.6) supports `.f1` syntax since PG13 but it only works in very limited situations). But it does mean we can decode the results using Hasql's binary decoders, and ordering works how we expect (`array[row(array[9])] < array[row(array[10])]`). What this PR does is instead represent nested arrays as text. To be able to decode this, we need each `DBType` to supply a text parser in addition to a binary decoder. It also means that ordering is no longer intuitive, because `array[array[9]::text] > array[array[10]::text]`. However, it does mean we can nest `catListTable`s to our heart's content and it will always just work.
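The ordering caveat can be illustrated outside the database. The following is only a rough analogy in plain Haskell, not Rel8 or PostgreSQL code: the names `structural` and `textual` are made up for this sketch, and it assumes the inner arrays are rendered as `{9}` and `{10}`, as PostgreSQL's text output for arrays would give. Haskell compares strings by code point, whereas PostgreSQL uses the column's collation, but for digits both agree.

```haskell
-- Nested lists compared structurally: the inner elements are compared
-- as numbers, so 9 sorts before 10.
structural :: Ordering
structural = compare [[9 :: Int]] [[10]]

-- The same data with each inner array rendered as text. The strings are
-- compared character by character, and '9' sorts after '1', so the
-- result flips.
textual :: Ordering
textual = compare ["{9}"] ["{10}"]
```

Evaluating these in GHCi gives `LT` for `structural` and `GT` for `textual`, which mirrors the difference between the anonymous-record representation and the text representation described above.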

This is another possible "fix" to #168 (as opposed to #242). It doesn't really fix the problem, but it allows us to use two levels of `catListTable` instead of only one. Instead of trying to use Postgres's broken `.f1` syntax, we cast the anonymous record to text, remove the parentheses and quotes and unescape any escaped quotes or backslashes, and then cast the resulting text back to the appropriate type. The reason this only works one level deep is that if the type we cast the text back to is itself an anonymous record, then PostgreSQL doesn't know how to parse the text. It's kind of ugly and hacky but it does work and otherwise maintains the status quo. Comparison operators on nested lists continue to work as before and we don't need to burden `DBType` with parsing nonsense.
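To make the "cast to text, strip, unescape" step concrete, here is a minimal sketch of the same string manipulation in plain Haskell. It is not the SQL that the PR generates. The function names `unwrapRecordText` and `unquote` are hypothetical, and the sketch assumes PostgreSQL's text output convention for composite values: a single-field record is printed as `(field)`, the field is wrapped in double quotes when it contains special characters, and embedded double quotes and backslashes are doubled. NULL fields are not handled.

```haskell
-- | Take the text form of a single-field anonymous record, e.g.
-- ("{1,2}"), and recover the text of the field inside it.
-- Returns Nothing if the input is not wrapped in parentheses or the
-- quoting is malformed.
unwrapRecordText :: String -> Maybe String
unwrapRecordText s = case s of
  '(' : rest
    | not (null rest), last rest == ')' -> unquote (init rest)
  _ -> Nothing

-- | Undo the quoting of a record field. An unquoted field is returned
-- verbatim; a quoted field has its surrounding quotes removed and its
-- doubled quotes and backslash escapes collapsed.
unquote :: String -> Maybe String
unquote ('"' : body) = go body
  where
    go ('"' : '"' : cs) = ('"' :) <$> go cs  -- doubled quote -> one quote
    go ('\\' : c : cs)  = (c :) <$> go cs    -- backslash escape -> escaped char
    go "\""             = Just ""            -- closing quote ends the field
    go ('"' : _)        = Nothing            -- stray quote in the middle
    go (c : cs)         = (c :) <$> go cs    -- ordinary character
    go []               = Nothing            -- missing closing quote
unquote body = Just body
```

For example, `unwrapRecordText "(\"{1,2}\")"` yields `Just "{1,2}"`, which could then be cast back to `int2[]`. If the recovered text is itself the text form of an anonymous record, there is no concrete type to cast it back to, which is the one-level limitation described above.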

shane-circuithub changed the title ~~Nesting/unnesting many/catMaybeTables crashes?~~ Nesting/unnesting many/catListTable crashes? Jun 18, 2023

shane-circuithub mentioned this issue Jun 18, 2023

Support nested catListTable (by representing nested arrays as text) #242

Merged

shane-circuithub mentioned this issue Jun 18, 2023

Support "rank 2" catListTable (by "parsing" anonymous record) #243

Merged
