Skip to content

Latest commit

 

History

History
175 lines (125 loc) · 4.57 KB

catalog-postgresql.md

File metadata and controls

175 lines (125 loc) · 4.57 KB
title slug keyword license
Gravitino connector - PostgreSQL catalog
/trino-connector/catalog-postgresql
gravitino connector trino
Copyright 2023 Datastrato Pvt Ltd. This software is licensed under the Apache License version 2.

The PostgreSQL catalog allows querying and creating tables in an external PostgreSQL database. This can be used to join data between different systems like PostgreSQL and Hive, or between different PostgreSQL instances.

Requirements

To connect to PostgreSQL, you need:

  • PostgreSQL 10.x or higher.
  • Network access from the Trino coordinator and workers to PostgreSQL. Port 5432 is the default port.

Create table

At present, the Gravitino connector only supports basic PostgreSQL table creation statements, which involve fields, null allowances, and comments. However, it does not support advanced features like primary keys, indexes, default values, and auto-increment.

The Gravitino connector does not support CREATE TABLE AS SELECT.

Alter table

Gravitino connector supports the following alter table operations:

  • Rename table
  • Add a column
  • Drop a column
  • Rename a column
  • Change a column type
  • Set a table property

Select

The Gravitino connector supports most SELECT statements, allowing the execution of queries successfully. Currently, it doesn't support certain query optimizations, such as indexing and pushdowns.

Table and Schema properties

PostgreSQL's tables and schemas cannot support properties.

Basic usage examples

You need to do the following steps before you can use the PostgreSQL catalog in Trino through Gravitino.

  • Create a metalake and catalog in Gravitino. Assuming that the metalake name is test and the catalog name is postgresql_test, then you can use the following code to create them in Gravitino:
curl -X POST -H "Content-Type: application/json" \
-d '{
  "name": "test",
  "comment": "comment",
  "properties": {}
}' http://gravitino-host:8090/api/metalakes

curl -X POST -H "Content-Type: application/json" \
-d '{
  "name": "postgresql_test",
  "type": "RELATIONAL",
  "comment": "comment",
  "provider": "jdbc-postgresql",
  "properties": {
    "jdbc-url": "jdbc:postgresql://postgresql-host/mydb",
    "jdbc-user": "<user>",
    "jdbc-password": "<password>",
    "jdbc-database": "mydb",
    "jdbc-driver": "org.postgresql.Driver"
  }
}' http://gravitino-host:8090/api/metalakes/test/catalogs

For more information about the PostgreSQL catalog, please refer to PostgreSQL catalog.

  • Set the value of configuration gravitino.metalake to the metalake you have created, named 'test', and start the Trino container.

Use the Trino CLI to connect to the Trino container and run a query.

Listing all Gravitino managed catalogs:

SHOW CATALOGS;

The results are similar to:

    Catalog
----------------
 gravitino
 jmx
 system
 test.postgresql_test
(4 rows)

Query 20231017_082503_00018_6nt3n, FINISHED, 1 node

The gravitino catalog is a catalog defined By Trino catalog configuration. The test.postgresql_test catalog is the catalog created by you in Gravitino. Other catalogs are regular user-configured Trino catalogs.

Creating tables and schemas

Create a new schema named database_01 in test.postgresql_test catalog.

CREATE SCHEMA "test.postgresql_test".database_01;

Create a new table named table_01 in schema "test.postgresql_test".database_01.

CREATE TABLE "test.postgresql_test".database_01.table_01
(
name varchar,
salary int
);

Writing data

Insert data into the table table_01:

INSERT INTO "test.postgresql_test".database_01.table_01 (name, salary) VALUES ('ice', 12);

Insert data into the table table_01 from select:

INSERT INTO "test.postgresql_test".database_01.table_01 (name, salary) SELECT * FROM "test.postgresql_test".database_01.table_01;

Querying data

Query the table_01 table:

SELECT * FROM "test.postgresql_test".database_01.table_01;

Modify a table

Add a new column age to the table_01 table:

ALTER TABLE "test.postgresql_test".database_01.table_01 ADD COLUMN age int;

Drop a column age from the table_01 table:

ALTER TABLE "test.postgresql_test".database_01.table_01 DROP COLUMN age;

Rename the table_01 table to table_02:

ALTER TABLE "test.postgresql_test".database_01.table_01 RENAME TO "test.postgresql_test".database_01.table_02;

Drop

Drop a schema:

DROP SCHEMA "test.postgresql_test".database_01;

Drop a table:

DROP TABLE "test.postgresql_test".database_01.table_01;