Name		Name	Last commit message	Last commit date
parent directory ..
config		config
ddl		ddl
dml		dml
driver		driver
queries		queries
scratch		scratch
scripts		scripts
src		src
.env		.env
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml
pom.xml		pom.xml

README.md

LDBC SNB Interactive PostgreSQL implementation

PostgreSQL implementation of the LDBC Social Network Benchmark's Interactive workload.

Setup

The recommended environment is that the benchmark scripts (Bash) and the LDBC driver (Java 8) run on the host machine, while the PostgreSQL database runs in a Docker container. Therefore, the requirements are as follows:

Bash
Java 8
Docker 19+
libpq5
the psycopg Python library: scripts/install-dependencies.sh
enough free space in the directory ${POSTGRES_DATA_DIR} (its default value is specified in scripts/vars.sh)

docker-compose

Alternatively, a docker-compose specification is available to start the PostgreSQL container and a container loading the data. This requires docker-compose installed on the host machine. Running PostgreSQL and loading the data can be done by executing:

docker-compose build && docker-compose up

The default environment variables are loaded from .env. Change the POSTGRES_CSV_DIR to point to the data set, e.g.

POSTGRES_CSV_DIR=`pwd`/social-network-sf0.003-bi-composite-merged-fk/

To persist the data by storing the database outside a Docker volume, uncomment the following lines in the docker-compose.yml file:

- type: bind
  source: ${POSTGRES_DATA_DIR}
  target: /var/lib/postgresql/data

Generating and loading the data set

Generating the data set

The PostgreSQL implementation uses the composite-merged-fk CSV layout, with headers and without quoted fields. To generate data that confirms this requirement, run Datagen without any layout or formatting arguments (--explode-* or --format-options).

In Datagen's directory (ldbc_snb_datagen_spark), issue the following commands. We assume that the Datagen project is built and sbt is available.

export SF=desired_scale_factor
export LDBC_SNB_DATAGEN_MAX_MEM=available_memory
export LDBC_SNB_DATAGEN_JAR=$(sbt -batch -error 'print assembly / assemblyOutputPath')

rm -rf out-sf${SF}/graphs/parquet/raw
tools/run.py \
    --cores $(nproc) \
    --memory ${LDBC_SNB_DATAGEN_MAX_MEM} \
    -- \
    --format csv \
    --scale-factor ${SF} \
    --mode bi \
    --output-dir out-sf${SF} \
    --format-options compression=gzip

Configuration

The default configuration of the database (e.g. database name, user, password) is set in the scripts/vars.sh file.

Loading the data set

Set the ${POSTGRES_CSV_DIR} environment variable.
- To use a locally generated data set, set the ${LDBC_SNB_DATAGEN_DIR} and ${SF} environment variables and run:
```
export POSTGRES_CSV_DIR=${LDBC_SNB_DATAGEN_DIR}/out-sf${SF}/graphs/csv/bi/composite-merged-fk/
```
  Or, simply run:
```
. scripts/use-datagen-data-set.sh
```
- To download and use the sample data set, run:
```
scripts/get-sample-data-set.sh
. scripts/use-sample-data-set.sh
```
To start the DBMS, create a database and load the data, run:
```
scripts/load-in-one-step.sh
```

Running the benchmark

To run the scripts of benchmark framework, edit the driver/{create-validation-parameters,validate,benchmark}.properties files, then run their script, one of:
```
driver/create-validation-parameters.sh
driver/validate.sh
driver/benchmark.sh
```

⚠️ The default workload contains updates which change the state of the database. Therefore, the database needs to be reloaded or restored from backup before each run. Use the provided scripts/backup-database.sh and scripts/restore-database.sh scripts to achieve this.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

postgres

postgres

README.md

LDBC SNB Interactive PostgreSQL implementation

Setup

docker-compose

Generating and loading the data set

Generating the data set

Configuration

Loading the data set

Running the benchmark

Files

postgres

Directory actions

More options

Directory actions

More options

Latest commit

History

postgres

Folders and files

parent directory

README.md

LDBC SNB Interactive PostgreSQL implementation

Setup

docker-compose

Generating and loading the data set

Generating the data set

Configuration

Loading the data set

Running the benchmark