This repository contains Python scripts for importing geospatial data into cloud-native data platforms.
This script makes use of the following packages:
- AWS SDK for Python (Boto3) for interacting with AWS services (S3 and Redshift)
- Fiona for reading geospatial data files
- Shapely for writing EWKB geometries
The main steps are:
- Transform the input file to a CSV file with EWKB geometries
- Upload the CSV file to an S3 bucket
- Load the CSV data into Redshift using the COPY command, issued through the Redshift Data API
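A minimal sketch of these three steps, with placeholder file, bucket, cluster, and ARN values, and assuming the input data is in EPSG:4326:

```python
import csv

import boto3
import fiona
from shapely.geometry import shape
from shapely import wkb

# 1. Transform the input file to a CSV file with hex-encoded EWKB geometries.
with fiona.open("input.gpkg") as src, open("output.csv", "w", newline="") as out:
    writer = csv.writer(out)
    for feature in src:
        geom = shape(feature["geometry"])
        # Passing srid embeds the spatial reference in the output, producing EWKB.
        ewkb = wkb.dumps(geom, hex=True, srid=4326)
        writer.writerow(list(feature["properties"].values()) + [ewkb])

# 2. Upload the CSV file to an S3 bucket.
boto3.client("s3").upload_file("output.csv", "my-bucket", "output.csv")

# 3. Load the CSV into Redshift with COPY, issued through the Redshift Data API.
boto3.client("redshift-data").execute_statement(
    ClusterIdentifier="my-cluster",
    Database="my-database",
    SecretArn="arn:aws:secretsmanager:us-east-1:123456789012:secret:my-secret",
    Sql="COPY my_table FROM 's3://my-bucket/output.csv' "
        "IAM_ROLE 'arn:aws:iam::123456789012:role/my-redshift-role' CSV;",
)
```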
You need to install the AWS CLI and configure your Access key ID, Secret access key, and the AWS Region where your Redshift cluster and S3 bucket are located. The Access key ID and Secret access key are used to authorize the S3 upload and the calls to the Redshift Data API.
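For example, the credentials and default Region can be set interactively with:

```
aws configure
```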
The script is built for Python 3. It is recommended to create a virtual environment in the folder where the script is located and install the required Python packages:
```
python3 -m venv /path/to/script/folder
cd /path/to/script/folder
source bin/activate
pip install boto3
pip install fiona
pip install shapely==1.8.5
```
The script can be executed standalone or used as a module from another script/program; see the usage sketch after the parameter table. It requires the following parameters:
| Parameter | Description |
| --- | --- |
| input_file | Input geospatial file. Supported formats: any file format with reading support in Fiona, including Esri Shapefile, GeoPackage, GeoJSON |
| bucket | S3 bucket where the file will be uploaded |
| cluster_identifier | Redshift cluster identifier |
| database | Database where the data will be imported |
| secret_arn | ARN of the secret that provides access to the database |
| redshift_role | ARN of the Redshift role with read access to S3 |
| table_name | Redshift table where the data will be imported. The script will error out if the table already exists |
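When used as a module, a call could look like the sketch below. The module and function names are hypothetical and should be adjusted to the actual script file; the parameter names match the table above:

```python
# Hypothetical module and function names; the parameters match those
# documented above. Replace the example values with your own resources.
from geodata_import import import_geodata

import_geodata(
    input_file="parcels.gpkg",
    bucket="my-geodata-bucket",
    cluster_identifier="my-redshift-cluster",
    database="gis",
    secret_arn="arn:aws:secretsmanager:us-east-1:123456789012:secret:gis-db",
    redshift_role="arn:aws:iam::123456789012:role/redshift-s3-read",
    table_name="parcels",
)
```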