redshift_to_pandas write to S3 #24

Gauravshah · 2018-06-20T14:57:25Z

It would be more efficient if redshift_to_pandas first wrote the query result to S3 and then read it from there.

Currently, redshift_to_pandas reads the rows over the wire through a direct connection to Redshift.

When a large dataset (more than 100 million rows) is downloaded, one Redshift thread stays occupied serving that single user.
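
A minimal sketch of the first half of that idea, assuming Redshift's UNLOAD command is used to write the result to S3. The helper name `build_unload_sql` and its parameters are hypothetical and not part of this library; the bucket prefix and IAM role in the usage note are placeholders.

```python
def build_unload_sql(query: str, s3_prefix: str, iam_role: str) -> str:
    """Build a Redshift UNLOAD statement for `query`.

    Hypothetical helper for illustration only; it is not an existing
    function of this library.
    """
    # UNLOAD takes the SELECT as a quoted string, so single quotes
    # inside the query are doubled to keep the literal intact.
    escaped = query.replace("'", "''")
    return (
        f"UNLOAD ('{escaped}') "
        f"TO '{s3_prefix}' "
        f"IAM_ROLE '{iam_role}' "
        # PARALLEL ON lets each slice write its own gzip-compressed part file.
        "DELIMITER ',' ADDQUOTES ESCAPE GZIP ALLOWOVERWRITE PARALLEL ON"
    )
```

The returned statement could be run through the existing database connection, for example `build_unload_sql("select * from events where state = 'NV'", "s3://my-bucket/tmp/events_", "arn:aws:iam::123456789012:role/unload")`. The part files under that prefix would then be downloaded and concatenated into one DataFrame, which is the step this sketch leaves out.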


yaojiach · 2018-07-08T21:12:53Z

It sounds like a separate concern. If you are using Python >= 3.6, you can try https://github.com/yaojiach/red-panda/blob/master/red_panda/red_panda.py#L664

PabTorre · 2018-07-17T03:45:07Z

agawronski · 2018-07-18T06:33:42Z

Will look into this.
