-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Redshift Loader: Handle missing columns in CSV #1065
Comments
We've run into this issue a few times when upgrading schemas.
And shows this error in
From what I can tell, this is because the events were processed by the RDB transformer prior to the update, and the RDB loader is trying to process them against the new schema. Currently running both the transformer and loader as docker images on EC2 instances. |
We also have this problem. |
This issue is about schema evolutions which add new columns. There is a problem that arises when the data is transformed using the older schema, but attempted to load using the newer schema.
There are a few situations where this problem could arise:
In all those cases, the CSV file might have fewer columns than what is expected by the loader/warehouse.
When this happens, the loader logs contain this message:
If you look up the error in the
stl_load_errors
table, then the error message is:The solution is to always specify the columns in the COPY INTO statement. Currently we load a shredded table like this:
whereas we need it look like this:
...where the column list refer to the columns that we know are present in the CSV file; not the columns that are present in the table.
The text was updated successfully, but these errors were encountered: