-
Notifications
You must be signed in to change notification settings - Fork 124
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for bigquery schema inference #259
Add support for bigquery schema inference #259
Conversation
@jbergeskans why would you list the columns if you do not want to explicitly set them? |
We want to use other documentation features such as description, tests, and constraints. Basically, this allows us to omit the data type field which, when you're using parquet files, isn't needed anyway. |
For me this feels very conflicting, you want to infer the schema automatically but do want to manually add the column names for documentation. I see two problems with this setup:
|
Description & motivation
Fixes #249 to support schema inference on BigQuery.
Currently, if the column names are defined in the sources file but the data types are not, the following SQL will be generated:
In order for BigQuery to infer the schema, the column names and data types needs to be omitted:
This has been achieved by introducing the variable
infer_schema
. When set to true, it will not iterate over the list of columns.Example source file
Checklist