-
Notifications
You must be signed in to change notification settings - Fork 234
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add schema to the default location root #239
Comments
@JCZuurmond I'm super supportive of this change. It might be breaking for some users, so we'll just want to call it out as such in the changelog / release notes, and wait for the next minor version to release — which will be v1.0 :) I don't have a strong preference between the two options you've proposed:
The former is closer to standard Hive partitioning layout, right? If we're already changing the default, we may as well pick an approach that will serve us well for some time to come. |
The @guillesd what is your take on this issue? |
forgot to mention, I would choose: |
Hey chicos! Nice indeed to get this going. I'm not sure which of the two options I'd go for. If we choose your preferred option @JCZuurmond, then it'd look something like this for partitioned tables: |
I think the other option |
@JCZuurmond Ah, that's a really good point. I think you've clinched it for me: |
This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days. |
This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days. |
PR #339 is still open to solve this issue |
Describe the feature
Add the schema to the location root by default.
As the documentation says:
I like this behavior. In the case of Spark, if you decide to materialize your tables as files (which you commonly do) then it might be that two users do not overwrite eachothers table definitions as these have separate schemas (or databases in spark). However, the users do overwrite the underlying data of eachothers tables.
I think this is unexpected behavior, which we can avoid by adding the schema to the location root - by default.
Describe alternatives you've considered
Overwriting the macro myself.
Additional context
The
location-root
macro.Who will this benefit?
Teams where people develop on
dbt-spark
concurrently. Unexpected behavior will happen when you unknowingly overwrite the data of each others tables.Are you interested in contributing this feature?
This is the macro I use:
We could choose something like:
The text was updated successfully, but these errors were encountered: