You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was looking at this project and I must say: it's awesome and something that dbt docs currently is missing.
One thing got in my mind is the question why there's not an option to add the TOP x column values and their distribution? Is there any other reason to not include this in the docs?
Like in this example where you show TOP 2 for example:
Column Name
Top 1 Value
Distribution
Top 2 Value
Distribution
Column 1
Value 1 A
0.50 ("number"/"total")
Value 1 B
0.20 ("number"/"total")
Column 2
Value 2 A
0.50 ("number"/"total")
Value 2 B
0.30 ("number"/"total")
Column 3
Value 3 A
0.10 ("number"/"total")
Value 3 B
0.05 ("number"/"total")
Column 4
Value 4 A
0.10 ("number"/"total"
Value 4 B
0.05 ("number"/"total")
Looking forward to your thoughts!
The text was updated successfully, but these errors were encountered:
@diegodewilde I've thought about adding a "mode" (most common value) profiling metric to the package but never around to implementing it. This proposal expands the mode concept into N most common values and I think it's a good idea.
Just throwing thoughts here:
What would be a sensible default for the number of top values? 1, 2, 3?
How should we name the columns? top_1_value, top_1_value_proportion, top_2_value, top_2_value_distribution, etc?
Is there a better way to display the distributions than the (value, proportion) pairs for each top value?
Hi,
I was looking at this project and I must say: it's awesome and something that dbt docs currently is missing.
One thing got in my mind is the question why there's not an option to add the TOP x column values and their distribution? Is there any other reason to not include this in the docs?
Like in this example where you show TOP 2 for example:
Looking forward to your thoughts!
The text was updated successfully, but these errors were encountered: