Quantile-based discretisation function #1197

JosephBond · 2024-12-12T10:18:17Z

Fluid library function to bin a set of “observations” into a set of (potentially non-uniform) bins.

As a test case, use the new function to compute the 5-95% interpercentile range from the sample we generate from the preprocessing step in explorable-viz/transparent-text#22. This will make up part of the pipeline for the examples, and is needed to provide a data source for the whisker plots on the bar chart bars, and also for computing the appropriate probabilities to use in text like “very likely”.

Python libraries to consider:

pandas.cut (supports custom bins)
pandas.qcut (quantile-based bins)
numpy.digitize (similar but doesn’t require bin labels)

Going forward let’s use Python-inspired names for library functions, to leverage #1139.

See also:

explorable-viz/transparent-text#26

The text was updated successfully, but these errors were encountered:

rolyp · 2024-12-16T08:54:29Z

@JosephBond Added some clarification to this task and renamed from “Calculate Interpercentile Range from empirical distribution”.

rolyp · 2024-12-16T12:41:48Z

@JosephBond It looks like qcut does take an argument that allows you specify the target quantiles, so we could take a similar approach. E.g. something pandas.qcut(xs, q=[0, 0.05, 0.95, 1.0]) but without the named argument syntax. Renamed task again.

JosephBond added the implementation label Dec 12, 2024

JosephBond added this to the transparent-text 0.1 milestone Dec 12, 2024

JosephBond self-assigned this Dec 12, 2024

JosephBond moved this to In Progress in Fluid Dec 12, 2024

JosephBond added this to Fluid Dec 12, 2024

rolyp moved this from In Progress to Planned in Fluid Dec 16, 2024

rolyp changed the title ~~Calculate Interpercentile Range from empirical distribution~~ Bin data set using custom bin sizes Dec 16, 2024

rolyp changed the title ~~Bin data set using custom bin sizes~~ Quantile-based discretisation function with custom quantiles Dec 16, 2024

rolyp changed the title ~~Quantile-based discretisation function with custom quantiles~~ Quantile-based discretisation function Dec 16, 2024

rolyp moved this from Planned to In Progress in Fluid Dec 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quantile-based discretisation function #1197

Quantile-based discretisation function #1197

JosephBond commented Dec 12, 2024 •

edited by rolyp

Loading

rolyp commented Dec 16, 2024 •

edited

Loading

rolyp commented Dec 16, 2024 •

edited

Loading

Quantile-based discretisation function #1197

Quantile-based discretisation function #1197

Comments

JosephBond commented Dec 12, 2024 • edited by rolyp Loading

rolyp commented Dec 16, 2024 • edited Loading

rolyp commented Dec 16, 2024 • edited Loading

JosephBond commented Dec 12, 2024 •

edited by rolyp

Loading

rolyp commented Dec 16, 2024 •

edited

Loading

rolyp commented Dec 16, 2024 •

edited

Loading