v0.2.0 #141
-
I really, really like this 👍 A couple of comments:

In generalisation, it's unclear when filling in one field is enough and when it isn't.

Bug with substrings: if (substring start + substring length) > (length of string), I get an "Anonymization Error".

For the Anonymization Summary, it occurs to me that the distribution of distortions would be useful as well. Right now I see the average and the maximum distortion, but I can't tell whether just a few rows have really high distortion or how the figures come about. Sorting by "Count noise" only helps if there are relatively few rows.

The above also shows that we need unified terminology here: which is it, distortion or noise?
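On the substring bug reported above, one possible behaviour is to clamp the requested range to the end of the string instead of raising an error. The following is only a minimal sketch of that idea, not the application's actual code: the function name `clamped_substring` is hypothetical, and it assumes a 0-based start offset, which the thread does not confirm.

```python
def clamped_substring(text: str, start: int, length: int) -> str:
    """Return at most `length` characters of `text`, beginning at `start`.

    Assumption: `start` is a 0-based offset. A range that runs past the
    end of the string is shortened rather than treated as an error.
    """
    if start < 0 or length < 0:
        raise ValueError("start and length must be non-negative")
    begin = min(start, len(text))          # a start beyond the end yields ""
    end = min(begin + length, len(text))   # never read past the last character
    return text[begin:end]


if __name__ == "__main__":
    # start + length (6 + 10) exceeds the string length (9)
    print(clamped_substring("anonymize", 6, 10))
```

Whether clamping or a clearer validation message is the better fix is a design choice; clamping keeps rows with short values in the output instead of failing the whole run.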
-
In my mind, they are slightly different: noise makes me think of the SD of the generator, and distortion makes me think of the final output value.
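To make that distinction concrete, here is a rough sketch under stated assumptions. It treats "noise" as the standard deviation of a zero-mean Gaussian generator and "distortion" as the absolute difference between the published count and the true count, then summarises the distribution of distortions with quartiles, as asked for earlier in the thread. The Gaussian model, the function names, and the sample counts are all illustrative assumptions and do not describe how the tool actually anonymizes.

```python
import random
import statistics


def simulate_distortions(true_counts, noise_sd, seed=0):
    """Add Gaussian noise with SD `noise_sd` to each count (assumed model)
    and return the resulting absolute distortions of the output values."""
    rng = random.Random(seed)
    distortions = []
    for count in true_counts:
        noisy = max(0, round(count + rng.gauss(0, noise_sd)))
        distortions.append(abs(noisy - count))
    return distortions


def summarize(distortions):
    """Report quartiles next to the mean and maximum.

    The "inclusive" method keeps the quartiles within the observed range.
    """
    q1, median, q3 = statistics.quantiles(distortions, n=4, method="inclusive")
    return {
        "mean": statistics.mean(distortions),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": max(distortions),
    }


if __name__ == "__main__":
    counts = [12, 40, 7, 130, 55, 9, 61, 23]  # made-up example counts
    print(summarize(simulate_distortions(counts, noise_sd=2.0)))
```

A median well below the maximum would indicate that only a few rows carry high distortion, which the average and maximum alone cannot show.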
-
Sorting is not very relevant because we only show a subset of the output values (even once we make the summary accurate). To improve sorting utility, we can either:
-
What do you think about changing "Substring start / Substring length" to something shorter, like "Text start / Length"?
-
Setup packages are NOT signed.
Version 0.2.0
This discussion was created from the release v0.2.0.