Draft bugfix: panic null value in dataset #91
base: main
Conversation
Proposal for additional data processing step
I propose adding an additional data processing step before constructing the tree in the SIGO workflow. Currently, the workflow follows this pattern:
- Input -> Tree Construction -> Tree
- Tree -> Aggregation -> Output
I suggest the following modification:
- Input -> Data Validation -> Validated Table
- Validated Table -> Tree Construction -> Tree
- Tree -> Aggregation -> Output
This adjustment offers several advantages:
- Isolation of Error Handling: By separating the data validation step, we ensure that the rest of the processing is not cluttered with error handling logic. This promotes cleaner and more focused code for each stage of the workflow.
- Early Pre-processing: Introducing a pre-processing step allows us to address data integrity issues upfront, improving the overall quality of the data fed into subsequent stages of the workflow.
I believe this change will enhance the maintainability and robustness of the SIGO system. Looking forward to your feedback and discussion on this proposal.
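To make the proposal concrete, here is a minimal sketch of what the validation stage could look like. The `Record`, `ValidatedTable`, and `Validate` names are illustrative assumptions, not sigo's actual API; the sketch assumes quasi-identifiers are expected to be non-null float64 values, which is the case behind the panic this PR addresses.

```go
package sigo

import "fmt"

// Record is an illustrative stand-in for one row of the input table.
type Record map[string]interface{}

// ValidatedTable wraps records that are guaranteed to carry non-null
// float64 quasi-identifiers, so tree construction and aggregation
// need no error handling of their own.
type ValidatedTable struct {
	records []Record
	qi      []string
}

// Validate is the proposed new stage: it fails fast on bad input
// instead of letting tree construction panic on a null value.
func Validate(records []Record, qi []string) (ValidatedTable, error) {
	for i, rec := range records {
		for _, key := range qi {
			v, present := rec[key]
			if !present || v == nil {
				return ValidatedTable{}, fmt.Errorf("record %d: null value for quasi-identifier %q", i, key)
			}
			if _, ok := v.(float64); !ok {
				return ValidatedTable{}, fmt.Errorf("record %d: quasi-identifier %q is %T, want float64", i, key, v)
			}
		}
	}
	return ValidatedTable{records: records, qi: qi}, nil
}
```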
pkg/sigo/kdtree.go
Outdated
less := func(i, j int) bool {
	valueI, err := n.cluster[i].QuasiIdentifer()
	if err != nil {
		// Store the error in the global variable
please use English
Yes, adding a Data Validation step is a better way to handle errors. I will revert to the first commit and keep only the venom test.
I propose adding a new interface dataValidator in case we use types other than float64 in the future. By default we would use a float64DataValidator, applied right after the source is created to validate the input data; then the other steps of the workflow can stay focused on their own logic.
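A minimal sketch of how that interface could look, using the dataValidator and float64DataValidator names from this comment; the Validate method signature and error messages are illustrative assumptions, not a final design:

```go
package sigo

import "fmt"

// dataValidator checks that a raw quasi-identifier value has the
// type the rest of the workflow expects.
type dataValidator interface {
	Validate(value interface{}) error
}

// float64DataValidator is the proposed default: it accepts only
// non-null float64 values, rejecting the inputs that currently
// make tree construction panic.
type float64DataValidator struct{}

func (float64DataValidator) Validate(value interface{}) error {
	if value == nil {
		return fmt.Errorf("null value in dataset")
	}
	if _, ok := value.(float64); !ok {
		return fmt.Errorf("expected float64, got %T", value)
	}
	return nil
}
```

Running every value through the validator once, right after the source is created, would let later stages such as the kd-tree comparator drop their own error handling entirely.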