feat(#184): add source column on data_record #185

1yuv · 2024-11-25T19:06:08Z

Description

Add source column on data_record table.
Closes #184

License

The software is provided under AGPL-3.0. Contributions to this project are accepted under the same license.

lorerod

Thanks, @1yuv, for these changes. I recommend ensuring the new source field (or fields) is properly tested to verify that the new logic works as expected in all cases. Feel free to reach out if you need guidance on implementing these updates. Looking forward to seeing the final version!

lorerod · 2024-11-29T14:29:24Z

models/forms/data_record.sql

@@ -22,6 +22,7 @@ SELECT
  document_metadata.saved_timestamp,
  to_timestamp((NULLIF(doc->>'reported_date'::text, ''::text)::bigint / 1000)::double precision) AS reported,
  doc->>'form' as form,
+  doc#>>'{fields,inputs,source}' as source,


Does breaking this into separate columns (fields, inputs, source) make sense?
This can improve clarity in downstream usage, simplify debugging and testing, and align with potential future changes of individual transformation.

I don't see any benefits of breaking this into the separate columns. Moreover, that way is less performant and I've opened an issue to make this changes in every other places.

Your suggestion to use direct access with #>> is an excellent approach for improving performance. If downstream processes don't need to select specific fields, inputs, or source attributes, separating them may not be necessary.
However, if more granularity is required, we could balance performance and flexibility by separating the columns while using direct access instead of ->.
Please let me know what you think.

lorerod · 2024-11-29T14:34:46Z

models/forms/forms.yml

@@ -23,6 +23,8 @@ models:
        data_type: timestamp with time zone
      - name: form
        data_type: string
+      - name: source


Did you consider adding generic data test here? You can refer to this documentation.

lorerod · 2024-11-29T15:58:41Z

models/forms/data_record.sql

I will also recommend updating the data_record fixtures to include the new fields along with their expected results.

Additionally, it would be interesting to have a sqltest to verify that the values for the new fields are correctly populated and consistent between data_record and couchdb.

For these updates, you can refer to the Guide for testing dbt models or review the existing tests already implemented in this repository. Let me know if you need any assistance!

#184 add source on data_record

f23e7cb

1yuv requested review from lorerod and witash November 25, 2024 19:06

lorerod requested changes Nov 29, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(#184): add source column on data_record #185

feat(#184): add source column on data_record #185

1yuv commented Nov 25, 2024 •

edited by andrablaj

Loading

lorerod left a comment

lorerod Nov 29, 2024

1yuv Dec 3, 2024

lorerod Dec 3, 2024 •

edited

Loading

lorerod Nov 29, 2024

lorerod Nov 29, 2024

feat(#184): add source column on data_record #185

Are you sure you want to change the base?

feat(#184): add source column on data_record #185

Conversation

1yuv commented Nov 25, 2024 • edited by andrablaj Loading

Description

License

lorerod left a comment

Choose a reason for hiding this comment

lorerod Nov 29, 2024

Choose a reason for hiding this comment

1yuv Dec 3, 2024

Choose a reason for hiding this comment

lorerod Dec 3, 2024 • edited Loading

Choose a reason for hiding this comment

lorerod Nov 29, 2024

Choose a reason for hiding this comment

lorerod Nov 29, 2024

Choose a reason for hiding this comment

1yuv commented Nov 25, 2024 •

edited by andrablaj

Loading

lorerod Dec 3, 2024 •

edited

Loading