fix(Workable): use append write disposition for candidate subresources #366

ethanve · 2024-02-22T02:41:48Z

Tell us what you do here

Given that there's no primary key set for candidate sub-resources, we should just use append

implementing verified source (please link a relevant issue labeled as verified source)
fixing a bug (please link a relevant bug report)
improving, documenting, or customizing an existing source (please link an issue or describe below)
anything else (please link an issue or describe below)

Relevant issue

issue #

More PR info

AstrakhantsevaAA · 2024-02-26T10:06:09Z

sources/workable/__init__.py

            )
            yield candidates_resource | dlt.transformer(
-                name=f"candidates_{sub_endpoint}", write_disposition="merge"
+                name=f"candidates_{sub_endpoint}", write_disposition="append"


hey! you are right, we missed primary_key (or merge_key) here, candidates details response has no id key to make it primary. I think it makes more sense to set write_disposition='replace', to avoid data duplication.

@AstrakhantsevaAA given that candidates is incremental, it will replace all existing data with just updated ones.
So for example,

Initial sync syncs 100 candidates and 10 offers

Candidate 20 now has an offer

Second pull will replace candidate offers with only a single offer for candidate 20

While merge may have duplicates, it's better than the alternative of replacing everything

AstrakhantsevaAA

Thank you for your contribution!

AstrakhantsevaAA · 2024-03-04T14:32:22Z

sources/workable/__init__.py

            )
            yield candidates_resource | dlt.transformer(
-                name=f"candidates_{sub_endpoint}", write_disposition="merge"
+                name=f"candidates_{sub_endpoint}", write_disposition="append"


AstrakhantsevaAA · 2024-03-05T10:14:28Z

@ethanve can you rebase on master pls?

rudolfix

LGTM!

fix(Workable): use append write disposition for candidate subresources

8f12f8b

AstrakhantsevaAA requested changes Feb 26, 2024

View reviewed changes

rudolfix assigned AstrakhantsevaAA Feb 26, 2024

AstrakhantsevaAA approved these changes Mar 4, 2024

View reviewed changes

AstrakhantsevaAA added the ci from fork Allows to run tests from PR coming from fork label Mar 4, 2024

rudolfix self-requested a review April 2, 2024 11:47

rudolfix approved these changes Apr 22, 2024

View reviewed changes

rudolfix merged commit d4806af into dlt-hub:master Apr 22, 2024
15 of 16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(Workable): use append write disposition for candidate subresources #366

fix(Workable): use append write disposition for candidate subresources #366

ethanve commented Feb 22, 2024 •

edited

Loading

AstrakhantsevaAA Feb 26, 2024

ethanve Mar 1, 2024 •

edited

Loading

AstrakhantsevaAA Mar 4, 2024

AstrakhantsevaAA left a comment

AstrakhantsevaAA Mar 4, 2024

AstrakhantsevaAA commented Mar 5, 2024

rudolfix left a comment

fix(Workable): use append write disposition for candidate subresources #366

fix(Workable): use append write disposition for candidate subresources #366

Conversation

ethanve commented Feb 22, 2024 • edited Loading

Tell us what you do here

Relevant issue

More PR info

AstrakhantsevaAA Feb 26, 2024

Choose a reason for hiding this comment

ethanve Mar 1, 2024 • edited Loading

Choose a reason for hiding this comment

AstrakhantsevaAA Mar 4, 2024

Choose a reason for hiding this comment

AstrakhantsevaAA left a comment

Choose a reason for hiding this comment

AstrakhantsevaAA Mar 4, 2024

Choose a reason for hiding this comment

AstrakhantsevaAA commented Mar 5, 2024

rudolfix left a comment

Choose a reason for hiding this comment

ethanve commented Feb 22, 2024 •

edited

Loading

ethanve Mar 1, 2024 •

edited

Loading