Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Port Scotland #58

Merged
merged 72 commits into from
Jul 10, 2024
Merged

Port Scotland #58

merged 72 commits into from
Jul 10, 2024

Conversation

sgreenbury
Copy link
Collaborator

@sgreenbury sgreenbury commented Mar 12, 2024

Contributes towards #36 and supersedes #26 since porting to dagster.

Updated: this version now implements the country base class approach #98. Further revisions to be done with #138.


Remaining tasks

  • Revise the catalog with metadata class fields
  • Add descriptions for tables from table index
  • Subset some assets for specific partition key (OA11/LC1117SC)
  • Check geometries available
  • Add source data release
  • Update pipeline to output as required following update output types of publishing assets, add cloud sensors #92 and potentially Port Northern Ireland #98
  • Reshape data across tables for derived metrics to not lose any data include generating column names and hxltags programmatically conditional on the table variables
  • Merge main and add required changes
  • Confirm pipeline runs

@sgreenbury sgreenbury changed the title Port scotland Port Scotland Mar 12, 2024
@andrewphilipsmith
Copy link
Collaborator

andrewphilipsmith commented Mar 27, 2024

@sgreenbury Please could you add detail about how this PR relates to #26? Does this supersede #26, or will both PRs need to be merged in due course? Many thanks

@andrewphilipsmith andrewphilipsmith added this to the v0.2 release milestone Mar 27, 2024
@sgreenbury
Copy link
Collaborator Author

@sgreenbury Please could you add detail about how this PR relates to #26? Does this supersede #26, or will both PRs need to be merged in due course? Many thanks

Yes, this supersedes #26 since porting to dagster. I'll update the first comment to clarify and close #26. Thanks!

@sgreenbury
Copy link
Collaborator Author

Following the discussion just now, one missing detail that should be implemented is capturing the SourceDataRelease for each table. This seems to be provided by the first column of the metadata catalog (e.g. 3C is release 3C for the corresponding census). From this a set of SourceDataRelease metadatas should be included (possibly in a separate module) and populated and then looked up as the source_tables are constructed.

Copy link
Collaborator

@andrewphilipsmith andrewphilipsmith left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This all looks good to me.

The various comments are all minor things (and one's I'm no doubt guilty of myself!). I think the best option would be to merge this and open issues, or new PRs to deal with most of the comments.

pyproject.toml Outdated Show resolved Hide resolved
python/popgetter/assets/gb_sct/__init__.py Show resolved Hide resolved
python/popgetter/assets/gb_sct/__init__.py Outdated Show resolved Hide resolved
python/popgetter/assets/gb_sct/__init__.py Outdated Show resolved Hide resolved
python/popgetter/assets/gb_sct/__init__.py Outdated Show resolved Hide resolved
@sgreenbury
Copy link
Collaborator Author

Thanks @andrewphilipsmith! I'll go ahead and merge this now once the checks pass and then open new issues/PRs for the areas you mention + any fixes for the currently missing derived metrics.

@sgreenbury sgreenbury merged commit ff0d019 into main Jul 10, 2024
8 checks passed
@sgreenbury sgreenbury deleted the 36-port-scotland branch July 10, 2024 15:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done:
Development

Successfully merging this pull request may close these issues.

3 participants