Named query generation #655

GeoWill · 2024-07-03T14:20:16Z

This is now in the state it was in when I used it to generate the results for the 2024-07-04 General Election, so it's probably worth thinking about while it's still fresh-ish.

GeoWill · 2024-07-11T08:02:48Z

dc_logging_aws/named_queries/commands/create_election_query_files.py

+            raise argparse.ArgumentTypeError(msg)
+
+    def create_query_directory(self):
+        queries_dir = self.script_dir.parent / "queries"


The outstanding question I have is whether we want the queries directory in the GitHub repo or not. I lean towards 'yes', because it's nice to keep a record of these things, and it's easy to delete things in Athena. But maybe it should be 'no' until we're actually using CI (or some other automation) to run the queries, because until then we'll have two sources of truth (Athena and GitHub).
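If the directory does get committed, one way to keep an eye on the two sources of truth would be a small drift check. This is only a rough sketch and not part of this PR: `load_local_queries` and `diff_queries` are hypothetical helpers, the `*.sql` extension is an assumption about how the query files are named, and the `remote` mapping (query name to SQL text) would have to be fetched from Athena separately, which isn't shown here.

```python
from pathlib import Path


def load_local_queries(queries_dir: Path) -> dict[str, str]:
    """Map query name (file stem) to SQL text for every .sql file found.

    Assumes one query per file and a .sql extension (both are assumptions).
    """
    return {p.stem: p.read_text() for p in sorted(queries_dir.rglob("*.sql"))}


def diff_queries(local: dict[str, str], remote: dict[str, str]) -> dict[str, list[str]]:
    """Report where the local files and the remote named queries disagree."""
    shared = local.keys() & remote.keys()
    return {
        # In the repo but never synced up
        "only_local": sorted(local.keys() - remote.keys()),
        # Created (or left behind) in Athena but not in the repo
        "only_remote": sorted(remote.keys() - local.keys()),
        # Same name, different SQL once surrounding whitespace is ignored
        "changed": sorted(n for n in shared if local[n].strip() != remote[n].strip()),
    }
```

Something like this could run in CI once the sync command is automated, failing the build if any of the three lists is non-empty.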

symroe · 2024-08-12T15:02:49Z

Not a review, but a reminder that we need to update the two API users CSV files on S3 (that are joined via a Glue table) before running queries. I think the default join is INNER, so API keys missing from the CSV files just get excluded from the query results.
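To illustrate the effect, here is a minimal sketch using an in-memory SQLite database rather than Athena. The table names (`logs`, `api_users`), columns and keys are all made up for the example and are not the real Glue schema. In standard SQL a bare `JOIN` is an inner join, so rows whose key has no match in the users table drop out silently, while a `LEFT JOIN` keeps them.

```python
import sqlite3

# Hypothetical schema and data, for illustration only
SETUP = """
CREATE TABLE logs (api_key TEXT, path TEXT);
CREATE TABLE api_users (api_key TEXT, user_name TEXT);
INSERT INTO logs VALUES
    ('key-a', '/postcode'), ('key-b', '/postcode'), ('key-b', '/address');
INSERT INTO api_users VALUES ('key-a', 'Known user');
"""


def _connect() -> sqlite3.Connection:
    conn = sqlite3.connect(":memory:")
    conn.executescript(SETUP)
    return conn


def count_rows(join_type: str) -> int:
    """Count log rows that survive joining to api_users with the given join."""
    if join_type not in {"INNER", "LEFT"}:
        raise ValueError(f"unsupported join type: {join_type}")
    conn = _connect()
    try:
        sql = f"SELECT COUNT(*) FROM logs {join_type} JOIN api_users USING (api_key)"
        return conn.execute(sql).fetchone()[0]
    finally:
        conn.close()


def missing_keys() -> list[str]:
    """List API keys seen in the logs that have no row in api_users."""
    conn = _connect()
    try:
        sql = """
            SELECT DISTINCT logs.api_key
            FROM logs LEFT JOIN api_users USING (api_key)
            WHERE api_users.user_name IS NULL
            ORDER BY logs.api_key
        """
        return [row[0] for row in conn.execute(sql)]
    finally:
        conn.close()
```

A query along the lines of `missing_keys` could be run before the report queries, as a check that the CSV files are up to date.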

I have hacked a script in devs.DC locally to update the CSV. I need to commit this, or find another way to update the CSV file. We need to do the same for the EC API.

GeoWill force-pushed the named-query-generation branch 4 times, most recently from 9d68901 to 7a32792 Compare July 3, 2024 18:17

GeoWill added 6 commits July 11, 2024 08:57

Templates for named queries for election reports

0f158f0

Command to create queries for election

d73fb6f

Command to sync query files up to athena

c562c80

Command to run queries and save results

9bbd7a5

add timeseries queries

a805ff0

Add pagination to results

89feacc

GeoWill force-pushed the named-query-generation branch from 7a32792 to 89feacc Compare July 11, 2024 07:57


GeoWill self-assigned this Jul 11, 2024

GeoWill requested a review from symroe July 11, 2024 08:03

fixup! add timeseries queries

d05114b
