Add additional logging to the cleanup process #861
Labels
🤖 aspect: dx
Concerns developers' experience with the codebase
✨ goal: improvement
Improvement to an existing user-facing feature
🟧 priority: high
Stalls work on the project or its dependents
🧱 stack: ingestion server
Related to the ingestion/data refresh server
Problem
Currently, the cleanup process only records the final number of cleaned rows. We want to have more information on how many rows are affected and what type of cleanup is applied (malformed URL fixed, tag filtering) in order to develop a strategy to apply these changes on the upstream database and remove the steps from the ingestion server.
Additional context
Previous attempt at WordPress/openverse-api#1126
The text was updated successfully, but these errors were encountered: