
DEV-458 end-to-end automated test for metadata workflow #46

Draft
wants to merge 112 commits into
base: main
from


moseshll
Contributor

  • README updates on standard directory locations
  • README updates on daily script inputs and outputs

@moseshll moseshll marked this pull request as draft November 13, 2024 22:07
moseshll and others added 24 commits December 5, 2024 15:58
- README updates on standard directory locations
- README updates on daily script inputs and outputs
* accepts empty lines
* accepts 9 digit catalog record IDs
* rejects all other non-whitespace lines
* sets up empty journal
* sets up logging to string for testing
- `with_test_environment` helper.
* Fix regex for verifying digitization source
* Add optional 'gzipped' argument
* Make 'errmsg' a keyword argument
* verify lines have the expected number of columns
* verify fields by regex as best we can
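For reviewers, the line checks described in these commits (blank lines accepted, 9-digit catalog record IDs accepted, any other non-whitespace line rejected, and a column count check) could look roughly like the sketch below. The method names, the tab delimiter, and the error message wording are assumptions for illustration and are not taken from the PR's code.

```ruby
# Hypothetical sketch; names and messages are illustrative only.
CATALOG_ID_LINE = /\A\d{9}\z/

# Returns error messages for lines that are neither blank (empty or
# whitespace-only) nor exactly a 9-digit catalog record ID.
def id_list_errors(lines)
  errors = []
  lines.each_with_index do |line, index|
    next if line.strip.empty?
    next if CATALOG_ID_LINE.match?(line.chomp)
    errors << "line #{index + 1}: unexpected content #{line.chomp.inspect}"
  end
  errors
end

# Returns error messages for lines whose column count differs from
# expected_columns. Assumes tab-separated input; the -1 limit keeps
# trailing empty fields so they are counted.
def column_count_errors(lines, expected_columns:)
  errors = []
  lines.each_with_index do |line, index|
    count = line.chomp.split("\t", -1).length
    next if count == expected_columns
    errors << "line #{index + 1}: expected #{expected_columns} columns, got #{count}"
  end
  errors
end
```

Collecting messages instead of raising on the first bad line lets a test assert on the full set of errors in one pass.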

Still to do: test hathifiles line counts against catalog; directly test
HathifileContentsVerifier
- Add `database` to `Services` canister
- Add `PopulateRightsVerifier` with unit tests
@moseshll moseshll force-pushed the DEV-458_E2E_workflow_test branch from bff9a91 to a2623bc Compare December 5, 2024 21:12
aelkiss and others added 3 commits December 5, 2024 16:43
* Squelch logging output from tests -- since we now have an 'errors'
  method to check this in tests

To do: integration test for hathifiles
moseshll and others added 13 commits December 18, 2024 16:23
* pull repeated uses to :around blocks
* override TMPDIR by default
* Use env vars directly in derivative, or extract logic from
  directory_for where needed
* Move filename logic from Verifier to Derivative
* move helpers out of spec helper to derivatives
* use ClimateControl
* move test for all missing files to derivatives integration
* raise exception if requesting nonexistent 'full' deletes file
* Reduces level of nested ifs
* Separate catalog full verification from catalog update -- rather
  different behavior
* Also: update documentation regarding derivatives / paths
* prep bin/verify.rb for testing
* try running bin/verify.rb
* use run_for_date by default; remove requirement for journal
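The TMPDIR override mentioned in these commits can be pictured with the sketch below. The PR itself uses ClimateControl inside RSpec `around` blocks; this version uses only the standard library so it stands alone, and the helper name `with_temporary_tmpdir` is hypothetical.

```ruby
require "tmpdir"

# Hypothetical helper: runs the block with TMPDIR pointed at a fresh
# temporary directory, then restores the previous value. The directory
# is removed when the block returns.
def with_temporary_tmpdir
  Dir.mktmpdir do |dir|
    previous = ENV["TMPDIR"]
    ENV["TMPDIR"] = dir
    begin
      yield dir
    ensure
      # Assigning nil removes the variable if it was unset before.
      ENV["TMPDIR"] = previous
    end
  end
end
```

In a spec, the same idea would typically be wrapped as `around { |example| with_temporary_tmpdir { example.run } }`, so no example leaks files or environment changes into the next one.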
aelkiss and others added 9 commits December 20, 2024 14:50
* sufficient fixtures to run all verifiers (some fixtures have been
  moved/renamed and adjusted in tests appropriately)
* verify.rb (for now) notes which date is needed for 'today'
* extract solr catalog & hathifile database helpers to spec/support
* info logging
…n date" and not "datestamp"

- `Derivative` subclasses can provide a timestamp delta
- PostZephirDerivatives and Dates also changed to "run date" semantics.
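One way to read the "run date" semantics with a per-subclass timestamp delta is sketched below. The class names, the `datestamp_delta` method, and the one-day offset are all assumptions for illustration; the PR's actual `Derivative` API may differ.

```ruby
require "date"

# Hypothetical base class: a derivative is identified by the date the
# workflow runs, and the datestamp in its filename is derived from it.
class SketchDerivative
  attr_reader :run_date

  def initialize(run_date:)
    @run_date = run_date
  end

  # Days between the run date and the date embedded in the filename.
  # Subclasses override this when their files carry a different date.
  def datestamp_delta
    0
  end

  def datestamp
    run_date + datestamp_delta
  end
end

# Hypothetical subclass whose files are stamped with the previous day.
class SketchPreviousDayDerivative < SketchDerivative
  def datestamp_delta
    -1
  end
end
```

With this shape, callers pass only the run date and each derivative works out which filename date it expects.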
…integration_spec.rb`

- Move `hathifile_derivative_spec.rb` to `derivative/hathifile_spec.rb`
- Rename `HathifileRedirectsVerifier` to `HathifilesRedirectsVerifier`
Previous commit changed everything to use the current date instead of the
filename date, so we can call the verifiers as they were before.
…n to monthlies)

- Requires some wrangling of TMPDIR to find `ht_bib_export_incr...` and dollar dup files
- Update .gitignore to just ignore data/ so we don't miss new fixtures
  - For now these ENV values are taken from hathifiles secret
  - `database.yml` appears to be read-write to the rights DB (in practice) so using it for Verifier is not appropriate.

3 participants