response to persist work actor failure #159

lsat12357 · 2019-11-27T17:18:44Z

I've been finding works that, according to the aasm_status, failed during the persist_work stage, but are actually in Hyrax. Presumably something goes wrong after the object is saved but before the stack completes.
The causes I've seen so far (other than problems we simply need to fix) have to do with the attach files job. I think we could have a job/service that retries attaching the files and checks visibility, plus a callback that updates the migrator work status if that operation succeeds.
We could add more services to cover other failures as they become apparent.
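
The retry-and-callback idea above could be sketched roughly as follows. This is plain Ruby under stated assumptions, not the migrator's actual code: `MigratorWork`, `AttachFilesRetry`, the three callables, and the status strings are all hypothetical stand-ins, and a real version would call into Hyrax and trigger an AASM event instead of assigning the status directly.

```ruby
# Hypothetical sketch: none of these names come from Hyrax or the migrator.
# Stand-in for a migrator work record whose status would be managed by AASM.
MigratorWork = Struct.new(:id, :status)

# Retries the attach files step, verifies visibility, and fires a callback on success.
class AttachFilesRetry
  attr_reader :last_error

  def initialize(attach:, visibility_ok:, on_success:, max_attempts: 3)
    @attach = attach               # callable that re-runs the attach files step
    @visibility_ok = visibility_ok # callable returning true when visibility is correct
    @on_success = on_success       # callable that updates the migrator work status
    @max_attempts = max_attempts
  end

  # Returns true if the work was repaired, false if every attempt failed.
  def call(work)
    @max_attempts.times do
      begin
        @attach.call(work)
        if @visibility_ok.call(work)
          @on_success.call(work)
          return true
        end
      rescue StandardError => e
        # Assume the error is transient; remember it and try again.
        @last_error = e
      end
    end
    false
  end
end

# Usage: the first attach attempt fails, the second succeeds.
attempts = 0
retry_service = AttachFilesRetry.new(
  attach: ->(_work) { attempts += 1; raise IOError, 'end of file reached' if attempts < 2 },
  visibility_ok: ->(_work) { true },
  on_success: ->(work) { work.status = 'persisted' }
)
work = MigratorWork.new(1, 'persist_work_failed')
repaired = retry_service.call(work)
```

Passing the steps in as callables keeps the retry logic separate from the repair itself, so further services for other failure types could reuse the same wrapper.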

lsat12357 · 2020-01-28T21:39:49Z

Add other services as needed.

lsat12357 · 2021-04-22T21:43:14Z

Because of intermittent system errors this time around:

ERROR: Undefined namespace prefix: /rdf:RDF/rdf:Description/dc:title/text()
Failed to open TCP connection to fcrepo.od2-test.svc.cluster.local:8080 (getaddrinfo: Temporary failure in name resolution)
end of file reached

a number of assets were ingested in an incomplete state.
Fixing them required some combination of:

attaching the fileset
setting visibility
setting collections
creating new sipity entity

Maybe we should draw the line somewhere and just delete/reingest? We just had some changes to infra, so we likely will not see this level of error when we start migrating for real.
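
One way to "draw the line" would be a simple threshold on how many of the four fixes an asset needs. The sketch below is only an illustration: the state keys, the `repair_plan` helper, and the default threshold of 2 are assumptions, and how to detect each missing piece in Hyrax is left out.

```ruby
# Hypothetical sketch: the state keys and the threshold are illustrative assumptions.
REPAIR_STEPS = {
  fileset_attached: 'attach the fileset',
  visibility_set: 'set visibility',
  collections_set: 'set collections',
  sipity_entity_present: 'create new sipity entity'
}.freeze

# Returns the list of repair steps for a work, or :reingest when more than
# max_repairs pieces are missing and a delete/reingest is likely simpler.
def repair_plan(state, max_repairs: 2)
  missing = REPAIR_STEPS.reject { |key, _step| state[key] }.values
  return [] if missing.empty?

  missing.size > max_repairs ? :reingest : missing
end

# Usage: one asset is only missing its fileset, another is missing everything.
nearly_complete = { fileset_attached: false, visibility_set: true,
                    collections_set: true, sipity_entity_present: true }
mostly_broken = { fileset_attached: false, visibility_set: false,
                  collections_set: false, sipity_entity_present: false }

plan_a = repair_plan(nearly_complete)
plan_b = repair_plan(mostly_broken)
```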

lsat12357 changed the title ~~persist work actor does not always have correct status~~ response to persist work actor failure Dec 26, 2019

lsat12357 mentioned this issue Dec 26, 2019

actor/service/job for restarting failed works #129

Closed

lsat12357 self-assigned this Dec 26, 2019

raybrarian added the Migration label Jan 2, 2020

lsat12357 mentioned this issue Jan 13, 2020

Migration fix task OregonDigital/OD2#944

Merged

luisgreg99 closed this as completed in OregonDigital/OD2#944 Jan 14, 2020

lsat12357 reopened this Jan 28, 2020

lsat12357 added the Epic label Jan 28, 2020

lsat12357 removed their assignment Jan 29, 2020
