You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A number of users have report issues where the pScheduler service is showing as not running on the Toolkit web page. This appears to be caused by a missing PID file for one of the services. So far, I have only been able to access a host with this issue once and the service in question was pscheduler-archiver. I don't have enough data points to know if it is limited to this service or others have it as well since any one process missing a pid would cause the toolkit to display as such. It also seems to be most prevalent on EL9 systems.
If you look at systemctl status pscheduler-archive while in this state it is listed as running. Also, doing systemctl show pscheduler-archiver has NRestarts as 0, which makes me think this wasn't the result of some automatic restart by systemd. I was able to fix this by restarting the archive process with systemctl by hand, at which point the pid was created. What's also strange is that before the restart there was no results getting archived, but after restarting the process I started getting results on the host...so i think this is more than just a display issue on toolkit page. Output of systemctl commands below:
systemctl status pscheduler-archiver
# systemctl status pscheduler-archiver
● pscheduler-archiver.service - pScheduler server - archiver
Loaded: loaded (/usr/lib/systemd/system/pscheduler-archiver.service; enabled; preset: disabled)
Active: active (running) since Fri 2023-09-29 06:18:24 CEST; 1 month 4 days ago
Main PID: 987086 (archiver)
Tasks: 9 (limit: 76198)
Memory: 121.4M
CPU: 23min 20.660s
CGroup: /system.slice/pscheduler-archiver.service
├─236298 /usr/bin/python3 /usr/libexec/pscheduler/daemons/archiver --dsn @/etc/pscheduler/database/database-dsn --pid-file /run/pscheduler-ser>
├─236306 python3 /usr/libexec/pscheduler/classes/archiver/esmond/archive
├─236308 python3 /usr/libexec/pscheduler/classes/archiver/http/archive
├─236309 python3 /usr/libexec/pscheduler/classes/archiver/http/archive
└─987086 /usr/bin/python3 /usr/libexec/pscheduler/daemons/archiver --dsn @/etc/pscheduler/database/database-dsn --pid-file /run/pscheduler-ser>
Nov 03 15:03:58 perfsonar01-iep-grid.saske.sk archiver[225158]: archiver WARNING 2964: Failed to archive https://perfsonar01-iep-grid.saske.sk/pscheduler/>
Nov 03 15:03:58 perfsonar01-iep-grid.saske.sk archiver[225158]: archiver WARNING 2964: Gave up archiving https://perfsonar01-iep-grid.saske.sk/pscheduler/>
Nov 03 15:04:35 perfsonar01-iep-grid.saske.sk archiver[225158]: archiver WARNING 2967: Failed to archive https://perfsonar01-iep-grid.saske.sk/pscheduler/>
Nov 03 15:04:35 perfsonar01-iep-grid.saske.sk archiver[225158]: archiver WARNING 2967: Gave up archiving https://perfsonar01-iep-grid.saske.sk/pscheduler/>
Nov 03 15:04:38 perfsonar01-iep-grid.saske.sk archiver[225158]: archiver WARNING 2970: Failed to archive https://perfsonar01-iep-grid.saske.sk/pscheduler/>
Nov 03 15:04:38 perfsonar01-iep-grid.saske.sk archiver[225158]: archiver WARNING 2970: Gave up archiving https://perfsonar01-iep-grid.saske.sk/pscheduler/>
Nov 03 15:06:50 perfsonar01-iep-grid.saske.sk archiver[236298]: archiver WARNING 2973: Failed to archive https://perfsonar01-iep-grid.saske.sk/pscheduler/>
Nov 03 15:06:50 perfsonar01-iep-grid.saske.sk archiver[236298]: archiver WARNING 2973: Gave up archiving https://perfsonar01-iep-grid.saske.sk/pscheduler/>
Nov 03 15:07:29 perfsonar01-iep-grid.saske.sk archiver[236298]: archiver WARNING 2976: Failed to archive https://perfsonar01-iep-grid.saske.sk/pscheduler/>
Nov 03 15:07:29 perfsonar01-iep-grid.saske.sk archiver[236298]: archiver WARNING 2976: Gave up archiving https://perfsonar01-iep-grid.saske.sk/pscheduler/>
A number of users have report issues where the pScheduler service is showing as not running on the Toolkit web page. This appears to be caused by a missing PID file for one of the services. So far, I have only been able to access a host with this issue once and the service in question was pscheduler-archiver. I don't have enough data points to know if it is limited to this service or others have it as well since any one process missing a pid would cause the toolkit to display as such. It also seems to be most prevalent on EL9 systems.
If you look at
systemctl status pscheduler-archive
while in this state it is listed as running. Also, doingsystemctl show pscheduler-archiver
has NRestarts as 0, which makes me think this wasn't the result of some automatic restart by systemd. I was able to fix this by restarting the archive process with systemctl by hand, at which point the pid was created. What's also strange is that before the restart there was no results getting archived, but after restarting the process I started getting results on the host...so i think this is more than just a display issue on toolkit page. Output of systemctl commands below:systemctl status pscheduler-archiver
systemctl show pscheduler-archiver
The text was updated successfully, but these errors were encountered: