You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since March, HTCondor jobs on lxplus do not have the CMSSW environment set correctly, nor JOBID or TASKID as defined in submit_HTCondor.sub. This causes the following error and subsequent job failure:
The environment issue in lxplus HTCondor should be solved with PR #67. It should also be possible now to ask jobs to be run in a singularity container.
One issue, however, is that if you work in a singularity, you lose the ability to submit jobs (condor_submit cannot be found anymore). We need to find a workaround for this... :( It would mean people have to exit the singularity to submit jobs.
However, if you have a CMSSW 11 or 12 setup with SCL7/CC7 inside a cmssw-cc7 singularity on a lxplus EL9 node, C++ libraries like ROOT will stop working, and so we run into a new compatibility issue... Currently,
pico.py submit needs both ROOT andcondor_submit, and
pico.py status needs ROOT andcondor_q...
This means that when using a singularity, we need to prepare jobs inside the singularity, and then exit it to submit it (e.g. as a simple shell script with all the condor_submit commands).
Issue: Environment not set
Since March, HTCondor jobs on lxplus do not have the CMSSW environment set correctly, nor
JOBID
orTASKID
as defined insubmit_HTCondor.sub
. This causes the following error and subsequent job failure:Our hacky workaround was to hardcode our individual
CMSSW_BASE
path in the executablesubmit_HTCondor.sh
script and docmsenv
...The cause appears to be that newer HTCondor versions have a "new syntax" (documented here), and we have to simply change
to
I'll make a PR with a patch asap.
Issue: SLC7/CC7/CentOS7 compatibility on lxplus
CERN's lxplus is phasing out CentOS7 by end of June 2024 (see this announcement and this page).
If we want to keep using CMSSW 11 or 12 on a SLC7 architecture, we have to use a singularity on lxplus user nodes and in HTCondor jobs, see this page:
I'll add this in a future PR as well, and update the instructions in the documentation...
The text was updated successfully, but these errors were encountered: