1579118 : Your held Jobs¶
Created: 2026-02-24T07:36:01Z - current status: new¶
Here is the anonymized and summarized version of the reported issue:
Summary of the Issue¶
A large number of jobs ([~500]) were placed on hold due to exceeding the requested disk space allocation. The jobs had a RequestDisk value of 0 MB, but their actual disk usage (DiskUsage) reached 1500 MB. Since the jobs did not explicitly request disk space, the system enforced a default limit, causing them to be held.
Solution¶
-
Option 1: Remove the held jobs Delete the jobs and resubmit them with updated disk requirements.
-
Option 2: Edit and release the jobs
- Update the
RequestDiskattribute to a value greater than the actual disk usage (e.g.,RequestDisk = 2000for 1500 MB usage). - Use the following commands:
bash condor_qedit [JOB_ID] "RequestDisk = 2000" condor_release [JOB_ID] - For bulk editing, loop through all held jobs or use wildcards (if supported).
Prevention for Future Submissions¶
- Explicitly define
RequestDiskin the job submission file to avoid default limits. - Monitor disk usage during job execution to ensure requirements align with actual needs.