1596802 : naf-jhub - Server spawn failed¶
Created: 2026-04-29T05:29:33Z - current status: new¶
Anonymized Summary:
A user reports persistent issues when attempting to start a Jupyter notebook server on the NAF platform. The error message indicates a timeout after 60 seconds:
"Spawn failed: Server at http://[WORKER_NODE]:40000/user/[USERNAME]/api didn't respond in 60 seconds".
Attempts to resolve the issue by retrying with different configurations (e.g., GPU/non-GPU) have been unsuccessful.
Possible Causes & Solutions:¶
Based on the provided context, the issue may stem from one of the following:
- Deadlocked Notebook Entry in the Hub
- The user’s notebook state may be stuck due to a race condition or failed route deletion in the JupyterHub database.
-
Solution:
- Try using the "Stop My Server" button in the JupyterHub GUI to force removal of the obsolete entry.
- If the button fails, the issue may require admin intervention (e.g., account reset or hub restart). The user can contact NAF support for assistance.
-
Disk Quota Exceeded
- The error log in
.jupyterhub.condor.err(located in the user’s$HOMEdirectory) may show:Failed to write server-info to [PATH]: OSError(122, 'Disk quota exceeded') -
Solution:
- Check disk usage (
fs lqorquota) and clean up unnecessary files in$HOME. - Request a quota increase if needed.
- Check disk usage (
-
Unresponsive CVMFS Mounts
- A known issue (December 2024) where CVMFS mounts prevent the notebook from reporting its port to the hub, leading to timeouts.
-
Solution:
- Retry spawning after some time, as the healthcheck bug has been fixed and faulty nodes should now be removed automatically.
-
Local Jupyter Configuration Issue
- A misconfigured setting in
.jupyter/jupyter_server_config.json(e.g.,open_browser = False) may block the hub connection. -
Solution:
- Check the file and comment out problematic entries.
-
Worker Node Issues
- The specific worker node (
batchj004.desy.de) may be unresponsive or overloaded. - Solution:
- Wait and retry later, or contact NAF admins to investigate the node.
Recommended Next Steps:¶
- Check Logs:
- Review
.jupyterhub.condor.errand.jupyterhub.condor.outin$HOMEfor detailed error messages. -
Look for
Disk quota exceededor CVMFS-related errors. -
Force Server Removal:
-
Use the "Stop My Server" button in the JupyterHub GUI to clear stale entries.
-
Clean Up Disk Space:
-
Free up space in
$HOMEif quota is exceeded. -
Retry Later:
-
If the issue persists, wait and retry, as it may be related to temporary node or CVMFS problems.
-
Contact Support:
- If none of the above works, reach out to NAF admins for further diagnosis.
Sources: - NAF Documentation: Notebook Spawning Issues - JupyterHub BatchSpawner Issue #314