Checklist for debugging XRootD issues

green - can be done by users

orange - can be done by anyone in the group (e.g. Ceph, batch farm, OC)

red - needs specialized knowledge to resolve

  1. Transfers to X are failing.

    1. Are they failing for everyone else too? That is, do our transfers to other endpoints fail as well, or are all transfers to that endpoint failing?

    2. What’s the error?

    3. Can you ping it or trace the route to it from the gateways?

    4. How is the load looking across the gateways?

    5. How is the load looking on the managers?

    6. Do the server logs report any errors?

  2. High job failure rate.

    1. Is it limited to a specific VO?

    2. Is it affecting specific gens/WNs?

    3. What’s the uptime on the gateway container? Was it killed at some point? What do the syslogs and the server logs show?
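
For the first two questions under "high job failure rate", it can help to tally failures per VO and per worker node and see which grouping stands out. The following is a minimal sketch only: the record layout (the `vo`, `wn` and `failed` keys), the sample values and the `failure_rates` helper are hypothetical and not taken from any real batch system or XRootD tool, so adapt them to whatever your job accounting actually exports.

```python
from collections import defaultdict


def failure_rates(jobs, key):
    """Group job records by `key` and return {group: (failed, total, rate)}."""
    counts = defaultdict(lambda: [0, 0])  # group -> [failed, total]
    for job in jobs:
        group = job[key]
        counts[group][1] += 1
        if job["failed"]:
            counts[group][0] += 1
    return {g: (f, t, f / t) for g, (f, t) in counts.items()}


# Hypothetical sample records; real ones would come from the batch system.
jobs = [
    {"vo": "vo-a", "wn": "wn-001", "failed": True},
    {"vo": "vo-a", "wn": "wn-002", "failed": False},
    {"vo": "vo-b", "wn": "wn-001", "failed": True},
    {"vo": "vo-b", "wn": "wn-003", "failed": False},
    {"vo": "vo-c", "wn": "wn-001", "failed": True},
    {"vo": "vo-c", "wn": "wn-003", "failed": False},
]

for key in ("vo", "wn"):
    print(f"Failure rate by {key}:")
    rates = failure_rates(jobs, key)
    # Worst-affected groups first.
    for group, (failed, total, rate) in sorted(
        rates.items(), key=lambda item: item[1][2], reverse=True
    ):
        print(f"  {group}: {failed}/{total} failed ({rate:.0%})")
```

In this made-up sample every VO fails at the same rate while one worker node fails every job, which would point at the node rather than at a VO.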
