...

👥 Participants

Apologies:

CC:

🥅 Goals

...

Recent sandboxes for review / deployments:

Thomas, Jyothish (STFC,RAL,SC)

Item

Presenter

Notes

Operational Issues
Gateways and WNs:
- Current status and upcoming changes

Thomas, Jyothish (STFC,RAL,SC)

Further observations from Data Challenge

Future testing plans (RAL-initiated / VO-initiated)?

All

DC24 observations

Still to improve: LB, checksums (in-flight + metadata).

[Image: image-20240229-134100.png]

RR with failsafe hotspotting

Rocky 8 migration planning

Deletion studies through RDR

Ian Johnson

The deletion times reported previously were measured end-to-end from the client side. Now analysing the times of individual ceph_posix_unlink calls, and will then look at deletion request times within XRootD itself, that is, how long a deletion request spends travelling through the layers of XRootD code.

I’ve noted some strange outliers in an initial sample of 6808 measurements, taken from a single gateway on 22nd Feb. In the plot below, the y-axis shows times in ms (the last tick should read 80000 ms). The stddev is 4476, and there is a possible ceph_posix_unlink outlier at around 90 s:

[Image: atlas-unlink-times.png]
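As a rough illustration of how a sample like this could be summarised and its outliers picked out, here is a minimal Python sketch. The function name, the 3-sigma cutoff and the sample values are assumptions made for illustration; they are not taken from the actual analysis or its data.

```python
import statistics


def summarise_unlink_times(times_ms, threshold_sigmas=3.0):
    """Summarise call durations (in ms) and list samples far above the mean.

    The cutoff of mean + threshold_sigmas * stddev is an assumed,
    commonly used rule of thumb, not the method used in the study.
    """
    mean = statistics.fmean(times_ms)
    stddev = statistics.pstdev(times_ms)  # population standard deviation
    cutoff = mean + threshold_sigmas * stddev
    return {
        "count": len(times_ms),
        "mean": mean,
        "median": statistics.median(times_ms),
        "stddev": stddev,
        "max": max(times_ms),
        "outliers": [t for t in times_ms if t > cutoff],
    }


# Made-up sample: 99 fast calls and a single 90 s call.
sample = [12.0] * 99 + [90000.0]
summary = summarise_unlink_times(sample)
print(summary["median"], summary["max"], summary["outliers"])
```

With a long-tailed distribution like this, the median stays close to the typical call time while the mean and stddev are dominated by the single slow call, so it is worth reporting both.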

Deletions

Jira: XRD-83

Next steps:

  • Collate previous information to define the problem.

  • Identify potential solutions.

Planning for ALICE CMSD redirection

Thomas, Jyothish (STFC,RAL,SC)

INC-163994 - DNS IP additions

Checksum fixes

Alexander Rogovskiy; Thomas, Jyothish (STFC,RAL,SC)

https://github.com/stfc/xrootd/pull/9

Prefetch studies and WN changes

Alexander Rogovskiy

Tokens Status

Thomas, Jyothish (STFC,RAL,SC); Katy Ellis

CMSD Load balancing

Thomas Byrne; Thomas, Jyothish (STFC,RAL,SC)

PR: https://github.com/stfc/xrootd/pull/8/files

The PR addresses the issue where the current selByLoad algorithm leads to load hotspotting and coarse load distribution.

It replaces the current algorithm in XRootD. Testing of the simulated behaviour is in progress.
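To picture what hotspotting means here, the following toy Python model may help. It is not the selByLoad code, nor the replacement in the PR; it only assumes that servers report load periodically, so that within one reporting interval a strict "pick the least-loaded server" rule keeps choosing the same server. The function names and load values are hypothetical.

```python
import random


def select_least_loaded(loads):
    """Greedy choice: always the server with the lowest reported load."""
    return min(range(len(loads)), key=lambda i: loads[i])


def select_weighted_random(loads, rng):
    """Random choice weighted by spare capacity (100 - load, at least 1)."""
    weights = [max(100 - load, 1) for load in loads]
    return rng.choices(range(len(loads)), weights=weights, k=1)[0]


def simulate(selector, reported_loads, n_requests):
    """Count requests per server while the load reports stay unchanged."""
    counts = [0] * len(reported_loads)
    for _ in range(n_requests):
        counts[selector(reported_loads)] += 1
    return counts


# Hypothetical load reports (percent), assumed stale for the whole interval.
reported = [40, 42, 45, 41]
rng = random.Random(0)  # fixed seed so the run is repeatable

greedy = simulate(select_least_loaded, reported, 1000)
spread = simulate(lambda loads: select_weighted_random(loads, rng), reported, 1000)
print("least-loaded:   ", greedy)
print("weighted random:", spread)
```

In this model the greedy rule sends every request in the interval to the server that reported 40% load, even though the others are nearly as idle, while the weighted choice spreads requests roughly in proportion to spare capacity.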

SKA Gateway box

James Walder

/wiki/spaces/UK/pages/215941180

5.6.x root TPC issue

https://github.com/xrootd/xrootd/issues/2202

Issue open, under investigation.

...

Putting the first new storage nodes in a long time into production next week; any tips?

  • Tom: better control of the number of placement groups that could be moving.

Glasgow

✅ Action items

  • James Walder to schedule a ‘hackathon’ within an F2F meeting, with a session on architectural planning.

  • James Walder to prepare an outline of the expected roadmap for XRootD developments in 2024.

...