Item

Presenter

Notes

Operational Issues
Gateways and WNs:
- Current status and upcoming changes

Mitigations have been communicated to ATLAS for the jobs using 5.6.0 clients.

Reboot campaign.

XRootD Managers De-VMWareification

(Moving to physical hosts)

Thomas, Jyothish (STFC,RAL,SC)

/wiki/spaces/GRIDPP/pages/872644647

XRootD Cluster Shuffle

Attached file: Redirector de-VMWareification.pptx

Release of 5.7.3

(May expect a 5.8.X release prior to 6.X?)

https://github.com/xrootd/xrootd/releases/tag/v5.7.3

  • Major bug fixes
    [Seckrb5] Avoid null pointer dereference (#2385)
    [XrdPfc] Fix file descriptor leak when reading file size from cinfo file (#2392)

  • Minor bug fixes
    [Protocol] do_WriteSpan() - Add written bytes in file statistics (#2368)
    [XrdHttp] Correct response code for PUT (from 200 to 201) (#2382)
    [XrdHttp] Set oss.asize if object size is known (#2378)
    [XrdOfs] Correct forward declaration of XrdSfsFSctl (#2405)

  • Miscellaneous
    [CI] Drop CentOS 7 builds from GitHub and GitLab CI
    [CI] Move macOS GitHub Actions workflow to macOS 15
    [Docker] Add Dockerfile for Alpine Linux
    [Docker] Remove Dockerfile to build on CentOS 7
    [Docker] Update docker/ subdirectory setup and xrd-docker script
    [Misc] Fix compilation with GCC 15 (#2411)
    [Tests] Fix check for running process to prevent setup failures
    [XrdCl] Improve checking of logging format strings (#2380)
    [XrdSciTokens] Add tests for token-based authorization (#2381)

Checksums issue with an ATLAS file

https://github.com/xrootd/xrootd/issues/2388

https://ggus.eu/index.php?mode=ticket_info&ticket_id=169360

The checksum was requested before the whole file had been updated. Ceph has no ability to do a stale checksum check, so the original checksum ‘sticks’ to the file.

Fix in place on the RAL side: checksums are cleared after a write is complete.
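The failure mode and the fix can be illustrated with a toy model. This is only a sketch: `ToyObjectStore`, `close_after_write` and the in-memory dictionaries are hypothetical stand-ins, not the XrdCeph or RAL code, and Adler-32 is assumed as the checksum type.

```python
import zlib


class ToyObjectStore:
    """Hypothetical in-memory store that caches a checksum per object."""

    def __init__(self):
        self._data = {}
        self._cached_cks = {}

    def write(self, name, payload):
        # Append-only writes; the cached checksum is NOT touched here,
        # which is what lets an early checksum go stale.
        self._data[name] = self._data.get(name, b"") + payload

    def close_after_write(self, name):
        # The fix: drop any stored checksum once the write is complete,
        # so the next request recomputes it from the full object.
        self._cached_cks.pop(name, None)

    def checksum(self, name):
        # Return the cached value if present; there is no staleness check.
        if name not in self._cached_cks:
            self._cached_cks[name] = zlib.adler32(self._data[name]) & 0xFFFFFFFF
        return self._cached_cks[name]
```

In this model, a checksum requested between two writes is cached and keeps being returned until `close_after_write` clears it; after that, the value reflects the complete object.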

cms-aaa naming convention

cms-aaa is the only remaining personality to use proxy/ceph as the xrootd service names


A separate naming convention, to have main/supporting, would be more appropriate (not so urgent).

CC created, and sandbox is prepared and has been tested on a test host.

XRootD Managers De-VMWareification

Thomas, Jyothish (STFC,RAL,SC)

Attached file: Redirector de-VMWareification.pptx

Option 2 preferred for efficiency, but Option 1 decided on

Option 1 would be simpler to implement for a temporary fix, as the move would be reversed

Antares TPC nodes to be moved to an Echo leaf switch; IPv4 real estate to be confirmed with James.
lfsw30 (UPS room) decided on as the destination.

Hosts moved to rack, renamed, and IP assigned; pending DI advertisement.

Compilation and rollout status with XrdCeph and rocky 8: 5.7.x

Thomas, Jyothish (STFC,RAL,SC)

5.7.2 published.
Investigating xrootd.redirect for write operations.

5.7.2 skipped on the farm due to the pfc bug.

Possible RAL release equivalent to 5.7.3, with a fix for that bug and 5.6.0 client compatibility.

Shoveler

Katy Ellis

Shoveler installation and monitoring

On the fly Checksums

Jira: XRD-98

Ian Johnson

Added configuration to PoC: option to turn on/off Adler32 on-the-fly calculation.

Proved ability to set XrdCks.adler32 attribute from “standalone” code (running from the command line), will incorporate this into PoC code next. (Wasted time looking for attribute in wrong file…)

Also to measure: throughput pattern (does this replicate the double throughput currently seen on the first checksum request?)

Discussed possible implementation as a plugin or in base XRootD.

crc32 also implemented here; noted that any new communities should use straight crc32 variants.
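For illustration, an on-the-fly Adler-32 with an on/off option can be sketched as a running checksum updated on each write. This is not the PoC code: `ChecksummingWriter`, its `enabled` flag and `hexdigest` are hypothetical names, writes are assumed to arrive sequentially, and storing the result in the XrdCks.adler32 attribute is left out.

```python
import io
import zlib


class ChecksummingWriter:
    """Hypothetical wrapper that checksums data as it is written."""

    def __init__(self, sink, enabled=True):
        self._sink = sink
        self._enabled = enabled  # stand-in for the on/off configuration option
        self._adler = 1          # Adler-32 starts from 1

    def write(self, chunk):
        if self._enabled:
            # Fold this chunk into the running value; valid only if chunks
            # arrive in file order (an assumption of this sketch).
            self._adler = zlib.adler32(chunk, self._adler)
        return self._sink.write(chunk)

    def hexdigest(self):
        # None when on-the-fly calculation is switched off.
        if not self._enabled:
            return None
        return format(self._adler & 0xFFFFFFFF, "08x")


sink = io.BytesIO()
writer = ChecksummingWriter(sink, enabled=True)
for chunk in (b"chunk-one,", b"chunk-two,", b"chunk-three"):
    writer.write(chunk)
print(writer.hexdigest())
```

Because the checksum is accumulated during the transfer, no second read of the data is needed once the write completes. `zlib.crc32` accepts a running value in the same way, so a crc32 variant of this sketch would follow the same pattern.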

Deletions

Jira: XRD-83

NTR

XRootD Writable Workernode Gateway Hackathon

Thomas, Jyothish (STFC,RAL,SC)

XRootD Writable Workernode Gateway Hackathon (XWWGH)

Hackathon writeable workernode

Sandbox with fixes present, tested on an LHCb workernode. Reading works fine as is; writes still need testing, to let jobs write only on that WN.

Jira: GSTSM-284

Xrootd testing framework

XRootD Site Testing Framework

Discussion in the Storage Meeting on how to integrate the various testing structures within the UK. Container with the testing framework TBD.

100 GbE Gateway testing:
SKA / Tier-1

James Walder; Thomas, Jyothish (STFC,RAL,SC)

UKSRC - XRootD acting as source for SRCNet verification tests; not being stressed so far …

Tier-1 cabled, but awaiting some work to progress on the switch.

UKSRC Storage Architecture

Following discussions, the DNS entries for the data and mgmt interfaces need to change, with netbox updated and the host reconfigured in AQ. The data network will be (exclusively) for the DTN / data traffic; mgmt is for ancillary needs (icinga, AQ). The host will be known via its mgmt DNS name (the canonical name).

Tokens Status

  • Operational

  • Technical

  • Accounting
