2024-01-04 Meeting Notes

 Date

Jan 4, 2024

 Participants

  • @Thomas, Jyothish (STFC,RAL,SC)

  • @Katy Ellis

  • @Thomas Byrne

  • Lancs:

  • Glasgow:

Apologies:

CC:

@Alastair Dewhurst

 

 

 Goals

  • List of Epics

  • New tickets

  • Consider new functionality / items

  • Detailed discussion of important topics

  • Site report activity

 

 Discussion topics

Current status of Echo Gateways / WNs testing

Recent sandbox’s for review / deployments:

 

Item

Presenter

Notes

 

Item

Presenter

Notes

 

Operational Issues over the Holiday period

@Thomas, Jyothish (STFC,RAL,SC)

 

brief VOMS authentication failure on svc02 - caused a spike in failures

 

Gateways and WNs:
- Current status and upcoming changes

@Thomas, Jyothish (STFC,RAL,SC)

stable status currently

  • tokens deployment cms/atlas

  • checksum library

  • prefetch off on WNs

To resist installing 5.6.4; before the break, one sets of sets (TPC transfers) was failing against another site. To repeat the tests and see

 

bugfix for calculating striper objects in direct reads

 

bugfix for calculating striper objects in direct reads by Jo-stfc · Pull Request #50 · stfc/xrootd-ceph

Any further comments on the PR; or ready to merge ?

@Katy Ellis to confirm a successful file dump read.

  • confirmed

JW to approve the merge

 

ECHO File transfer / throughput studies

@Katy Ellis

Tests of per-file transfer writes into Echo.

Report of initial set of tests presented of writes from a few sites to RAL, using different file sizes, and network links.
A new Jira is set up to track these changes: https://stfc.atlassian.net/browse/XRD-80
A number of proposed tests were suggested, including continuing with Davs; different sites; concurrent file transfer tests; and including low-level (e.g. iperf tests) to identify any bottlenecks and attempt to decouple and isolate as best as possible all the processes / layers involved.

 

 

Checksums fixes

@Alexander Rogovskiy

Status and plans for improving Checksumming work …

GitHub - alex-rg/xrd_ckslib
It seems to be possible to move checksum script execution to checksum library. That removes the necessity of core xrootd patching

It is probably also possible to compute checksums in the library itself (without the help of external script). Though it seems too dangerous..

https://stfc.atlassian.net/browse/XRD-56

 

 

Prefetch studies and WN changes

@Alexander Rogovskiy

Status and plans for Prefect study testing …

 

 

Deletion studies through RDR

@Ian Johnson

 

 

Tokens testing

@Thomas, Jyothish (STFC,RAL,SC) @Katy Ellis

https://stfc.atlassian.net/browse/XRD-63
https://stfc.atlassian.net/browse/XRD-78

avoid duplicating basepath by Jo-stfc · Pull Request #2151 · xrootd/xrootd

 

Understanding CMSD Loadbalancing

@Thomas Byrne

 

 

SKA Gateway box

@James Walder

https://stfc.atlassian.net/wiki/spaces/UK/pages/215941180

 

Architectural review ‘hackathon’

All

Plan the process for the Architectural planning of XRootD across the External Gateways and WNs

 

2024 Planning

 

JW to prepare a summary of the plans for 2024

 

 

on GGUS:

Site reports

Lancaster -

Glasgow -

 

 

 

 Action items

  • @James Walder to schedule a ‘hackathon’ within a F2F to have a session on architectural planning.

  • @Katy Ellis to run (and help coordinate) additional tests on File transfer performance, to understand the differences between SVC/new network and GW / legacy network hosts that suggest different levels of performance.

    • These tests to involve iperf testing, and concurrent FTS-issued file transfer testing.

  • @James Walder to prepare an outline of the expected roadmap for XRootD developments in 2024.

  •  

 

 Decisions

  1. @Alexander Rogovskiy to work on and help coordinate the development of a checksum library to perform metadata requests, calling the external checksum script, if a checksum is needed. This is agreed to be a stepping stone to a full implementation whereby the checksum itself may be caclulated by the checksum library.