2023-11-16 Meeting Notes
Date
Nov 16, 2023
Participants
@Thomas, Jyothish (STFC,RAL,SC)
@Thomas Byrne
@Alexander Rogovskiy
@James Walder
@Ian Johnson
Lancs: @Matt Doidge, Gerard, Steven
Glasgow: Sam
Apologies:
Goals
List of Epics
New tickets
Consider new functionality / items
Detailed discussion of important topics
Site report activity
Discussion topics
Current status of Echo Gateways / WNs testing
Recent sandboxes for review / deployments:
| Item | Presenter | Notes |
|---|---|---|
| XRootD gateway architecture review (what should XRootD access to Echo look like in a year's time?) | | https://stfc.atlassian.net/wiki/spaces/GRIDPP/pages/255262851 Ideas on xrootd batch farm architecture. Current state. Key questions. What to aim for: containerizing everything (shared containers across all hardware) is the preferred end state. Some system resource overhead should be reserved to keep the gateways running smoothly. WN gateways: |
| XRootD Releases | | 5.6.3-1 is out. Glasgow Lancs has been using it (el7 and rocky8) (no cmfst post centos7). May solve the problem with WN gateways that has been seen recently. Noted that cmsTFC needs to be compiled from source for EL8+, and that CMake errors on a particular 'warning'. |
| Checksums fixes | | Deployed to a single production server; further deployment next week, along with testing the speed. |
| Prefetch studies and WN changes | Alex | Planned to resume partial deployment over the farm in the week of the 20th. Increasing timeouts to reduce failures of some async requests. |
| Deletion studies through RDR | Ian | To follow up by looking for differences between requests made through XRootD and via rados commands directly (to spot where the 'long tail' may originate). For Echo metrics, the Kibana page is useful: https://kibana.gridpp.rl.ac.uk/goto/c962c51ce13283e9853334bbc79ca801 |
| Gateways: observations | | |
| Tokens testing | | To liaise with the Token Trust Traceability Taskforce (aka @Matt Doidge); report by the end of this month. CMS GGUS for enabling token auth. |
| SKA Gateway box | | https://stfc.atlassian.net/wiki/spaces/UK/pages/215941180 Deneb-dev routing still needed (on the switch / router side). Some tests with Ceph-dev and changing of the rados-striper. The difference between upload and download may be due to uploads coming from local disk while downloads go to /dev/null (to repeat with tmpfs). |
| WN Xcache issue | | futex lock hard-locking the XCache proxy on WNs (possibly an occurrence of "Deadlock in XCache's XrdCl instance", Issue #1979, xrootd/xrootd). |
| Containerised gateways (Kubernetes cluster) | | Working, but a few bugs still need ironing out, and it needs scaling up. |
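For the deletion study item in the table above, one possible way to see where a long tail comes from is to compare latency percentiles for deletions issued through XRootD against the same deletions issued with rados commands directly. The following is a minimal sketch only, assuming per-request deletion times (in seconds) have already been collected into two lists; the function names and sample values are hypothetical and are not taken from the Echo setup.

```python
from statistics import quantiles


def tail_summary(latencies_s):
    """Summarise a list of request latencies (seconds): median, p95, p99, max."""
    # quantiles(..., n=100) returns 99 cut points; index k-1 is the k-th percentile.
    cuts = quantiles(latencies_s, n=100, method="inclusive")
    return {
        "p50": cuts[49],
        "p95": cuts[94],
        "p99": cuts[98],
        "max": max(latencies_s),
    }


def compare_tails(xrootd_s, rados_s):
    """Per-percentile difference (XRootD minus direct rados), in seconds."""
    x = tail_summary(xrootd_s)
    r = tail_summary(rados_s)
    return {key: x[key] - r[key] for key in x}


if __name__ == "__main__":
    # Hypothetical sample timings, for illustration only.
    via_xrootd = [1, 2, 3, 4, 50]
    via_rados = [1, 2, 3, 4, 5]
    print(compare_tails(via_xrootd, via_rados))
```

If the medians agree but the p99 and max differ, the tail is more likely to be introduced in the XRootD path than in the underlying rados operations.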
on GGUS:
Site reports
Lancaster - Revisiting the tokens config after a long hiatus, aiming to have ATLAS tokens working for DC24. Testing is proving problematic as Matt keeps mucking up the client side.
Glasgow -
Action items