2023-03-02 Meeting Notes

 Date

Mar 2, 2023

 Participants

  • @James Walder

  • @Thomas, Jyothish (STFC,RAL,SC)

  • @Thomas Byrne

  • @Ian Johnson

  • Lancs: Gerard, Matt, Steven

  • Manchester; Alessandra

 Goals

  • List of Epics

  • New tickets

  • Consider new functionality / items

  • Detailed discussion of important topics

  • Site report activity

 

 Discussion topics

Current status of Echo Gateways / WNs testing

  • ceph-gw14: CMSD configuration (connected to manager01/02, but transfers continuing through usual endpoints (e.g. ‘passive’ mode).

    • ceph-gw1,2,3, 8 pre-prod machines also for testing

  • lcg2269 and lcg2270 with Vector read patch (+ openssl fix)

  • WNs openssl fix rolling out

CMSD redirection:

  • Providing HA frontend

    • Discussion with JA on best practice.

    • No need to use the load balancer; set up the keep-alived on the hosts themselves

  • Forgot to request OPN IPs for the echo-manager hosts

Alice gateway congfiguration

  • Updates for space reporting and the related activities

 

Item

Presenter

Notes

 

Item

Presenter

Notes

 

simultaneous file reads issue

JW

Done (pending any further improvements.)
https://stfc.atlassian.net/browse/XRD-54

https://github.com/stfc/xrootd-ceph/pull/38
https://elog.gridpp.rl.ac.uk/Tier1/10945

 

Status of space reporting and tests on Alice dev gateway and setting up of a unified configuration.

 

https://stfc.atlassian.net/browse/XRD-21

Including unified setup

Reduced amount of logging, added Doxygen-style comments for methods Jyothish and Ian worked on.

Ease building XrdCeph RPM by emitting source tarball instead of source RPM (latter not directly used).

unified tpc setup sandboxed on dev

 

CMSD status

 

https://stfc.atlassian.net/browse/XRD-41

Firewall holes for the managers requested,
AAAA records for managers requested
https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=481569
https://helpdesk.gridpp.rl.ac.uk/Ticket/Display.html?id=481561

Initial LB config (other changes seem to have crept in)
http://aquilon.gridpp.rl.ac.uk/sandboxes/diff.php?sandbox=jw-xrootd-cmsd-loadbalance

https://phabricator.gridpp.rl.ac.uk/diffusion/AQ/browse/prod/ral-tier1/features/ceph/common-gw/config.pan

 

Monitoring of memory spikes

 

script developed, setting up sandbox with cron job

 

 

 

GGUS:

Deletion problem at RAL

Slow stat calls at RAL

Problem accessing some LHCb files at RAL

Site reports

 Action items

  • Create Jira for Checksumming updates for 3.7+ (especially for Rocky 9 releases).

  •  

 

 Decisions