CMSD rollout

The rollout is planned in phases. When a gateway is added to the redirector alias (managed by echo manager), it is put in the cmsd sandbox.

 

 

 

Current status

 

xrootd

webdav

cmsd

 

xrootd

webdav

cmsd

 

yes

no

passive

gw4

 

 

 

gw5

 

 

 

gw6

 

 

 

gw7

 

 

 

gw14

 

 

 

gw15

 

 

 

gw16

 

 

 

svc01

 

 

 

svc02

 

 

 

svc97

 

 

 

svc98

 

 

 

svc99

 

 

 

rdr

 

 

 

Phase 1

Single active gateway in the cmsd cluster, manager01 gets added to the xrootd alias . This substitutes gw14 in the xrootd and webdav aliases and ensures correctness of the initial change. any protocol/cmsd interaction unexpected behaviour should be caught here. Load behaviour and overheads should also become apparent.

 

 

xrootd

webdav

cmsd

 

xrootd

webdav

cmsd

 

yes

no

passive

gw4

 

 

 

gw5

 

 

 

gw6

 

 

 

gw7

 

 

 

gw14

 

 

 

gw15

 

 

 

gw16

 

 

 

svc01

 

 

 

svc02

 

 

 

svc97

 

 

 

svc98

 

 

 

svc99

 

 

 

rdr

 

 

 

Phase 2

For small scale testing, add additional gws to rdr, this is to test small scale load balancing. having 2 gateways might lead to instabilities (load ping-pong or fallback to single gw in case the other is loaded). The gateways are from different generations to evaluate impact on each. gws are kept within the xrootd/webdav aliases to evaluate the impact of cross-alias requests and for quick rollback capability. Atlas sends some transfer to rdr

 

xrootd

webdav

cmsd

 

xrootd

webdav

cmsd

 

yes

no

passive

gw4

 

 

 

gw5

 

 

 

gw6

 

 

 

gw7

 

 

 

gw14

 

 

 

gw15

 

 

 

gw16

 

 

 

svc01

 

 

 

svc02

 

 

 

svc97

 

 

 

svc98

 

 

 

svc99

 

 

 

rdr

 

 

 

Phase 3

All gws are added to rdr and VOs switch to rdr one by one

 

xrootd

webdav

cmsd

 

xrootd

webdav

cmsd

 

yes

no

passive

gw4

 

 

 

gw5

 

 

 

gw6

 

 

 

gw7

 

 

 

gw14

 

 

 

gw15

 

 

 

gw16

 

 

 

svc01

 

 

 

svc02

 

 

 

svc97

 

 

 

svc98

 

 

 

svc99

 

 

 

rdr

 

 

 

Phase 4 (a)

gw14 is reintroduced to the historic aliases. rdr becomes the default alias, but the xrootd/webdav aliases keep the current functionality, with partial load put into rdr (<10%)

 

xrootd

webdav

cmsd

 

xrootd

webdav

cmsd

 

yes

no

passive

gw4

 

 

 

gw5

 

 

 

gw6

 

 

 

gw7

 

 

 

gw14

 

 

 

gw15

 

 

 

gw16

 

 

 

svc01

 

 

 

svc02

 

 

 

svc97

 

 

 

svc98

 

 

 

svc99

 

 

 

rdr

 

 

 

Phase 4 (b)

When the previous phases pass, all other gws will be removed from the xrootd/webdav alias. All prod traffic will flow trough the redirector. This tests the capacity of the cmsd managers to cope with full prod traffic

 

xrootd

webdav

cmsd

 

xrootd

webdav

cmsd

 

yes

no

passive

gw4

 

 

 

gw5

 

 

 

gw6

 

 

 

gw7

 

 

 

gw14

 

 

 

gw15

 

 

 

gw16

 

 

 

svc01

 

 

 

svc02

 

 

 

svc97

 

 

 

svc98

 

 

 

svc99

 

 

 

rdr

 

 

 

Rollback

Phase1: the rdr dns will be removed from xrootd and webdav aliases and the previous set of gateways will be reinstated.

Phase2/3: VOs will switch back to xrootd/webdav aliases

Phase4: either of the previous depending on implementation