CMSD rollout
The rollout is planned in phases. When a gateway is added to the redirector alias (managed by echo manager), it is put in the cmsd sandbox.
Current status
| | xrootd | webdav | cmsd |
|---|---|---|---|
| | yes | no | passive |
| gw4 | | | |
| gw5 | | | |
| gw6 | | | |
| gw7 | | | |
| gw14 | | | |
| gw15 | | | |
| gw16 | | | |
| svc01 | | | |
| svc02 | | | |
| svc97 | | | |
| svc98 | | | |
| svc99 | | | |
| rdr | | | |
Phase 1
A single gateway is active in the cmsd cluster, and manager01 is added to the xrootd alias. This substitutes gw14 in the xrootd and webdav aliases and verifies the correctness of the initial change. Any unexpected behaviour in the protocol/cmsd interaction should be caught here. Load behaviour and overheads should also become apparent.
| | xrootd | webdav | cmsd |
|---|---|---|---|
| | yes | no | passive |
| gw4 | | | |
| gw5 | | | |
| gw6 | | | |
| gw7 | | | |
| gw14 | | | |
| gw15 | | | |
| gw16 | | | |
| svc01 | | | |
| svc02 | | | |
| svc97 | | | |
| svc98 | | | |
| svc99 | | | |
| rdr | | | |
Phase 2
For small-scale testing, additional gateways are added to rdr to test small-scale load balancing. Having only 2 gateways might lead to instabilities (load ping-pong, or fallback to a single gateway when the other is loaded). The gateways are taken from different generations to evaluate the impact on each. They are kept within the xrootd/webdav aliases to evaluate the impact of cross-alias requests and to allow a quick rollback. ATLAS sends some transfers to rdr.
| | xrootd | webdav | cmsd |
|---|---|---|---|
| | yes | no | passive |
| gw4 | | | |
| gw5 | | | |
| gw6 | | | |
| gw7 | | | |
| gw14 | | | |
| gw15 | | | |
| gw16 | | | |
| svc01 | | | |
| svc02 | | | |
| svc97 | | | |
| svc98 | | | |
| svc99 | | | |
| rdr | | | |
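The load ping-pong concern raised for Phase 2 can be illustrated with a toy model. This is only a sketch under strong assumptions and is not how cmsd selects servers: it assumes a redirector that sends all traffic in an interval to the gateway with the lowest last-reported load, and that reports are only refreshed once per interval. The function name, the request count and the decay factor are all hypothetical.

```python
def simulate_stale_load_routing(n_gateways, intervals,
                                requests_per_interval=100, decay=0.5):
    """Toy model: route each interval's traffic to the gateway whose
    last-reported load is lowest. Not a model of the real cmsd logic."""
    reported = [0.0] * n_gateways  # last load report per gateway
    history = []                   # index of the gateway chosen per interval
    for _ in range(intervals):
        # Pick the gateway that looked least loaded at the last report.
        target = min(range(n_gateways), key=lambda i: reported[i])
        history.append(target)
        # Old load decays, then the chosen gateway absorbs the new traffic.
        reported = [load * decay for load in reported]
        reported[target] += requests_per_interval
    return history


if __name__ == "__main__":
    # With two gateways the choice flips every interval.
    print(simulate_stale_load_routing(2, 6))  # [0, 1, 0, 1, 0, 1]
```

In this model, two gateways never share an interval's traffic: whichever one was just used always looks busier than the other at the next report, so the full load alternates between them.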
Phase 3
All gateways are added to rdr, and the VOs switch to rdr one by one.
| | xrootd | webdav | cmsd |
|---|---|---|---|
| | yes | no | passive |
| gw4 | | | |
| gw5 | | | |
| gw6 | | | |
| gw7 | | | |
| gw14 | | | |
| gw15 | | | |
| gw16 | | | |
| svc01 | | | |
| svc02 | | | |
| svc97 | | | |
| svc98 | | | |
| svc99 | | | |
| rdr | | | |
Phase 4 (a)
gw14 is reintroduced to the historic aliases. rdr becomes the default alias, but the xrootd/webdav aliases keep their current functionality, with partial load (<10%) put into rdr.
| | xrootd | webdav | cmsd |
|---|---|---|---|
| | yes | no | passive |
| gw4 | | | |
| gw5 | | | |
| gw6 | | | |
| gw7 | | | |
| gw14 | | | |
| gw15 | | | |
| gw16 | | | |
| svc01 | | | |
| svc02 | | | |
| svc97 | | | |
| svc98 | | | |
| svc99 | | | |
| rdr | | | |
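The partial-load target for Phase 4 (a) could be checked with something like the sketch below. It assumes that transfer counts are available separately for rdr and for the xrootd/webdav aliases over the same time window. The function names and the counters are hypothetical, and reading the 10% figure as a share of all transfers is an interpretation of the plan, not something it states.

```python
def rdr_share(rdr_transfers, alias_transfers):
    """Fraction of all transfers that went through rdr (hypothetical counters)."""
    total = rdr_transfers + alias_transfers
    if total == 0:
        return 0.0  # no traffic in the window, so nothing went through rdr
    return rdr_transfers / total


def within_phase4a_limit(rdr_transfers, alias_transfers, limit=0.10):
    """True if the rdr share stays below the assumed 10% limit."""
    return rdr_share(rdr_transfers, alias_transfers) < limit


if __name__ == "__main__":
    print(within_phase4a_limit(50, 950))   # 5% of transfers via rdr
    print(within_phase4a_limit(200, 800))  # 20% of transfers via rdr
```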
Phase 4 (b)
Once the previous phases have passed, all other gateways will be removed from the xrootd/webdav aliases. All prod traffic will then flow through the redirector. This tests the capacity of the cmsd managers to cope with full prod traffic.
| | xrootd | webdav | cmsd |
|---|---|---|---|
| | yes | no | passive |
| gw4 | | | |
| gw5 | | | |
| gw6 | | | |
| gw7 | | | |
| gw14 | | | |
| gw15 | | | |
| gw16 | | | |
| svc01 | | | |
| svc02 | | | |
| svc97 | | | |
| svc98 | | | |
| svc99 | | | |
| rdr | | | |
Rollback
Phase 1: the rdr DNS entry will be removed from the xrootd and webdav aliases and the previous set of gateways will be reinstated.
Phase 2/3: VOs will switch back to the xrootd/webdav aliases.
Phase 4: either of the previous, depending on implementation.
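For runbook-style tooling, the rollback rules above could be kept as a simple lookup. This is a minimal sketch: the dictionary and the function name are hypothetical, and the action strings only restate the rules listed here.

```python
# Rollback action per phase, restating the rules in this section.
ROLLBACK_ACTIONS = {
    "1": "remove rdr from the xrootd and webdav aliases; reinstate previous gateways",
    "2": "VOs switch back to the xrootd/webdav aliases",
    "3": "VOs switch back to the xrootd/webdav aliases",
    "4": "either of the previous, depending on implementation",
}


def rollback_action(phase):
    """Return the rollback action for a label such as '1' or 'Phase 4 (a)'.

    Only the first digit in the label is used, so '4 (a)' and '4 (b)'
    both map to phase 4.
    """
    digits = [ch for ch in phase if ch.isdigit()]
    if not digits or digits[0] not in ROLLBACK_ACTIONS:
        raise ValueError(f"unknown phase: {phase!r}")
    return ROLLBACK_ACTIONS[digits[0]]


if __name__ == "__main__":
    print(rollback_action("Phase 4 (a)"))
```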