An interesting post and you bring up many good points, I suspect we agree more than you think we do.
Well, I think we probably both agree on the value of good service management, and no organisation will ever be in the position where there's nothing to improve.
3) That depends on how you run your CAB. I and at least some others would look at a change as "A change in configuration or a significant change in the operation of the CI".
That's fair - personally I think this sort of load issue should be considered primarily at a bigger-picture level than CAB in the Design/Strategy stages - keep CAB focused on planned changes rather than external events. (The two meetings will likely have similar attendee lists, but probably not identical)
Ill turn this around

- what are you suggestions to address this matter, or do you believe nothing needs to be done?
While I've done process analysis and problem management, I wouldn't want to try doing it remotely on a company where I don't know any of the internal workings or technological details.
I don't think it's so much that I'm intrinsically lenient, as that the evidence I see suggests that Frontier do understand the problem, are moving in the right direction, and do learn from their previous failures [1]. But with a complex problem (or indeed Problem) there may be multiple causes needing to be dealt with before the user-facing symptoms are all sorted.
It is worth remembering that DWE2 is the largest ever expedition by a factor of about 10x on paper and probably around 4-5x in terms of signups who actually set off. That makes it ~100x bigger on paper / 40-50x bigger in practice than any *recent* expedition. And back in 2.0, the original 100-ship mass jump on DWE1 nearly broke the servers and got the organisers a "very nice! please don't do it again" from Frontier.
By mid 2.1, we were regularly having 50-80 meetups on planets as part of expeditions, and finishing them off with mass jumps, and we didn't break any servers at all (and nor did we ask Frontier for special extra servers in advance - it just worked!). There were major instancing stability improvements in 2.1 and 2.2, especially for the expedition case (i.e. no NPCs)
DWE2, though - that was over 1,000 ships mass-jumping (how much over, who knows?). That's at least 10x bigger than the previous mass jump record (and probably higher, even split over three platforms and timezones) and I suspect the "instancing problem" has at least some O(n^2) complexity [2]. That's exactly the sort of situation where doing load testing with a combination of test environments and extrapolation from previous performance is going to be highly unreliable and basically it comes down to someone making an educated guess.
Mass jumps are a particularly tough case, too - 1000 people drifting in to a system over a few hours and getting instanced up is one thing. 1000 people all making simultaneous instancing requests to the same supercruise location? That's going to be much bigger ... but how much bigger?
If DWE2 had been similar in size to DWE1, and broken the servers, that would have been very bad and I'd be a lot more critical. But even here there was evidence of improvements on previously:
- the Gnosis and the 3.3 launch day had serious instancing problems without even the sharp spike of a mass jump. Until the mass jump, instancing was basically working for the DWE2 launch, despite comparable numbers of players in-system to the Gnosis, and probably more of them simultaneously online.
- there were serious server issues following 3.3 launch (especially the new comms server freezes for system/squadron chats) which seemed to have been (correctly) a big priority for Frontier to fix before DWE2 launch.
- the US/Oceania mass-jumps didn't have the same issues. They may have been a bit smaller, but still, Frontier seem to have used the data from the Europe launch to feed their scaling decisions pretty quickly.
My prediction is that now Frontier have good data on the load caused by really big mass jumps they should be able to handle all future scheduled DWE2 mass jumps without wholesale service disruption. Whereas if you're right and their processes are inadequate to manage this sort of event, this will probably be a regular occurrence throughout the expedition.
I guess we should have a good idea of which it is in a few weeks?
[1] These are not things I would always have said. It was a continuous process, of course, but I went from feeling that Frontier didn't really get what "multiplayer Elite" implied when they released it (no-one else did either, of course) ... to having a good understanding of it around 2.2ish ... to actually being able to implement some of that understanding in the Beyond releases. If you look back at my posts from years back I was a lot more critical then and I think that's because they did more things to be critical *of* back then.
[2] Which probably means there's a size of mass jump that is literally impossible for Frontier to support. We're not there yet, but don't get your hopes up for it working smoothly if DWE3 is ten times bigger again
