Doug,
On January 4, 2018 at 11:50:02 PM, Doug Barton (dougb@dougbarton.email) wrote:Since a little before September when the 8145 data started rolling in
all I've heard discussed is the risk to the deployed base if we do the
roll and their stuff breaks. But there is another, arguably greater risk
that is not being discussed, what happens if we get ourselves into a
position where we are forced to do an emergency roll? (The common
scenarios for that are key compromise, which is very unlikely but not
impossible, and alg failure.)If they key gets lost or compromised, my understanding is that we cannot use RFC 5011 to do the roll and must fall back to doing an out-of-band key rollover. We aren’t really exercising this under this iteration of the community defined KSK rollover plan.
I was a founding member of the SSAC before I joined the ICANN staff. While it's been some time since I participated with them, I feel I understand their remit pretty well. The recommendation you refer to reads (sorry for any copy/paste issues):There are only two conditions that can be true at this point:[…]
If #1 is true we should do the roll ASAP […]
If #2 is true we should do the roll ASAP […]As I’ve noted previously, this would appear to argue that SAC-063 rec#3 should not have been made and that the amount of “breakage” is irrelevant. It would be nice if SSAC were to weigh in on this.
It is expected that there will be some issues during at least the first KSK rollover, andTo me, that recommendation seems to provide a solid balance between acknowledging that there will be problems, and also taking the importance of stability into account by asking for both a rollback plan and a criteria for the rollback decision. That all seems perfectly reasonable and appropriate. The SSAC is asked to provide advice on both Security and Stability. in this case, adding security (by showing that a 5011 roll can be performed with a minimum of disruption) requires a small, but necessary sacrifice in stability. That makes this issue no different than other, similar issues; like IPv6, IDNs, the new gTLD program, or even the introduction of DNSSEC itself.
probably the next few. It will not be possible to anticipate all the problems that may
occur but an agreed understanding of when the rollover has affected operational stability
beyond a reasonable boundary is essential so the decision to rollback the rollover can be
made quickly and efficiently.