Around the same time, Cloudflare’s chief technology officer Dane Knecht explained that a latent bug was responsible in an apologetic X post.

“In short, a latent bug in a service underpinning our bot mitigation capability started to crash after a routine configuration change we made. That cascaded into a broad degradation to our network and other services. This was not an attack,” Knecht wrote, referring to a bug that went undetected in testing and has not caused a failure.

  • monkeyslikebananas2@lemmy.world
    link
    fedilink
    English
    arrow-up
    6
    ·
    9 hours ago

    Not really. Sometimes there are processes designed where engineers will make a change as a reaction or in preparation for something. They could have easily made a mistake when making a change like that.

    • 123@programming.dev
      link
      fedilink
      English
      arrow-up
      3
      ·
      edit-2
      8 hours ago

      E.g.: companies that advertise on a large sporting event might preemptively scale up (maybe warm up depending on language) their servers in preparation for a large load increase following some ad or mention of a coupon or promo code. Failure to capture the market it could generate would be seen as wasted $$$

      Edit: auto-scale does not count on non essential products, people would not come back if the website failed to load on the first attempt.

    • NotMyOldRedditName@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      8 hours ago

      I don’t think it was a bug making the configuration change, I think there was a bug as a result of that change.

      That specific combination of changes may not have been tested, or applied in production for months, and it just happened to happen today when they were needed for the first time since an update some time ago, hence the latent part.

      But they do changes like that routinely.

      • monkeyslikebananas2@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        8 hours ago

        Yeah, I just read the postmortem. My response was more about the confusion that any configuration change is inherently non-routine.