What actually happened with the site today?

  • Question
  • Updated 4 years ago
It's in the title. Heck of an update, huh?

Walt - KZ1F

  • 3040 Posts
  • 645 Reply Likes

Posted 4 years ago


Richard McClelland, AA5S

  • 296 Posts
  • 61 Reply Likes
Someone must have pooched the update.  No fun for him today.  I spent the day shoveling snow, so my conscience is clear.  I recall a day fourteen years ago when I allowed a software implementation to proceed with a service called Session_manager_server.  It should have been session_manager_server.  Worst outage of my career.  I'm pretty good at checking for capitalization errors now.
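
For illustration only, a minimal Python sketch of the kind of pre-deployment check that catches this class of error. The function name `find_case_mismatches` and the idea of comparing a configured list against an expected list are assumptions made for the example, not a description of the tooling actually involved in that outage.

```python
def find_case_mismatches(configured, expected):
    """Return (configured, expected) pairs that differ only by letter case.

    Hypothetical helper: `configured` are the names about to be deployed,
    `expected` are the names the rest of the system looks up.
    """
    expected_by_folded = {name.casefold(): name for name in expected}
    mismatches = []
    for name in configured:
        canonical = expected_by_folded.get(name.casefold())
        # Same name once case is ignored, but not byte-for-byte identical.
        if canonical is not None and canonical != name:
            mismatches.append((name, canonical))
    return mismatches


if __name__ == "__main__":
    print(find_case_mismatches(["Session_manager_server"],
                               ["session_manager_server"]))
```

Run against the two names from the story, the check reports the pair as a case-only mismatch, which is exactly what a case-sensitive lookup would silently fail on.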

Robbie - KI4TTZ

  • 484 Posts
  • 78 Reply Likes
My best guess: disk failure.  But maybe that's just because I have bruises from past disk array failures.  I really have no idea. :)

Tim - W4TME, Customer Experience Manager

  • 9198 Posts
  • 3558 Reply Likes
I was never able to get an answer from the GetSat folks, but for an 18-hour outage, it had to be a cascade of failures or a single-point failure of a piece of hardware that did not have a cold spare.

Ken - NM9P

  • 4239 Posts
  • 1351 Reply Likes
Ouch! What a headache!

Al / NN4ZZ

  • 1853 Posts
  • 672 Reply Likes
Sounds like they aren't sure of the root cause yet...still investigating.

Regards, Al / NN4ZZ  
al (at) nn4zz (dot) com
6700 - HW.................... V
SSDR / DAX / CAT...... V

Barry N1EU

  • 495 Posts
  • 124 Reply Likes
Evidently, quickly restorable backups, hardware/network redundancy, SLAs, and pre-production platforms for piloting new roll-outs aren't as familiar to those guys as they are to the rest of the IT world.

Walt - KZ1F

  • 3040 Posts
  • 645 Reply Likes
Well, at least for a day people weren't asking about when their Maestro will arrive. Silver Linings Playbook!

KY6LA - Howard, Elmer

  • 3784 Posts
  • 1637 Reply Likes
Systems usually fail on Christmas Eve, New Year's Eve, and Super Bowl Sunday, when salaried staff are off and incompetent management bosses (me) get stuck showing up to restart things.

I must have missed a dozen Super Bowls.

Walt - KZ1F

  • 3040 Posts
  • 645 Reply Likes
So much for their 5 9s. Thankfully, we never had that problem at Monster.
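
For a rough sense of scale, here is a back-of-the-envelope Python sketch of what "five nines" allows compared with the 18-hour outage mentioned earlier in the thread. It assumes a 365-day year and that this outage is the only downtime in that year; the function names are made up for the example.

```python
HOURS_PER_YEAR = 365 * 24  # 8760, ignoring leap years (assumption)


def allowed_downtime_minutes(nines):
    """Yearly downtime budget, in minutes, for an availability of N nines."""
    unavailability = 10 ** -nines  # e.g. 5 nines -> 0.00001
    return HOURS_PER_YEAR * 60 * unavailability


def availability_after_outage(outage_hours):
    """Yearly availability if `outage_hours` is the only downtime that year."""
    return 1 - outage_hours / HOURS_PER_YEAR


if __name__ == "__main__":
    # Five nines leaves a budget of only a few minutes per year.
    print(f"five-nines budget: {allowed_downtime_minutes(5):.2f} min/year")
    # A single 18-hour outage lands between two and three nines.
    print(f"after 18 h outage: {availability_after_outage(18):.4%}")
```

Under those assumptions, one 18-hour outage uses up roughly two centuries' worth of a five-nines downtime budget.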