uptime (or more appropriately, downtime)

Plan: record downtimes over the period of a year so that I can figure out downtime percentage

ie, 99% uptime is .01*365.25 = 3.6525 days of downtime

99.999% uptime is .00001*365.25 = 5.25 minutes of downtime

"five nines" is what I would prefer. Two nines is what I get.

Work (roughly 10k users in AD; not sure how many duplicate accounts)

20100512: power outage, more than 10 minutes. No warning, no explanation

20100327: power outage, less than 10 minutes. No warning, no explanation

20100201: network outage, 12 hours notice. Planned for 11pm-midnight for router fix.

20100116: power outage, between 10 and 15 minutes. No warning, no explanation

20091210: internet unavailable for 10 minutes, 7pm. No warning, no explanation

20090907 - 09: network disconnected due to policy violation for office room (Monday 7pm to Wednesday 3pm). 8 people affected. Policy was incorrectly enforced and no communication was made, resulting in 43 hours of no internet. There were other resources available, but severe inconvenience plus much administrative communication lead to almost no work being completed during outage.

20090821: network services outage. Projected noon to 1pm, but turned out to be noon to 3pm. 30 minute warning.

20090810: [power outage]: between midnight and 8am, unknown duration (but longer than 10 minutes). No warning or explanation.

20090801: [power outage]: 7.5 hours. 7:10a-1:40p 1 week warning. all computers had 75 days of uptime prior to outage.

20090724: [cluster filesystem corruption]: needed to restart all jobs (180 CPUs)

20090516: [power outage]: 1.5 hours, no warning, no explanation.

20090508: [power outage]: 10 seconds (due to storm). UPS was able to cover outage

20090419: "network-wide issues" no access to external DNS. 4pm - ?

20090409: [wireless outage]: 10 am, 15 minutes. Notice that it was down and notice when it was back up. No warning.

20090323: [power outage]: 2 hours. No warning or explanation

20090315: [internet outage] midnight to 6am. IT upgrade

-three day warning

20090221: [internet outage] 11pm - 5am. IT upgrade

-two day warning

20090211: [wireless outage] 6-8am. IT upgrade

20090206: [power outage] about an hour. 7:45pm, so I shutdown safely (thanks, UPS) and went home

-no warning, no explanation

-server had 38 days of uptime prior to the outage

20090130: [internet outage, all users] 10 minutes

-no warning, no explanation

20081222: [power outage] 7am - 8:30am, 1.5 hours

20081230: [power outage] 3:45pm - 5:15pm, 1.5 hours

-20 minute warning lead-time

20081113: [power outage] 3 hours

20081108: [core router failure] 4:20pm. 15 minutes

20081104: [network outage] 8am - 11am

20081016: [networked drives unavailable] noon - 5am (17 hrs)

20080809: [power outage] 4 hours

-2 day warning lead-time