[LINK] Are the concepts of Hotsite and Warmsite dead?
Roger Clarke
Roger.Clarke at xamax.com.au
Fri Jan 24 08:36:35 AEDT 2020
[Once upon a time, single points-of-failure were addressed by various
forms of redundancy. It's expensive, and its exponentially difficult to
implement as the number of elements within the system increases.
[Yet an expectation arose that critical services would have features
such as hotsites that could pick up the load of a failed primary site in
real-time, and warmsites that could re-start a service in minutes to an
hour, while people scratched their heads trying to work out why the
primary site was down, and what to do about it.
[In recent years, any number of services have had long outages, some of
them with serious consequences. Some of those were still in-house
rather than be-clouded. Clearly the multiple bank and airline outages
should have had hotsite or at least warmsite recovery plans, and didn't.
[But, once you've switched to the cloud, surely it's easy, even
inherent. We were told by the spruikers that supply is elastic, and
more instances are run up in real-time. And it's all highly dispersed
and therefore single-point-of-failure issues are more manageable.
[I'm not sure how critical the ACT ESA's website is. It might be used
only to inform the public; or it might deliver operational services.
But, either way, you'd have expected inexpensive warmsite-like features
to be part of what an emergency services site would be about.
[What am I missing here?]
AWS outage cripples ACT Emergency Services Agency website as Canberra
bushfire rages
Wobble drags on through Thursday
Julian Bajkowski
itNews
Thu Jan 23 2020
The ACT Government’s Emergency Services Agency (ESA) has attributed a
website outage that hit in the middle of a rapidly escalating bushfire
between Canberra Airport and Queanbeyan to Thursday’s AWS outage in Sydney.
Capping off an already bad day for AWS after significant availability
problems hit its Sydney region, the ESA took to twitter to redirect
Canberrans to Facebook and local media to obtain current information on
the fire hitting the national capital that remains at a watch and act level.
The outage hit as Canberra Airport was shut to commercial traffic
because of the fire, with residents around Oaks Estate warned to get out
of the road of the oncoming blaze after two fires merged and engulfed a
rubbish tip.
It is still unclear why the ESA website was hit by a single point of
failure, however the blaze, known as the Beard fire, is burning close to
the industrial suburb of Fyshwick which houses several data centres.
The blaze near the airport is also within stone’s throw of the the
Australian Signals Directorate’s Australian Cyber Security Centre
offices at the Brindabella Park office complex that houses a clutch of
other technology, consulting and miltech tenants.
AWS users started noting problems with services around 11.15am AEDT with
the problems continuing at 4.00pm.
The issues affect services including EC2, elastic load balancing (ELB),
relational database service (RDS), AppStream 2.0, ElastiCache,
WorkSpaces and Lambda.
Update: The ESA's website was restored on Thursday evening as the fire
was downgraded to 'advice' level overnight.
--
Roger Clarke mailto:Roger.Clarke at xamax.com.au
T: +61 2 6288 6916 http://www.xamax.com.au http://www.rogerclarke.com
Xamax Consultancy Pty Ltd 78 Sidaway St, Chapman ACT 2611 AUSTRALIA
Visiting Professor in the Faculty of Law University of N.S.W.
Visiting Professor in Computer Science Australian National University
More information about the Link
mailing list