Ninefold outage disables server provisioning

 

Spurs investment in second availability zone.

Australian cloud provider Ninefold has been unable to pinpoint the exact cause of a near five-hour outage last week that led to virtual server provisioning in its cloud environment being disabled.

The outage started at 9.20am on Thursday February 16 and was resolved at 2.05pm the same day, according to a post-incident report obtained by iTnews.

The report initially pinned the incident on "an unexpected failure" of an NFS server that "automatically restarted itself and resumed normal operation within minutes".

"A number of physical host servers (which support customer VMs) were performing operations to the NFS server at the time of the unexpected Network File System server failure," the report stated.

"Due to the nature of the NFS server failure, this caused provisioning on these specific physical host servers to become unresponsive.

"Eventually, some customer VMs on these particular host servers became unavailable and unable to restart on alternate physical hosts."

Ninefold managing director Peter James told iTnews today that the fault was in the NFS server hardware "which ultimately transmitted into a software failure".

James could not say how this occurred. "We're still working with engineers to determine the absolute underlying events," he said.

He said patches had been applied to the NFS storage system and that further upgrades were in the pipeline.

Provisioning taken offline

Ninefold was careful in describing the outage, stating it was "not... system wide".

The outage was confined to part of the virtual server side of Ninefold's business, where "some" virtual server instances stored on "a number" of physical host servers connected to the problematic NFS server were impacted. The outage did not touch a "large" number of virtual servers or physical hosts and had no impact on Ninefold's cloud storage business.

However, the outage also meant that no customers, across the board, could spin up new virtual servers over the five-hour period.

James confirmed that provisioning for all customers was disabled so that engineers could restore the VMs that had failed.

"We took a decision based on advice from our engineers," James said.

"The issue is that if we've got customers, albeit a small number of them, [that] are down, they're the ones that we absolutely focus on.

"We took a decision to take the appropriate steps to get them back up and running, albeit that on a Thursday it did mean that a number of customers couldn't provision but it did mean that we were able to reasonably quickly get that small number of customers who were more affected back up and running."

The post-incident report does not mention the number of physical host servers or customer VMs impacted and James declined to elaborate.

New availability zone

In response to the outage, James said Ninefold had doubled its investment in an already-planned core infrastructure enhancement project to $1 million. Installation work is expected "over the next few weeks".

The provider also plans to launch a second availability zone hosted in Macquarie Telecom's forthcoming Intellicentre 2 facility in North Ryde.

The zone is expected to be live in May, a month ahead of earlier expectations.

James said Ninefold would make a "significant investment as the anchor client" in the data centre.

He said Ninefold was "yet to work the detail out" on whether it might run its presence in Macquarie's two data centres in an active-active configuration.

"We're already underway with our planning, but we're yet to make that call," James said.

He said the launch of the zone was "largely aimed at increasing the resilience" of the service as it scaled up to meet customer growth.

Ninefold previously suffered a host server outage in August last year and another incident a few months earlier in May 2011.

Copyright © iTnews.com.au. All rights reserved.

