Firmware update blamed for Office365 outage

 

Caused Office365, Windows Live, Azure platform management to cut out.

Microsoft has blamed a botched update applied to a load balancer for an outage that cut access to a number of hosted services including Office365 and Windows Live a fortnight ago.

The outage, which occurred on September 9, was initially thought to have been caused by power cuts in Southern California, although speculation had centred on whether it was related to the domain name system (DNS).

A post-mortem released by Microsoft confirmed the cause was a fault in the company's DNS service.

"A tool that helps balance network traffic was being updated and the update did not work correctly," Windows Live test and service engineering vice president Arthur de Haan said.

"As a result, configuration settings were corrupted, which caused a service disruption."

The file corruption occurred for two reasons, Microsoft said. Firstly, the load balancing tool was unable to parse an incorrectly constructed line in the updated configuration file, de Haan said.

"The second condition was related to how the configuration is synchronised across the DNS service to ensure all client requests return the same response regardless of the connection location of the client," he said.

"Each of these conditions was tracked to the networking device firmware used in the Microsoft DNS service."

De Haan said Microsoft was focused on hardening the DNS service, improving redundancy and failover capabilities.

"We are also developing an additional recovery process that will allow a specific property the ability to fail over to restore service and then fail back when the DNS service is restored," he said.

"In addition, we are reviewing the recovery tools to see if we can make more improvements that will decrease the time it takes to resolve outages."

Copyright © iTnews.com.au. All rights reserved.

