iTnews

Google sorry for widespread Docs outage

By Liam Tung on Sep 12, 2011 5:57AM

Real-time bug drains Google memory.

Google has attributed an hour-long outage of its Docs service last Wednesday to a service upgrade designed to improve real-time collaboration.

"We feel your pain and are very sorry," Alan Warren, Google’s engineering director, said in a blog post on Friday, explaining why the “majority” of Docs customers were unable to access document lists, documents, drawings and Apps Scripts between 2:02PM and 3:18PM Pacific Daylight Time on Wednesday 7 September.

While the outage officially only lasted an hour, according to Google's Apps Status Dashboard, users began reporting problems late Tuesday evening.

No Docs data was lost in the incident, according to Google’s incident report [PDF], although some edits made immediately prior to the outage may not have been saved.

Google's attempt to improve the collaboration features of Docs document lists exposed a memory management bug that affected the “lookup” machines used to monitor and execute modifications to a Google Doc.

The update “placed additional load on the service that manages the distribution of Docs processing” but the bug “accelerated and compounded” the load. 

“[T]he lookup machines didn’t recycle their memory properly after each lookup, causing them to eventually run out of memory and restart,” said Warren. 
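The failure mode Warren describes can be illustrated with a minimal Python sketch. It is not Google's code: the class names, the `scratch` list standing in for per-lookup memory, and the `memory_limit` threshold are all hypothetical, chosen only to show how a server that never releases per-lookup state eventually exhausts its memory and restarts, while one that recycles memory after each lookup does not.

```python
class LeakyLookupServer:
    """Hypothetical lookup machine that never releases per-lookup state."""

    def __init__(self, memory_limit):
        self.memory_limit = memory_limit  # entries held before memory runs out
        self.scratch = []                 # stand-in for per-lookup working memory
        self.restarts = 0

    def lookup(self, doc_id):
        # Bug: working data from every lookup is retained, never recycled.
        self.scratch.append(doc_id)
        if len(self.scratch) > self.memory_limit:
            # Out of memory: the machine restarts and this lookup fails.
            self.scratch.clear()
            self.restarts += 1
            return False
        return True


class FixedLookupServer(LeakyLookupServer):
    """Same hypothetical server, but memory is recycled after each lookup."""

    def lookup(self, doc_id):
        self.scratch.append(doc_id)
        try:
            return len(self.scratch) <= self.memory_limit
        finally:
            self.scratch.clear()  # release working memory on every lookup


def count_failures(server, n_lookups):
    """Run n_lookups lookups and count how many fail."""
    return sum(0 if server.lookup(i) else 1 for i in range(n_lookups))
```

In this toy model the leaky server fails one lookup each time its memory fills, so a higher lookup rate brings the restarts closer together, which is consistent with the report that extra load "accelerated and compounded" the problem.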

The bug’s impact - measured by the rate at which Google's servers failed to look up documents - escalated “sharply” within a minute of Google’s monitoring systems picking up the fault.

“The engineering teams diagnosed the problem, determined that it was correlated with the feature change, and started rolling it back 23 minutes after the first alert. In parallel, we doubled the capacity of the lookup service to mitigate the impact of the memory management bug,” said Warren. 

The scale of Google's outage was overshadowed by yet another outage of Microsoft's Office 365 and Hotmail last Friday, believed to have been caused by a power failure in Mexico.

Copyright © iTnews.com.au. All rights reserved.
Tags: cloud, docs, google, memory, outage, software
