IBM researchers claim to have developed a distributed deep learning (DDL) platform with multiple servers that speeds up neural network model training considerably and improves accuracy.

They say [pdf] they trained the 101-layer ResNet neural network model on the ImageNet-22K image data set, which contains over 7.5 million high-resolution images in 22,000 categories and spans several terabytes.
The IBM researchers say they got the neural network model up to 33.8 percent validation accuracy in roughly seven hours, compared with earlier training tests by Microsoft that took ten days and scored below 30 percent accuracy.
Using the Caffe deep learning framework, developed at UC Berkeley, the researchers achieved 95 percent scaling efficiency across 256 GPUs.
Facebook's artificial intelligence research group had earlier recorded 89 percent scaling efficiency on a Caffe2 training run on a 256-GPU platform, but with higher communication overheads than the IBM platform.
This, IBM claims, is a new record.
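For readers unfamiliar with the metric, scaling efficiency measures how close a multi-GPU run gets to perfect linear speed-up over a single GPU. Here is a minimal sketch of the calculation; the single-GPU throughput figure is hypothetical, and only the GPU count and the reported percentages come from the article.

```python
def scaling_efficiency(observed_throughput, n_gpus, single_gpu_throughput):
    """Fraction of ideal linear speed-up actually achieved across n_gpus."""
    return observed_throughput / (n_gpus * single_gpu_throughput)

# Hypothetical example: if one GPU sustained 50 images/sec, a 256-GPU run
# would ideally process 12,800 images/sec. Sustaining 12,160 images/sec
# would correspond to the 95 percent efficiency IBM reports.
print(f"{scaling_efficiency(12_160, 256, 50):.0%}")  # -> 95%
```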
"These results are on a benchmark designed to test deep learning algorithms and systems to the extreme, so while 33.8 percent might not sound like a lot, it’s a result that is noticeably higher than prior publications," IBM reserch fellow Hillery Hunter said.
IBM says the technology could be used to train AI models for specific tasks, such as detecting cancer cells in medical images more quickly and accurately.
The researchers trained the neural network on 64 of IBM's Power Systems S822LC "Minsky" servers, each packing four Nvidia Tesla P100 graphics cards alongside IBM's Power8 processors, for 256 GPUs in total.
IBM's communication algorithm, which uses a multi-ring pattern, is implemented as a library and can be used with Google's TensorFlow and the Torch scientific computing framework.
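To give a flavour of the general idea, below is a minimal, self-contained sketch of a single-ring all-reduce, the basic communication pattern that ring-based gradient averaging builds on. It simulates the chunked reduce-scatter and all-gather phases in plain Python; the worker count, gradient sizes and function names are hypothetical, and this is not IBM's DDL library, which is described as using a multi-ring pattern rather than the single ring shown here.

```python
import numpy as np

def ring_allreduce(worker_grads):
    """Average equally sized gradient vectors, one per worker, using the
    reduce-scatter + all-gather schedule of a logical ring."""
    n = len(worker_grads)
    # Each worker splits its gradient into n chunks of (nearly) equal size.
    chunks = [list(np.array_split(g.astype(float), n)) for g in worker_grads]

    # Phase 1: reduce-scatter. In step s, worker i sends chunk (i - s) mod n
    # to its right-hand neighbour, which adds it to its own copy.
    for s in range(n - 1):
        sends = [(i, (i - s) % n, chunks[i][(i - s) % n].copy()) for i in range(n)]
        for i, c, payload in sends:
            chunks[(i + 1) % n][c] += payload

    # After n-1 steps, worker i holds the fully summed chunk (i + 1) mod n.
    # Phase 2: all-gather. Pass the completed chunks around the ring so every
    # worker ends up with every summed chunk.
    for s in range(n - 1):
        sends = [(i, (i + 1 - s) % n, chunks[i][(i + 1 - s) % n].copy()) for i in range(n)]
        for i, c, payload in sends:
            chunks[(i + 1) % n][c] = payload

    # Every worker now holds identical summed chunks; average and reassemble.
    return np.concatenate(chunks[0]) / n

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    grads = [rng.standard_normal(10) for _ in range(4)]  # 4 simulated workers
    assert np.allclose(ring_allreduce(grads), np.mean(grads, axis=0))
    print("ring all-reduce matches the direct average")
```

The appeal of ring-style schemes is that each worker only talks to its neighbours and the data sent per worker stays roughly constant as more workers join, which is why low communication overhead matters so much for scaling efficiency at 256 GPUs.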