NVIDIA Volta GPU Smashes AI Records

•Download as PPTX, PDF•

10 likes•3,630 views

NVIDIA Volta Tensor Core GPU achieves new AI performance milestones in ResNet-50 for a single chip, single node, and single cloud instance. Explore the performance improvements.

Technology

SHATTERING AI
PERFORMANCE RECORDS
NVIDIA Volta Tensor Core GPU Achieves
New AI Performance Milestones

GPU-POWERED DEEP LEARNING IS TRANSFORMING
EVERY INDUSTRY, SOLVING CHALLENGES ONCE
THOUGHT IMPOSSIBLE…

THE IDEAL AI COMPUTING PLATFORM NEEDS TO
PROVIDE IMPROVED PERFORMANCE, SCALABILITY
AND PROGRAMMABILITY TO ADDRESS THE
DIVERSITY OF MODEL ARCHITECTURES.

NVIDIA’S VOLTA TENSOR CORE GPU ACHIEVED
RECORD-SHATTERING RESNET-50 PERFORMANCE
FOR A SINGLE CHIP, SINGLE NODE, AND SINGLE
CLOUD INSTANCE.

FASTEST SINGLE CHIP
A single V100 Tensor Core GPU achieves
1,075 images/second when training
ResNet-50, a 4X performance increase
compared to the previous generation
Pascal GPU.
“New figures from NVIDIA illustrate the contribution hardware improvements can make to progress in machine learning: the AlexNet model that won
ImageNet in 2012 took six days to train, can now be done in 18 minutes — a 500x speedup.” - Tom Simonite, WIRED

FASTEST SINGLE NODE
A single DGX-1 server powered by eight
Tensor Core V100s achieves
7,850 images/second, almost 2X the 4,200
images/second from a year ago on the
same system.
“I feel like it’s important to note that these performance improvements [by NVIDIA] are more important than they immediately appear, because while
these gains dramatically impact today’s workloads, they’re effectively preempting even more complex workloads of the future.”
- Rob Williams, TechGage

FASTEST SINGLE CLOUD INSTANCE
A single AWS P3 cloud instance powered
by eight Tensor Core V100 GPUs can train
ResNet-50 in less than three hours, 3X
faster than a TPU instance.
“4 #TPU chips in a ‘Cloud TPU’ deliver 180 teraFLOPS of performance; by comparison, four V100 chips deliver 500 teraFLOPS. #NVIDIAwins.”
- Karl Freund, Moor Insights

NVIDIA TENSOR CORE GPU ARCHITECTURE ALLOWS US TO SIMULTANEOUSLY PROVIDE GREATER
PERFORMANCE THAN SINGLE-FUNCTION ASICS, YET BE PROGRAMMABLE FOR DIVERSE WORKLOADS.

EACH TESLA V100 TENSOR CORE GPU DELIVERS 125 TERAFLOPS OF PERFORMANCE FOR DEEP
LEARNING COMPARED TO 45 TERAFLOPS BY A GOOGLE TPU CHIP.
4 TPU CHIPS IN A ‘CLOUD TPU’ V2 DELIVER 180 TERAFLOPS OF PERFORMANCE.
BY COMPARISON, 4 NVIDIA V100 CHIPS DELIVER 500 TERAFLOPS OF PERFORMANCE.

EXPLORE THE PERFORMANCE IMPROVEMENTS
HERE

What's hot

AI For EnterpriseNVIDIA

NVIDIA 2017 OverviewNVIDIA

The AI Era Ignited by GPU Deep Learning NVIDIA

HPC Top 5 Stories: Nov. 21, 2016NVIDIA

DGX Sessions You Won't Want to Miss at GTC 2019NVIDIA

EPSRC CDT ConferenceAlison B. Lowndes

GTC China 2016NVIDIA

Opening Keynote at GTC 2015: Leaps in Visual ComputingNVIDIA

HPC Top 5 Stories: Nov. 11, 2016NVIDIA

Top 5 Data Science Sessions from GTC 2019NVIDIA

HPC Top 5 Stories: May 3, 2017NVIDIA

OpenACC Monthly Highlights - March 2018NVIDIA

GTC Europe 2017 KeynoteNVIDIA

Top 5 Stories in Design and Visualization - Nov. 20, 2017NVIDIA

Innovation RoundtableAlison B. Lowndes

Tales of AI agents saving the human race!Alison B. Lowndes

HPC Top 5 Stories: September 29, 2017NVIDIA

OpenACC Monthly Highlights- DecemberNVIDIA

NVIDIA Overview 2015NVIDIA

NVIDIA – Inventor of the GPUNVIDIA

What's hot (20)

AI For Enterprise

NVIDIA 2017 Overview

The AI Era Ignited by GPU Deep Learning

HPC Top 5 Stories: Nov. 21, 2016

DGX Sessions You Won't Want to Miss at GTC 2019

EPSRC CDT Conference

GTC China 2016

Opening Keynote at GTC 2015: Leaps in Visual Computing

HPC Top 5 Stories: Nov. 11, 2016

Top 5 Data Science Sessions from GTC 2019

HPC Top 5 Stories: May 3, 2017

OpenACC Monthly Highlights - March 2018

GTC Europe 2017 Keynote

Top 5 Stories in Design and Visualization - Nov. 20, 2017

Innovation Roundtable

Tales of AI agents saving the human race!

HPC Top 5 Stories: September 29, 2017

OpenACC Monthly Highlights- December

NVIDIA Overview 2015

NVIDIA – Inventor of the GPU

Similar to NVIDIA Volta GPU Smashes AI Records

TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...Willy Marroquin (WillyDevNET)

Dell and NVIDIA for Your AI workloads in the Data CenterRenee Yao

Tesla Accelerated Computing Platforminside-BigData.com

HPE and NVIDIA empowering AI and IoTRenee Yao

Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs Indrajit Poddar

GTC Taiwan 2017 企業端深度學習與人工智慧應用NVIDIA Taiwan

Deep Learning Update May 2016Frédéric Parienté

H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIASri Ambati

GPU Cloud Server in IndiaCloudtechtiq

GTC 2016 Opening KeynoteNVIDIA

Accelerated Computing: The Path ForwardNVIDIA

Ac922 watson 180208 v1IBM Sverige

Combine containerization and GPU acceleration on VMware: Dell PowerEdge R750 ...Principled Technologies

GTC Taiwan 2017 在 Google Cloud 當中使用 GPU 進行效能最佳化NVIDIA Taiwan

SQREAM DB on IBM Power9Ganesan Narayanasamy

Harnessing the virtual realm for successful real world artificial intelligenceAlison B. Lowndes

How to Run TensorFlow Cheaper in the Cloud Using Elastic GPUsAltoros

GPU/SSD Accelerates PostgreSQL - challenge towards query processing throughpu...Kohei KaiGai

Backend.AI Technical Introduction (19.09 / 2019 Autumn)Lablup Inc.

Advances in Accelerator-based CFD SimulationAnsys

Similar to NVIDIA Volta GPU Smashes AI Records (20)

TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...

Dell and NVIDIA for Your AI workloads in the Data Center

Tesla Accelerated Computing Platform

HPE and NVIDIA empowering AI and IoT

Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs

GTC Taiwan 2017 企業端深度學習與人工智慧應用

Deep Learning Update May 2016

H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIA

GPU Cloud Server in India

GTC 2016 Opening Keynote

Accelerated Computing: The Path Forward

Ac922 watson 180208 v1

Combine containerization and GPU acceleration on VMware: Dell PowerEdge R750 ...

GTC Taiwan 2017 在 Google Cloud 當中使用 GPU 進行效能最佳化

SQREAM DB on IBM Power9

Harnessing the virtual realm for successful real world artificial intelligence

How to Run TensorFlow Cheaper in the Cloud Using Elastic GPUs

GPU/SSD Accelerates PostgreSQL - challenge towards query processing throughpu...

Backend.AI Technical Introduction (19.09 / 2019 Autumn)

Advances in Accelerator-based CFD Simulation

Recently uploaded

04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science

The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad

Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

Presentation on how to chat with PDF using ChatGPT code interpreternaman860154

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

How to convert PDF to text with Nanonetsnaman860154

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech

Artificial Intelligence: Facts and MythsJoaquim Jorge

A Domino Admins Adventures (Engage 2024)Gabriella Davis

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

CNv6 Instructor Chapter 6 Quality of Servicegiselly40

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j

Recently uploaded (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx

The Codex of Business Writing Software for Real-World Solutions 2.pptx

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

Powerful Google developer tools for immediate impact! (2023-24 C)

Presentation on how to chat with PDF using ChatGPT code interpreter

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

How to convert PDF to text with Nanonets

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Advantages of Hiring UIUX Design Service Providers for Your Business

Artificial Intelligence: Facts and Myths

A Domino Admins Adventures (Engage 2024)

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...

2024: Domino Containers - The Next Step. News from the Domino Container commu...

08448380779 Call Girls In Civil Lines Women Seeking Men

The 7 Things I Know About Cyber Security After 25 Years | April 2024

08448380779 Call Girls In Friends Colony Women Seeking Men

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

CNv6 Instructor Chapter 6 Quality of Service

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

NVIDIA Volta GPU Smashes AI Records

1. SHATTERING AI PERFORMANCE RECORDS NVIDIA Volta Tensor Core GPU Achieves New AI Performance Milestones

2. GPU-POWERED DEEP LEARNING IS TRANSFORMING EVERY INDUSTRY, SOLVING CHALLENGES ONCE THOUGHT IMPOSSIBLE…

3. THE IDEAL AI COMPUTING PLATFORM NEEDS TO PROVIDE IMPROVED PERFORMANCE, SCALABILITY AND PROGRAMMABILITY TO ADDRESS THE DIVERSITY OF MODEL ARCHITECTURES.

4. NVIDIA’S VOLTA TENSOR CORE GPU ACHIEVED RECORD-SHATTERING RESNET-50 PERFORMANCE FOR A SINGLE CHIP, SINGLE NODE, AND SINGLE CLOUD INSTANCE.

5. FASTEST SINGLE CHIP A single V100 Tensor Core GPU achieves 1,075 images/second when training ResNet-50, a 4X performance increase compared to the previous generation Pascal GPU. “New figures from NVIDIA illustrate the contribution hardware improvements can make to progress in machine learning: the AlexNet model that won ImageNet in 2012 took six days to train, can now be done in 18 minutes — a 500x speedup.” - Tom Simonite, WIRED

6. FASTEST SINGLE NODE A single DGX-1 server powered by eight Tensor Core V100s achieves 7,850 images/second, almost 2X the 4,200 images/second from a year ago on the same system. “I feel like it’s important to note that these performance improvements [by NVIDIA] are more important than they immediately appear, because while these gains dramatically impact today’s workloads, they’re effectively preempting even more complex workloads of the future.” - Rob Williams, TechGage

7. FASTEST SINGLE CLOUD INSTANCE A single AWS P3 cloud instance powered by eight Tensor Core V100 GPUs can train ResNet-50 in less than three hours, 3X faster than a TPU instance. “4 #TPU chips in a ‘Cloud TPU’ deliver 180 teraFLOPS of performance; by comparison, four V100 chips deliver 500 teraFLOPS. #NVIDIAwins.” - Karl Freund, Moor Insights

8. NVIDIA TENSOR CORE GPU ARCHITECTURE ALLOWS US TO SIMULTANEOUSLY PROVIDE GREATER PERFORMANCE THAN SINGLE-FUNCTION ASICS, YET BE PROGRAMMABLE FOR DIVERSE WORKLOADS.

9. EACH TESLA V100 TENSOR CORE GPU DELIVERS 125 TERAFLOPS OF PERFORMANCE FOR DEEP LEARNING COMPARED TO 45 TERAFLOPS BY A GOOGLE TPU CHIP. 4 TPU CHIPS IN A ‘CLOUD TPU’ V2 DELIVER 180 TERAFLOPS OF PERFORMANCE. BY COMPARISON, 4 NVIDIA V100 CHIPS DELIVER 500 TERAFLOPS OF PERFORMANCE.

10. EXPLORE THE PERFORMANCE IMPROVEMENTS HERE

NVIDIA Volta GPU Smashes AI Records

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to NVIDIA Volta GPU Smashes AI Records

Similar to NVIDIA Volta GPU Smashes AI Records (20)

More from NVIDIA

More from NVIDIA (20)

Recently uploaded

Recently uploaded (20)

NVIDIA Volta GPU Smashes AI Records