1. Graph Analysis and Novel Architectures
Jason Riedy (all opinions my own, no plans)
Lucata Corporation / Emu Technology
Sparse Days, 24 November 2020
3. Graph Analysis v. Hardware Architecture
“We” want:
● Fine-grained memory access,
● fine-grained synchronization,
● sane floating-point (to be defined someday), and
● everything else that drives HW people nuts.
WHY NOT?
4. Graph Analysis v. Hardware Architecture
“It’s too hard.” Need wide memories, big cache lines, etc.
Nope.
Jeffrey Young, Eric Hein, Srinivas Eswar, Patrick Lavin, Jiajia Li, Jason Riedy, Richard Vuduc, and Thomas M. Conte. A Microbenchmark Characterization of the Emu Chick. Parallel Computing, September 2019. DOI 10.1016/j.parco.2019.04.012.
7. How? Being specific.
The Lucata / Emu architecture focuses on fine-grained memory access.
This really exists. And is PGAS. Because...
● No cache.
● The OS is handled by the “boring” part.
● Physically distributed memory.
● Many threads to tolerate…
● LOCAL LATENCIES.
○ Read remotely? MIGRATE.
○ Small context, one flit.
○ Plenty of references.
● Oh, and by the way…
○ Narrow-channel DRAM: no wasting cache lines (so not using only ⅛ of the bandwidth).
○ Memory-side processing.
○ Including floating-point accumulation.
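To make the access pattern concrete, here is a minimal software sketch (not Lucata code) of the workload these slides have in mind: a sum over scattered indices, where each access touches one useful word. On a cache-line machine each read drags in a whole line to use a few bytes; a migrating-thread machine moves the small thread context to the memory holding the word instead.

```python
import random

def sparse_gather_sum(values, idx):
    """Sum values at scattered indices: one useful word per access.

    On a cache-line machine each values[i] read pulls a whole line
    (e.g. 64 bytes) to use 8 bytes -- the 1/8-of-bandwidth problem.
    A migrating-thread machine instead moves the small thread
    context to the memory node holding values[i].
    """
    total = 0.0
    for i in idx:
        total += values[i]  # fine-grained remote read -> migration
    return total

random.seed(42)
n = 1 << 16
values = [1.0] * n
idx = random.sample(range(n), 1000)  # scattered, no spatial locality
print(sparse_gather_sum(values, idx))  # 1000.0
```

The sketch only models the access pattern; the hardware point is that no cache line is ever fetched for it.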
9. Not the only idea out there.
● Metastrider
● Maybe embed sparse gathers in memory (CAMs)...
● 5.3x energy savings
● 11% performance boost
Sriseshan Srikanth, Anirudh Jain, Joseph M. Lennon, Thomas M. Conte, Erik Debenedictis, and Jeanine Cook. 2019. MetaStrider: Architectures for Scalable Memory-centric Reduction of Sparse Data Streams. ACM Trans. Archit. Code Optim. 16, 4, Article 35 (January 2020), 26 pages. DOI: https://doi.org/10.1145/3355396
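The operation MetaStrider pushes into the memory system is a keyed reduction of a sparse stream. A software stand-in (not the hardware design) looks like this: duplicate keys are combined where the data lands, so they never cross the memory channel twice.

```python
def reduce_sparse_stream(stream, op=lambda a, b: a + b):
    """Reduce a stream of (key, value) pairs, combining duplicates.

    Software sketch of memory-centric reduction: the combine step
    happens at the storage side, so a duplicate key costs one
    in-memory op instead of a round trip per occurrence.
    """
    table = {}
    for key, value in stream:
        table[key] = op(table[key], value) if key in table else value
    return table

stream = [(3, 1.0), (7, 2.0), (3, 0.5), (1, 4.0), (7, 1.0)]
print(reduce_sparse_stream(stream))  # {3: 1.5, 7: 3.0, 1: 4.0}
```

Sparse matrix-vector products, histogramming, and graph frontier merges all reduce to this pattern.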
10. Totally nuts ideas………...
What if……
● You could have a hardware dataflow architecture? (Image borrowed from Cerebras Systems, Inc.)
● You could have “infinite” storage with logic?
● You could have programmable analog devices?
○ Neuromorphic? Waiting on the recount.
A Rogues Gallery photo!
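As a rough software analogy (not the actual programming model of any of these machines), a dataflow computation fires each node as its input tokens arrive rather than stepping a program counter. Python generators make a compact sketch:

```python
def source(data):
    # Emits one token per datum.
    for x in data:
        yield x

def scale(stream, a):
    # Node "fires" as each token arrives; no global program counter.
    for x in stream:
        yield a * x

def accumulate(stream):
    # Sink node: reduces the token stream to one value.
    total = 0.0
    for x in stream:
        total += x
    return total

# y = sum over the stream of 2 * x
result = accumulate(scale(source([1.0, 2.0, 3.0]), 2.0))
print(result)  # 12.0
```

In hardware the “nodes” are spatial compute elements and the tokens move over an on-chip fabric; the sketch only conveys the firing rule.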
13. The crazy thing is that all these exist.
So how are we taking advantage?
I apologize to the non-US folks. I only know our labs with testbeds:
● DoE: ORNL, LBNL, ANL, SNL (Sandia, not Saturday Night), …
● NSF: Georgia Tech’s Rogues Gallery, others…
● A64fx came from Japan / England.
● My preference baseline: RISC-V
○ (because you can bolt anything alongside)
No, really, go out and play!
Those ideas from the 80s and before? YUP!
BTW, there are open foundries now…
No reason why algorithms folks should be quiet.
My photos are thanks to the Franco-Berkeley Fund.