TomTom handles large volumes of geospatial data. The shape of this data poses some unique challenges, but also some opportunities to exploit, when it comes to distributed processing. In this talk we shed some light on the data processing pipeline we have built and do a deep dive into geospatial indexing on top of HBase.
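A common way to build a spatial index on top of HBase's lexicographically sorted row keys is a space-filling curve: quantize latitude and longitude and interleave their bits (a Z-order / Morton code), so nearby points share long key prefixes. The talk doesn't spell out TomTom's exact key schema, so this is a minimal sketch of the general technique, with illustrative coordinates:

```python
def morton_key(lat: float, lon: float, bits: int = 32) -> bytes:
    """Quantize lat/lon to `bits`-bit integers and interleave their bits
    (Z-order / Morton code) so nearby points share row-key prefixes."""
    y = int((lat + 90.0) / 180.0 * ((1 << bits) - 1))
    x = int((lon + 180.0) / 360.0 * ((1 << bits) - 1))
    z = 0
    for i in range(bits):
        z |= ((x >> i) & 1) << (2 * i)       # lon bit at even position
        z |= ((y >> i) & 1) << (2 * i + 1)   # lat bit at odd position
    # Big-endian bytes preserve numeric order under HBase's byte-wise sort.
    return z.to_bytes(2 * bits // 8, "big")

# Nearby points yield keys with a long shared prefix; distant points don't:
a = morton_key(52.3676, 4.9041)     # Amsterdam
b = morton_key(52.3702, 4.8952)     # also Amsterdam
c = morton_key(-33.8688, 151.2093)  # Sydney
```

Because HBase sorts rows by key bytes, features that are close on the map end up close on disk, which is what makes spatial range reads cheap later on.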
So how do we handle that multitude of concurrent writes? A PostgreSQL cluster handles them, giving us transactional integrity as well.
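The value of routing writes through a transactional store is that a batch of feature updates lands atomically or not at all. A minimal sketch of that pattern, with sqlite3 standing in for the PostgreSQL cluster and an illustrative table schema (the transaction semantics shown are the same):

```python
import sqlite3

# sqlite3 stands in for PostgreSQL here; table and column names are
# illustrative, not the actual production schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE features (id INTEGER PRIMARY KEY, wkt TEXT NOT NULL)")

def write_batch(conn, rows):
    """Write a batch of features atomically: all rows land or none do."""
    try:
        with conn:  # opens a transaction; commits on success, rolls back on error
            conn.executemany("INSERT INTO features (id, wkt) VALUES (?, ?)", rows)
        return True
    except sqlite3.IntegrityError:
        return False

ok = write_batch(conn, [(1, "POINT(4.9 52.4)"), (2, "POINT(4.8 52.3)")])
# Duplicate primary key: the whole batch rolls back, including row 3.
dup = write_batch(conn, [(3, "POINT(0 0)"), (1, "POINT(1 1)")])
count = conn.execute("SELECT COUNT(*) FROM features").fetchone()[0]
```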
However, we also need to read lots of features to create our output. How do we do this?
So we can store the data in a read-optimized way. How can we read from it in a performant way?
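With spatially sorted row keys, "read all features in an area" becomes a range scan between a start and a stop row rather than a full-table read. A sketch of that prefix scan over an in-memory sorted key list (the geohash-style keys are illustrative, not the actual schema):

```python
from bisect import bisect_left

# HBase keeps rows lexicographically sorted by key; a prefix scan touches
# only the contiguous range of matching keys.
rows = sorted([b"u173z:1", b"u176b:2", b"u176c:3", b"u176x:4", b"u33d0:5"])

def prefix_scan(rows, prefix: bytes):
    """Return all row keys starting with `prefix`, visiting only that range."""
    start = bisect_left(rows, prefix)  # like an HBase startRow
    out = []
    for key in rows[start:]:
        if not key.startswith(prefix):
            break  # past the range: stop early, like an HBase stopRow
        out.append(key)
    return out

hits = prefix_scan(rows, b"u176")
```

An actual HBase scan would set `startRow`/`stopRow` (or a prefix filter) on a `Scan` object the same way; the point is that the read cost is proportional to the size of the area, not the size of the table.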
We are now capable of reading efficiently from HBase. So how do we transform that data?
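Because the scan returns rows in key order, downstream transformation can exploit that locality: grouping rows by a key prefix yields spatially coherent partitions that can each be processed independently, e.g. one task per tile. A small sketch under that assumption (key format and tile prefix length are illustrative):

```python
from collections import defaultdict

def partition_by_tile(rows, tile_len: int = 4):
    """Group sorted row keys by their leading `tile_len` bytes, so each
    partition covers one spatially contiguous tile."""
    parts = defaultdict(list)
    for key in rows:
        parts[key[:tile_len]].append(key)
    return dict(parts)

parts = partition_by_tile([b"u173z:1", b"u176b:2", b"u176c:3", b"u33d0:5"])
```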