SlideShare a Scribd company logo
1 of 68
Efficient RDF Interchange (ERI) 
Format for RDF Data Streams 
Javier D. Fernández, Alejandro Llaves, Oscar Corcho 
Ontology Engineering Group (OEG) 
Universidad Politécnica de Madrid, Spain
Outline 
Index 
1. Introduction & Motivation 
2. Background 
3. Efficient RDF Interchange (ERI) Format 
i. Basic Concepts 
ii. ERI Streams 
iii. Practical Deployment 
4. Evaluation 
5. Conclusions and Next steps 
2
INTRODUCTION - Static data versus RDF data streams 
3
INTRODUCTION - Static data versus RDF data streams 
3 
Files 
Extract Transform Load 
DBMS 
Spatial Information 
Web APIs 
Linked Data discovery
INTRODUCTION - Static data versus RDF data streams 
3 
Files 
Extract Transform Load 
DBMS 
Spatial Information 
Web APIs 
Linked Data discovery
INTRODUCTION - Static data versus RDF data streams 
3 
Files 
Extract Transform Load 
DBMS 
Spatial Information 
Web APIs 
Linked Data discovery 
“Most semantic tools are focused on 
this static view”
INTRODUCTION - Static data versus RDF data streams 
RDF Data Streams are gaining momentum, generated from any type 
of data stream, and combining real-time and historical data. 
©Wilgengebroed on Flickr, Mr3641, ProtoplasmaKid and ISA Internationales Stadtbauatelier in commons wikimedia 
3
INTRODUCTION - Static data versus RDF data streams 
3
INTRODUCTION - Static data versus RDF data streams 
3
INTRODUCTION - Static data versus RDF data streams 
3
INTRODUCTION - Static data versus RDF data streams 
3
INTRODUCTION - Static data versus RDF data streams 
RDF streams: potentially unbounded sequences of timestamped 
RDF statements or graphs. 
3
INTRODUCTION - Static data versus RDF data streams 
RDF streams: potentially unbounded sequences of timestamped 
RDF statements or graphs. 
3 
user1_observation [t1] 
weather1_observation [t1] 
user2_observation 
[t3] 
…
INTRODUCTION - Static data versus RDF data streams 
RDF streams: potentially unbounded sequences of timestamped 
RDF statements or graphs. 
3 
t 
w1 w2 w3 
u1 u2 u3 u4 
Stream 
user1_observation [t1] 
weather1_observation [t1] 
user2_observation 
[t3] 
…
INTRODUCTION - Motivation 
Achieve efficient transmission of RDF streams, a necessary step to 
ensure higher throughput for RDF Stream processors 
3 
Stream source 
Stream source 
Stream source 
Stream source 
Stream 
Processor 
Engine 
Historic 
Information 
C-SPARQL, 
SPARQLStream 
morph-streams 
CQELS Cloud 
Ztreamy 
… 
Stream source 
queries 
Continuous results
INTRODUCTION – Motivation - Requirements 
16 
Efficient transmission of RDF streams: 
• Streamable 
• Scalable 
• Easy (fast) to process (create and parse) 
• Compact 
• Parametrizable (several tradeoffs compression/time)
BACKGROUND 
17 
Plain: 
Turtle/ 
Trig/ 
JSON-LD 
Plain 
+Compression 
(e.g. gzip) HDT 
Streaming 
HDT RDSZ 
RDF/XML 
+ EXI ERI 
Streamable Yes Yes No Yes Yes Yes Yes 
Scalable Limited Yes Yes No Yes Yes Yes 
Easy (fast) to 
Yes Limited Limited Yes Limited Limited Yes 
create and parse 
Compact No Yes Yes Limited Yes Yes Yes 
Parametrizable: 
No Limited Yes No Limited Limited Yes 
compression/time
Outline 
Index 
1. Introduction & Motivation 
2. Background 
3. Efficient RDF Interchange (ERI) Format 
i. Basic Concepts 
ii. ERI Streams 
iii. Practical Deployment 
4. Evaluation 
5. Conclusions and Next steps 
18
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
19 
• (Assumption) Most RDF streams are well structured 
structure 
• the is well-known by the data provider 
• the number of variations 
in the structure are limited
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
20 
• (Assumption) Most RDF streams are well structured 
structure 
• the is well-known by the data provider 
• the number of variations 
in the structure are limited 
• Efficient RDF Interchange (ERI) Format encodes the information at 
two levels:
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
21 
• (Assumption) Most RDF streams are well structured 
structure 
• the is well-known by the data provider 
• the number of variations 
in the structure are limited 
• Efficient RDF Interchange (ERI) Format encodes the information at 
two levels: 
• A sliding dictionary of structures: Structural Dictionary
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
22 
• (Assumption) Most RDF streams are well structured 
structure 
• the is well-known by the data provider 
• the number of variations 
in the structure are limited 
• Efficient RDF Interchange (ERI) Format encodes the information at 
two levels: 
• A sliding dictionary of structures: Structural Dictionary 
• The concrete value for each predicate
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
23 
• (Assumption) Most RDF streams are well structured 
structure 
• the is well-known by the data provider 
• the number of variations 
in the structure are limited 
• Efficient RDF Interchange (ERI) Format encodes the information at 
two levels: 
• A sliding dictionary of structures: Structural Dictionary 
• The concrete value for each predicate
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
24 
• (Assumption) Most RDF streams are well structured 
structure 
• the is well-known by the data provider 
• the number of variations 
in the structure are limited 
• Efficient RDF Interchange (ERI) Format encodes the information at 
two levels: 
• A sliding dictionary of structures: Structural Dictionary 
• The concrete value for each predicate
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
25 
ID-31 ID-32 
Structural 
Dictionary 
“7.7”^^xsd:float “9.4”^^xsd:float 
t 
w1 w2 w3 
u1 u2 u3 u4 
Stream 
temper 
ature 
Casual 
user 
Anual 
pass 
wind 
ID-30 
ID-33 
… 
weather: 
TemperatureObservation 
rdf:type 
weather: 
AirTemperature 
ssn:observedProperty 
??? 
ex:CelsiusValue 
… … 
…
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
26 
ID-31 ID-32 
Structural 
Dictionary 
“7.7”^^xsd:float “9.4”^^xsd:float 
t 
w1 w2 w3 
u1 u2 u3 u4 
Stream 
temper 
ature 
Casual 
user 
Anual 
pass 
wind 
ID-30 
ID-33 
… 
weather: 
TemperatureObservation 
rdf:type 
weather: 
AirTemperature 
ssn:observedProperty 
??? 
ex:CelsiusValue 
… … 
… 
molecule
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
27 
ID-31 ID-32 
Structural 
Dictionary 
“7.7”^^xsd:float “9.4”^^xsd:float 
t 
w1 w2 w3 
u1 u2 u3 u4 
Stream 
temper 
ature 
Casual 
user 
Anual 
pass 
wind 
ID-30 
ID-33 
… 
weather: 
TemperatureObservation 
rdf:type 
weather: 
AirTemperature 
ssn:observedProperty 
??? 
ex:CelsiusValue 
… … 
… 
molecule
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
28 
ID-31 ID-32 
Structural 
Dictionary 
“7.7”^^xsd:float “9.4”^^xsd:float 
t 
w1 w2 w3 
u1 u2 u3 u4 
Stream 
temper 
ature 
Casual 
user 
Anual 
pass 
wind 
ID-30 
ID-33 
… 
weather: 
TemperatureObservation 
rdf:type 
weather: 
AirTemperature 
ssn:observedProperty 
??? 
ex:CelsiusValue 
… … 
… 
molecule
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
29 
ID-31 ID-32 
Structural 
Dictionary 
“7.7”^^xsd:float “9.4”^^xsd:float 
t 
w1 w2 w3 
u1 u2 u3 u4 
Stream 
temper 
ature 
Casual 
user 
Anual 
pass 
wind 
ID-30 
ID-33 
… 
weather: 
TemperatureObservation 
rdf:type 
weather: 
AirTemperature 
ssn:observedProperty 
??? 
ex:CelsiusValue 
… … 
… 
molecule
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
30 
ID-31 ID-32 
Structural 
Dictionary 
“7.7”^^xsd:float “9.4”^^xsd:float 
t 
w1 w2 w3 
u1 u2 u3 u4 
Stream 
temper 
ature 
Casual 
user 
Anual 
pass 
wind 
ID-30 
ID-33 
… 
weather: 
TemperatureObservation 
rdf:type 
weather: 
AirTemperature 
ssn:observedProperty 
??? 
ex:CelsiusValue 
… … 
… 
molecule
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
31 
ID-31 ID-32 
Structural 
Dictionary 
“7.7”^^xsd:float “9.4”^^xsd:float 
t 
w1 w2 w3 
u1 u2 u3 u4 
Stream 
temper 
ature 
Casual 
user 
Anual 
pass 
wind 
ID-30 
ID-33 
… 
weather: 
TemperatureObservation 
rdf:type 
weather: 
AirTemperature 
ssn:observedProperty 
??? 
ex:CelsiusValue 
… … 
… 
molecule
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
32 
ID-31 ID-32 
Structural 
Dictionary 
“7.7”^^xsd:float “9.4”^^xsd:float 
t 
w1 w2 w3 
u1 u2 u3 u4 
Stream 
temper 
ature 
Casual 
user 
Anual 
pass 
wind 
ID-30 
ID-33 
… 
weather: 
TemperatureObservation 
rdf:type 
weather: 
AirTemperature 
ssn:observedProperty 
??? 
ex:CelsiusValue 
… … 
… 
molecule
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
33 
• ERI processing model 
• Minimal Information Unit is a molecule: 
• We initially restrict to subject molecules
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
sens-obs:Observation_AirTemperature_4UT01_2003_3_31_6_55_00 
a weather:TemperatureObservation ; 
rdfs: label “Air temperature at 6:55:00”, “Verified” ; 
om-owl:observedProperty weather:_AirTemperature ; 
om-owl:procedure sens-obs:System_4UT01 ; 
om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_6_55_00 ; 
om-owl:samplingTime sens-obs:Instant_2003_3_31_6_55_00. 
ex:CelsiusValue “7.7”^^xsd:float 
sens-obs:Observation_AirTemperature_4UT01_2003_3_31_7_45_00 
a weather:TemperatureObservation ; 
rdfs: label “Air temperature at 7:45:00”, “Not Verified” ; 
om-owl:observedProperty weather:_AirTemperature ; 
om-owl:procedure sens-obs:System_4UT01 ; 
om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_7_45_00 ; 
om-owl:samplingTime sens-obs:Instant_2003_3_31_7_45_00 . 
ex:CelsiusValue “9.4”^^xsd:float 
34 
Subject 
Molecule 
… 
Suubject 
Molecule 
…
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
sens-obs:Observation_AirTemperature_4UT01_2003_3_31_6_55_00 
a weather:TemperatureObservation ; 
rdfs: label “Air temperature at 6:55:00”, “Verified” ; 
om-owl:observedProperty weather:_AirTemperature ; 
om-owl:procedure sens-obs:System_4UT01 ; 
om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_6_55_00 ; 
om-owl:samplingTime sens-obs:Instant_2003_3_31_6_55_00. 
ex:CelsiusValue “7.7”^^xsd:float 
sens-obs:Observation_AirTemperature_4UT01_2003_3_31_7_45_00 
a weather:TemperatureObservation ; 
rdfs: label “Air temperature at 7:45:00”, “Not Verified” ; 
om-owl:observedProperty weather:_AirTemperature ; 
om-owl:procedure sens-obs:System_4UT01 ; 
om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_7_45_00 ; 
om-owl:samplingTime sens-obs:Instant_2003_3_31_7_45_00 . 
ex:CelsiusValue “9.4”^^xsd:float 
35 
Subject 
Molecule 
….. 
Structure ID30= 
a (1, weather:TemperatureObservation) 
rdfs:label (2) 
om-wl:observedProperty (1, weather:_AirTemperature ) 
om-owl:procedure (1,sens-obs:System_4UT01) 
om-owl:result (1) 
om-owl:samplingTime (1) 
ex:CelsiusValue (1) 
….. 
Structural Dictionary 
… 
Suubject 
Molecule 
…
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
Air Temperature Observations of 
the Sensor “System_4UT01” 
sens-obs:Observation_AirTemperature_4UT01_2003_3_31_6_55_00 
a weather:TemperatureObservation ; 
rdfs: label “Air temperature at 6:55:00”, “Verified” ; 
om-owl:observedProperty weather:_AirTemperature ; 
om-owl:procedure sens-obs:System_4UT01 ; 
om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_6_55_00 ; 
om-owl:samplingTime sens-obs:Instant_2003_3_31_6_55_00. 
ex:CelsiusValue “7.7”^^xsd:float 
sens-obs:Observation_AirTemperature_4UT01_2003_3_31_7_45_00 
a weather:TemperatureObservation ; 
rdfs: label “Air temperature at 7:45:00”, “Not Verified” ; 
om-owl:observedProperty weather:_AirTemperature ; 
om-owl:procedure sens-obs:System_4UT01 ; 
om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_7_45_00 ; 
om-owl:samplingTime sens-obs:Instant_2003_3_31_7_45_00 . 
ex:CelsiusValue “9.4”^^xsd:float 
36 
Subject 
Molecule 
….. 
Structure ID30= 
a (1, weather:TemperatureObservation) 
rdfs:label (2) 
om-wl:observedProperty (1, weather:_AirTemperature ) 
om-owl:procedure (1,sens-obs:System_4UT01) 
om-owl:result (1) 
om-owl:samplingTime (1) 
ex:CelsiusValue (1) 
….. 
Structural Dictionary 
… 
Suubject 
Molecule 
…
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 
Air Temperature Observations of 
the Sensor “System_4UT01” 
sens-obs:Observation_AirTemperature_4UT01_2003_3_31_6_55_00 
a weather:TemperatureObservation ; 
rdfs: label “Air temperature at 6:55:00”, “Verified” ; 
om-owl:observedProperty weather:_AirTemperature ; 
om-owl:procedure sens-obs:System_4UT01 ; 
om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_6_55_00 ; 
om-owl:samplingTime sens-obs:Instant_2003_3_31_6_55_00. 
ex:CelsiusValue “7.7”^^xsd:float 
sens-obs:Observation_AirTemperature_4UT01_2003_3_31_7_45_00 
a weather:TemperatureObservation ; 
rdfs: label “Air temperature at 7:45:00”, “Not Verified” ; 
om-owl:observedProperty weather:_AirTemperature ; 
om-owl:procedure sens-obs:System_4UT01 ; 
om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_7_45_00 ; 
om-owl:samplingTime sens-obs:Instant_2003_3_31_7_45_00 . 
ex:CelsiusValue “9.4”^^xsd:float 
37 
Subject 
Molecule 
….. 
Structure ID30= 
a (1, weather:TemperatureObservation) 
rdfs:label (2) 
om-wl:observedProperty (1, weather:_AirTemperature ) 
om-owl:procedure (1,sens-obs:System_4UT01) 
om-owl:result (1) 
om-owl:samplingTime (1) 
ex:CelsiusValue (1) 
….. 
Structural Dictionary 
… 
Suubject 
Molecule 
…
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – ERI Streams 
Based on: Efficient XML Interchange (EXI) format 
38 
Block 
Mole 
cule 
Mole 
cule 
Mole 
cule 
… 
Block 
Mole 
cule 
Mole 
cule 
Mole 
cule 
… 
Block 
Mole 
cule 
Mole 
cule 
Mole 
cule 
… … 
Multiplex / Demultiplex 
Compression/Decompression 
(per channel) 
Stream 
Header 
Stream Body 
META 
DATA 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
META 
DATA 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
META 
DATA 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
COMP 
CHAN. 
Channels 
Structural Channels 
Value Channels 
… 
ERI stream
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – ERI Streams 
39 
ERI follows an encoding procedure similar to that of the Efficient 
XML Interchange (EXI) format. 
Structural channels: They encode the subjects in each block 
and, for each one, the structural properties of the related 
triples, using the dynamic dictionary of structures. 
• Main Terms of molecules: subject of the grouping. 
• ID-Structures: ID of the structure of each molecule in the block. The ID 
points to the entry in the Structural Dictionary. 
• New Structures: New entries in the Structural Dictionary. 
– Value channels: They encode the concrete data values held by 
each predicate in the block in a compact fashion. 
• One channel per different predicate in the block. 
• Lists explicit values or use IDs pointing to a sliding object dictionary 
variations structure
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
40 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
41 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression 
Explicit list of values
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
42 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression 
Explicit list of values
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
43 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression 
Explicit list of values
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
44 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression 
Explicit list of values
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
45 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression 
Explicit list of values 
IDs pointing to a sliding 
object dictionary
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
46 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression 
Explicit list of values 
IDs pointing to a sliding 
object dictionary
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
47 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression 
Explicit list of values 
IDs pointing to a sliding 
object dictionary 
Extraction of types
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
48 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression 
Explicit list of values 
IDs pointing to a sliding 
object dictionary 
Extraction of types
EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 
49 
ID-Structures 
New 
Structure 
Marker 
… 
sens-obs:MeasureData_Air…55_00 
sens-obs:Instant_2003…55_00 
sens-obs:MeasureData_Air…45_00 
sens-obs:Instant_2003…55_00 
… 
… 
30 
30 
… 
ID-pred1 weather: TemperatureObservation 
ID-pred2 
ID-pred3 weather:_AirTemperature 
ID-pred4 sensobs: System_4UT01 
ID-pred5 
ID-pred6 
ID-pred7 
[IDs of 
Structures] 
… 
om-owl:samplingTime 
ex:CelsiusValue 
… 
[Encoded Structures] [Strings] 
Structural 
Channels 
…. 
sens-obs:Observation_AirTemperature...55_00 
sens-obs:Observation_AirTemperature...45_00 
…. 
ID-pred2 
… 
Air temperature at 6:55:00 
Verified 
Air temperature at 7:45:00 
Not Verified 
… 
[Object Values] 
[Meta: strings] 
ID-pred5 
[Term IDs] 
[Meta: IDs] 
New Terms 
[Strings] 
… 
101 
245 
… 
ID-pred6 
1 
2 
… 
[Term IDs] 
[Meta: IDs] 
Potential 
Compression 
Differential 
… 
Prefix compression 
Zlib 
Snappy 
… 
Main Terms of Molecules 
[Strings] 
Prefix compression 
Zlib 
Snappy 
… 
Prefix compression 
Zlib 
Snappy 
… 
Zlib 
Snappy 
… 
Differential 
… 
Differential 
… 
… 
1 
0 
… 
[Bits] 
New Structures 
New Predicates 
Zlib 
Snappy 
… 
New Object 
Marker 
ID-pred5 
… 
0 
1 
… 
[Bits] 
New Object 
Marker 
ID-pred6 
1 
1 
… 
[Bits] 
1 
2 
1 
1 
1 
1 
1 
ID-pred7 
… 
7.7 
9.4 
…. 
[Object Values] 
[Meta: xsd:float] 
Differential 
… 
Value 
Channels 
Potential 
Compression 
Explicit list of values 
IDs pointing to a sliding 
object dictionary 
Extraction of types
Outline 
Index 
1. Introduction & Motivation 
2. Background 
3. Efficient RDF Interchange (ERI) Format 
i. Basic Concepts 
ii. ERI Streams 
iii. Practical Deployment 
4. Evaluation 
5. Conclusions and Next steps 
50
EVALUATION - COMPRESSION 
51
EVALUATION - COMPRESSION 
52 
ERI excels in space for streaming and statistical dataset
EVALUATION - COMPRESSION 
53 
ERI excels in space for streaming and statistical dataset 
RDSZ remains comparable to our approach
EVALUATION - COMPRESSION 
54 
ERI excels in space for streaming and statistical dataset 
RDSZ remains comparable to our approach 
The object dictionary can overload the representation, although it always 
obtains comparable compression ratios.
EVALUATION - COMPRESSION 
55
EVALUATION - COMPRESSION 
56 
A smaller buffer in ERI-1k slightly affects the efficiency
EVALUATION - PARSING 
57
EVALUATION - PARSING 
58 
ERI always outperforms the RDSZ compression time (3 and 3.8 times on 
average for ERI-4k and ERI-4k-Nodict, respectively)
EVALUATION - PARSING 
59 
ERI always outperforms the RDSZ compression time (3 and 3.8 times on 
average for ERI-4k and ERI-4k-Nodict, respectively) 
ERI decompression is commonly slower (1.4 times on average in both ERI 
configurations), typically due to decompressing several channels.
EVALUATION - PARSING 
60 
ERI always outperforms the RDSZ compression time (3 and 3.8 times on 
average for ERI-4k and ERI-4k-Nodict, respectively) 
ERI decompression is commonly slower (1.4 times on average in both ERI 
configurations), typically due to decompressing several channels. 
Channels could be grouped 
(as in EXI)
EVALUATION – CONSUMING SCENARIO 
61 
In parsing: transmission + decompression
EVALUATION – CONSUMING SCENARIO 
ERI-4k and ERI-4k-Nodict outperform the baseline in transmission + decompression 
except for those datasets with less regularities in the structure or the data values, 
62 
In parsing: transmission + decompression
EVALUATION – CONSUMING SCENARIO 
63 
In a scenario in which we include the compression time
EVALUATION – CONSUMING SCENARIO 
64 
In a scenario in which we include the compression time 
ERI-4k suffers an expected overhead as we are always including the time 
to process the information
EVALUATION – CONSUMING SCENARIO 
65 
In a scenario in which we include the compression time 
ERI-4k suffers an expected overhead as we are always including the time 
to process the information 
The time in which the client receives all data in ERI is comparable to the baseline
Results 
66 
• Compressed, efficient RDF interchange (ERI) format 
• exploit the RDF data stream regularity of their structure and 
data values 
• Flexible and extensible ERI configurations 
• Minimize transmission costs in RDF stream processing 
• State-of-the-art compression 
• Remains efficient in performance 
• Time overheads are relatively low and can be assumed in 
many scenarios.
Next steps 
67 
• Integration within RDF streaming Engines 
• e.g. morph-streams, CQELS Cloud 
• 3 purposes: 
• scaling to higher input data rates 
• minimizing the data exchange among processing nodes 
• serving a small set of operators on the compressed data 
• Parallel compression/decompression 
• preliminary proposal on Storm 
• Align the proposal with the results of W3C RSP group 
regarding streaming modeling and serialization
Efficient RDF Interchange (ERI) 
Format for RDF Data Streams 
Javier D. Fernández, Alejandro Llaves, Oscar Corcho 
Ontology Engineering Group (OEG), Universidad Politécnica de Madrid, Spain 
purl.org/net/ro-eri-ISWC14 
Electronic edition: 
Research object:

More Related Content

What's hot

Virtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDFVirtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDFOpenLink Software
 
Another RDF Encoding Form
Another RDF Encoding FormAnother RDF Encoding Form
Another RDF Encoding FormJakob .
 
Introduction To RDF and RDFS
Introduction To RDF and RDFSIntroduction To RDF and RDFS
Introduction To RDF and RDFSNilesh Wagmare
 
Wi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolWi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolLaura Po
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedSören Auer
 
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013Juan Sequeda
 
20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogs20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogsandrea huang
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data VisualizationLaura Po
 
Applications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and ClassificationApplications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and Classificationshakimov
 
An Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of DataAn Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of DataOlaf Hartig
 
DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World." DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World." Avalon Media System
 
Verifying Integrity Constraints of a RDF-based WordNet
Verifying Integrity Constraints of a RDF-based WordNetVerifying Integrity Constraints of a RDF-based WordNet
Verifying Integrity Constraints of a RDF-based WordNetAlexandre Rademaker
 
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesOntotext
 

What's hot (16)

NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
 
Virtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDFVirtuoso -- The Prometheus of RDF
Virtuoso -- The Prometheus of RDF
 
Another RDF Encoding Form
Another RDF Encoding FormAnother RDF Encoding Form
Another RDF Encoding Form
 
Introduction To RDF and RDFS
Introduction To RDF and RDFSIntroduction To RDF and RDFS
Introduction To RDF and RDFS
 
Wi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX toolWi2015 - Clustering of Linked Open Data - the LODeX tool
Wi2015 - Clustering of Linked Open Data - the LODeX tool
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013
RDB2RDF Tutorial (R2RML and Direct Mapping) at ISWC 2013
 
20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogs20160818 Semantics and Linkage of Archived Catalogs
20160818 Semantics and Linkage of Archived Catalogs
 
Linked Open Data Visualization
Linked Open Data VisualizationLinked Open Data Visualization
Linked Open Data Visualization
 
LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
 
Applications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and ClassificationApplications of Word Vectors in Text Retrieval and Classification
Applications of Word Vectors in Text Retrieval and Classification
 
An Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of DataAn Introduction to RDF and the Web of Data
An Introduction to RDF and the Web of Data
 
DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World." DLF 2015 Presentation, "RDF in the Real World."
DLF 2015 Presentation, "RDF in the Real World."
 
Verifying Integrity Constraints of a RDF-based WordNet
Verifying Integrity Constraints of a RDF-based WordNetVerifying Integrity Constraints of a RDF-based WordNet
Verifying Integrity Constraints of a RDF-based WordNet
 
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven RecipesReasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
 
RDF data model
RDF data modelRDF data model
RDF data model
 

Similar to Efficient RDF Interchange (ERI) Format for RDF Data Streams

RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataGiorgos Santipantakis
 
RDF2Rule PRESENTATION
RDF2Rule PRESENTATIONRDF2Rule PRESENTATION
RDF2Rule PRESENTATIONEfrah Shakir
 
The Lonesome LOD Cloud
The Lonesome LOD CloudThe Lonesome LOD Cloud
The Lonesome LOD CloudRuben Verborgh
 
Re-using Media on the Web: Media fragment re-mixing and playout
Re-using Media on the Web: Media fragment re-mixing and playoutRe-using Media on the Web: Media fragment re-mixing and playout
Re-using Media on the Web: Media fragment re-mixing and playoutMediaMixerCommunity
 
A hands on overview of the semantic web
A hands on overview of the semantic webA hands on overview of the semantic web
A hands on overview of the semantic webMarakana Inc.
 
First steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic webFirst steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic webhorvadam
 
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...eswcsummerschool
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialMuhammad Saleem
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachhorvadam
 
On the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingOn the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingPlanetData Network of Excellence
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...Oscar Corcho
 
List.MID: A MIDI-Based Benchmark for RDF Lists
List.MID: A MIDI-Based Benchmark for RDF ListsList.MID: A MIDI-Based Benchmark for RDF Lists
List.MID: A MIDI-Based Benchmark for RDF ListsAlbert Meroño-Peñuela
 
LDP-DL: A language to define the design of Linked Data Platforms
LDP-DL: A language to define the design of Linked Data PlatformsLDP-DL: A language to define the design of Linked Data Platforms
LDP-DL: A language to define the design of Linked Data PlatformsMohammad Noorani Bakerally
 
RSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF StreamsRSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF Streamskeski
 
Streaming Day - an overview of Stream Reasoning
Streaming Day - an overview of Stream ReasoningStreaming Day - an overview of Stream Reasoning
Streaming Day - an overview of Stream ReasoningRiccardo Tommasini
 
RDF Linked Data - Automatic Exchange of BIM Containers
RDF Linked Data - Automatic Exchange of BIM ContainersRDF Linked Data - Automatic Exchange of BIM Containers
RDF Linked Data - Automatic Exchange of BIM ContainersSafe Software
 
Efficient Query Answering against Dynamic RDF Databases
Efficient Query Answering against Dynamic RDF DatabasesEfficient Query Answering against Dynamic RDF Databases
Efficient Query Answering against Dynamic RDF DatabasesAlexandra Roatiș
 
Introducing RDA: June 2013
Introducing RDA: June 2013Introducing RDA: June 2013
Introducing RDA: June 2013ALATechSource
 

Similar to Efficient RDF Interchange (ERI) Format for RDF Data Streams (20)

RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
RDF2Rule PRESENTATION
RDF2Rule PRESENTATIONRDF2Rule PRESENTATION
RDF2Rule PRESENTATION
 
KIT Graduiertenkolloquium 11.05.2016
KIT Graduiertenkolloquium 11.05.2016KIT Graduiertenkolloquium 11.05.2016
KIT Graduiertenkolloquium 11.05.2016
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
The Lonesome LOD Cloud
The Lonesome LOD CloudThe Lonesome LOD Cloud
The Lonesome LOD Cloud
 
Re-using Media on the Web: Media fragment re-mixing and playout
Re-using Media on the Web: Media fragment re-mixing and playoutRe-using Media on the Web: Media fragment re-mixing and playout
Re-using Media on the Web: Media fragment re-mixing and playout
 
A hands on overview of the semantic web
A hands on overview of the semantic webA hands on overview of the semantic web
A hands on overview of the semantic web
 
First steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic webFirst steps towards publishing library data on the semantic web
First steps towards publishing library data on the semantic web
 
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 Tutorial
 
RDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approachRDF and Open Linked Data, a first approach
RDF and Open Linked Data, a first approach
 
On the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream ProcessingOn the need for a W3C community group on RDF Stream Processing
On the need for a W3C community group on RDF Stream Processing
 
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
 
List.MID: A MIDI-Based Benchmark for RDF Lists
List.MID: A MIDI-Based Benchmark for RDF ListsList.MID: A MIDI-Based Benchmark for RDF Lists
List.MID: A MIDI-Based Benchmark for RDF Lists
 
LDP-DL: A language to define the design of Linked Data Platforms
LDP-DL: A language to define the design of Linked Data PlatformsLDP-DL: A language to define the design of Linked Data Platforms
LDP-DL: A language to define the design of Linked Data Platforms
 
RSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF StreamsRSP-QL*: Querying Data-Level Annotations in RDF Streams
RSP-QL*: Querying Data-Level Annotations in RDF Streams
 
Streaming Day - an overview of Stream Reasoning
Streaming Day - an overview of Stream ReasoningStreaming Day - an overview of Stream Reasoning
Streaming Day - an overview of Stream Reasoning
 
RDF Linked Data - Automatic Exchange of BIM Containers
RDF Linked Data - Automatic Exchange of BIM ContainersRDF Linked Data - Automatic Exchange of BIM Containers
RDF Linked Data - Automatic Exchange of BIM Containers
 
Efficient Query Answering against Dynamic RDF Databases
Efficient Query Answering against Dynamic RDF DatabasesEfficient Query Answering against Dynamic RDF Databases
Efficient Query Answering against Dynamic RDF Databases
 
Introducing RDA: June 2013
Introducing RDA: June 2013Introducing RDA: June 2013
Introducing RDA: June 2013
 

Recently uploaded

Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfrahulyadav957181
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfsimulationsindia
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 

Recently uploaded (20)

Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdf
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 

Efficient RDF Interchange (ERI) Format for RDF Data Streams

  • 1. Efficient RDF Interchange (ERI) Format for RDF Data Streams Javier D. Fernández, Alejandro Llaves, Oscar Corcho Ontology Engineering Group (OEG) Universidad Politécnica de Madrid, Spain
  • 2. Outline Index 1. Introduction & Motivation 2. Background 3. Efficient RDF Interchange (ERI) Format i. Basic Concepts ii. ERI Streams iii. Practical Deployment 4. Evaluation 5. Conclusions and Next steps 2
  • 3. INTRODUCTION - Static data versus RDF data streams 3
  • 4. INTRODUCTION - Static data versus RDF data streams 3 Files Extract Transform Load DBMS Spatial Information Web APIs Linked Data discovery
  • 5. INTRODUCTION - Static data versus RDF data streams 3 Files Extract Transform Load DBMS Spatial Information Web APIs Linked Data discovery
  • 6. INTRODUCTION - Static data versus RDF data streams 3 Files Extract Transform Load DBMS Spatial Information Web APIs Linked Data discovery “Most semantic tools are focused on this static view”
  • 7. INTRODUCTION - Static data versus RDF data streams RDF Data Streams are gaining momentum, generated from any type of data stream, and combining real-time and historical data. ©Wilgengebroed on Flickr, Mr3641, ProtoplasmaKid and ISA Internationales Stadtbauatelier in commons wikimedia 3
  • 8. INTRODUCTION - Static data versus RDF data streams 3
  • 9. INTRODUCTION - Static data versus RDF data streams 3
  • 10. INTRODUCTION - Static data versus RDF data streams 3
  • 11. INTRODUCTION - Static data versus RDF data streams 3
  • 12. INTRODUCTION - Static data versus RDF data streams RDF streams: potentially unbounded sequences of timestamped RDF statements or graphs. 3
  • 13. INTRODUCTION - Static data versus RDF data streams RDF streams: potentially unbounded sequences of timestamped RDF statements or graphs. 3 user1_observation [t1] weather1_observation [t1] user2_observation [t3] …
  • 14. INTRODUCTION - Static data versus RDF data streams RDF streams: potentially unbounded sequences of timestamped RDF statements or graphs. 3 t w1 w2 w3 u1 u2 u3 u4 Stream user1_observation [t1] weather1_observation [t1] user2_observation [t3] …
  • 15. INTRODUCTION - Motivation Achieve efficient transmission of RDF streams, a necessary step to ensure higher throughput for RDF Stream processors 3 Stream source Stream source Stream source Stream source Stream Processor Engine Historic Information C-SPARQL, SPARQLStream morph-streams CQELS Cloud Ztreamy … Stream source queries Continuous results
  • 16. INTRODUCTION – Motivation - Requirements 16 Efficient transmission of RDF streams: • Streamable • Scalable • Easy (fast) to process (create and parse) • Compact • Parametrizable (several tradeoffs compression/time)
  • 17. BACKGROUND 17 Plain: Turtle/ Trig/ JSON-LD Plain +Compression (e.g. gzip) HDT Streaming HDT RDSZ RDF/XML + EXI ERI Streamable Yes Yes No Yes Yes Yes Yes Scalable Limited Yes Yes No Yes Yes Yes Easy (fast) to Yes Limited Limited Yes Limited Limited Yes create and parse Compact No Yes Yes Limited Yes Yes Yes Parametrizable: No Limited Yes No Limited Limited Yes compression/time
  • 18. Outline Index 1. Introduction & Motivation 2. Background 3. Efficient RDF Interchange (ERI) Format i. Basic Concepts ii. ERI Streams iii. Practical Deployment 4. Evaluation 5. Conclusions and Next steps 18
  • 19. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 19 • (Assumption) Most RDF streams are well structured structure • the is well-known by the data provider • the number of variations in the structure are limited
  • 20. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 20 • (Assumption) Most RDF streams are well structured structure • the is well-known by the data provider • the number of variations in the structure are limited • Efficient RDF Interchange (ERI) Format encodes the information at two levels:
  • 21. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 21 • (Assumption) Most RDF streams are well structured structure • the is well-known by the data provider • the number of variations in the structure are limited • Efficient RDF Interchange (ERI) Format encodes the information at two levels: • A sliding dictionary of structures: Structural Dictionary
  • 22. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 22 • (Assumption) Most RDF streams are well structured structure • the is well-known by the data provider • the number of variations in the structure are limited • Efficient RDF Interchange (ERI) Format encodes the information at two levels: • A sliding dictionary of structures: Structural Dictionary • The concrete value for each predicate
  • 23. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 23 • (Assumption) Most RDF streams are well structured structure • the is well-known by the data provider • the number of variations in the structure are limited • Efficient RDF Interchange (ERI) Format encodes the information at two levels: • A sliding dictionary of structures: Structural Dictionary • The concrete value for each predicate
  • 24. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 24 • (Assumption) Most RDF streams are well structured structure • the is well-known by the data provider • the number of variations in the structure are limited • Efficient RDF Interchange (ERI) Format encodes the information at two levels: • A sliding dictionary of structures: Structural Dictionary • The concrete value for each predicate
  • 25. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 25 ID-31 ID-32 Structural Dictionary “7.7”^^xsd:float “9.4”^^xsd:float t w1 w2 w3 u1 u2 u3 u4 Stream temper ature Casual user Anual pass wind ID-30 ID-33 … weather: TemperatureObservation rdf:type weather: AirTemperature ssn:observedProperty ??? ex:CelsiusValue … … …
  • 26. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 26 ID-31 ID-32 Structural Dictionary “7.7”^^xsd:float “9.4”^^xsd:float t w1 w2 w3 u1 u2 u3 u4 Stream temper ature Casual user Anual pass wind ID-30 ID-33 … weather: TemperatureObservation rdf:type weather: AirTemperature ssn:observedProperty ??? ex:CelsiusValue … … … molecule
  • 27. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 27 ID-31 ID-32 Structural Dictionary “7.7”^^xsd:float “9.4”^^xsd:float t w1 w2 w3 u1 u2 u3 u4 Stream temper ature Casual user Anual pass wind ID-30 ID-33 … weather: TemperatureObservation rdf:type weather: AirTemperature ssn:observedProperty ??? ex:CelsiusValue … … … molecule
  • 28. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 28 ID-31 ID-32 Structural Dictionary “7.7”^^xsd:float “9.4”^^xsd:float t w1 w2 w3 u1 u2 u3 u4 Stream temper ature Casual user Anual pass wind ID-30 ID-33 … weather: TemperatureObservation rdf:type weather: AirTemperature ssn:observedProperty ??? ex:CelsiusValue … … … molecule
  • 29. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 29 ID-31 ID-32 Structural Dictionary “7.7”^^xsd:float “9.4”^^xsd:float t w1 w2 w3 u1 u2 u3 u4 Stream temper ature Casual user Anual pass wind ID-30 ID-33 … weather: TemperatureObservation rdf:type weather: AirTemperature ssn:observedProperty ??? ex:CelsiusValue … … … molecule
  • 30. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 30 ID-31 ID-32 Structural Dictionary “7.7”^^xsd:float “9.4”^^xsd:float t w1 w2 w3 u1 u2 u3 u4 Stream temper ature Casual user Anual pass wind ID-30 ID-33 … weather: TemperatureObservation rdf:type weather: AirTemperature ssn:observedProperty ??? ex:CelsiusValue … … … molecule
  • 31. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 31 ID-31 ID-32 Structural Dictionary “7.7”^^xsd:float “9.4”^^xsd:float t w1 w2 w3 u1 u2 u3 u4 Stream temper ature Casual user Anual pass wind ID-30 ID-33 … weather: TemperatureObservation rdf:type weather: AirTemperature ssn:observedProperty ??? ex:CelsiusValue … … … molecule
  • 32. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 32 ID-31 ID-32 Structural Dictionary “7.7”^^xsd:float “9.4”^^xsd:float t w1 w2 w3 u1 u2 u3 u4 Stream temper ature Casual user Anual pass wind ID-30 ID-33 … weather: TemperatureObservation rdf:type weather: AirTemperature ssn:observedProperty ??? ex:CelsiusValue … … … molecule
  • 33. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts 33 • ERI processing model • Minimal Information Unit is a molecule: • We initially restrict to subject molecules
  • 34. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts sens-obs:Observation_AirTemperature_4UT01_2003_3_31_6_55_00 a weather:TemperatureObservation ; rdfs: label “Air temperature at 6:55:00”, “Verified” ; om-owl:observedProperty weather:_AirTemperature ; om-owl:procedure sens-obs:System_4UT01 ; om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_6_55_00 ; om-owl:samplingTime sens-obs:Instant_2003_3_31_6_55_00. ex:CelsiusValue “7.7”^^xsd:float sens-obs:Observation_AirTemperature_4UT01_2003_3_31_7_45_00 a weather:TemperatureObservation ; rdfs: label “Air temperature at 7:45:00”, “Not Verified” ; om-owl:observedProperty weather:_AirTemperature ; om-owl:procedure sens-obs:System_4UT01 ; om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_7_45_00 ; om-owl:samplingTime sens-obs:Instant_2003_3_31_7_45_00 . ex:CelsiusValue “9.4”^^xsd:float 34 Subject Molecule … Suubject Molecule …
  • 35. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts sens-obs:Observation_AirTemperature_4UT01_2003_3_31_6_55_00 a weather:TemperatureObservation ; rdfs: label “Air temperature at 6:55:00”, “Verified” ; om-owl:observedProperty weather:_AirTemperature ; om-owl:procedure sens-obs:System_4UT01 ; om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_6_55_00 ; om-owl:samplingTime sens-obs:Instant_2003_3_31_6_55_00. ex:CelsiusValue “7.7”^^xsd:float sens-obs:Observation_AirTemperature_4UT01_2003_3_31_7_45_00 a weather:TemperatureObservation ; rdfs: label “Air temperature at 7:45:00”, “Not Verified” ; om-owl:observedProperty weather:_AirTemperature ; om-owl:procedure sens-obs:System_4UT01 ; om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_7_45_00 ; om-owl:samplingTime sens-obs:Instant_2003_3_31_7_45_00 . ex:CelsiusValue “9.4”^^xsd:float 35 Subject Molecule ….. Structure ID30= a (1, weather:TemperatureObservation) rdfs:label (2) om-wl:observedProperty (1, weather:_AirTemperature ) om-owl:procedure (1,sens-obs:System_4UT01) om-owl:result (1) om-owl:samplingTime (1) ex:CelsiusValue (1) ….. Structural Dictionary … Suubject Molecule …
  • 36. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts Air Temperature Observations of the Sensor “System_4UT01” sens-obs:Observation_AirTemperature_4UT01_2003_3_31_6_55_00 a weather:TemperatureObservation ; rdfs: label “Air temperature at 6:55:00”, “Verified” ; om-owl:observedProperty weather:_AirTemperature ; om-owl:procedure sens-obs:System_4UT01 ; om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_6_55_00 ; om-owl:samplingTime sens-obs:Instant_2003_3_31_6_55_00. ex:CelsiusValue “7.7”^^xsd:float sens-obs:Observation_AirTemperature_4UT01_2003_3_31_7_45_00 a weather:TemperatureObservation ; rdfs: label “Air temperature at 7:45:00”, “Not Verified” ; om-owl:observedProperty weather:_AirTemperature ; om-owl:procedure sens-obs:System_4UT01 ; om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_7_45_00 ; om-owl:samplingTime sens-obs:Instant_2003_3_31_7_45_00 . ex:CelsiusValue “9.4”^^xsd:float 36 Subject Molecule ….. Structure ID30= a (1, weather:TemperatureObservation) rdfs:label (2) om-wl:observedProperty (1, weather:_AirTemperature ) om-owl:procedure (1,sens-obs:System_4UT01) om-owl:result (1) om-owl:samplingTime (1) ex:CelsiusValue (1) ….. Structural Dictionary … Suubject Molecule …
  • 37. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Basic Concepts Air Temperature Observations of the Sensor “System_4UT01” sens-obs:Observation_AirTemperature_4UT01_2003_3_31_6_55_00 a weather:TemperatureObservation ; rdfs: label “Air temperature at 6:55:00”, “Verified” ; om-owl:observedProperty weather:_AirTemperature ; om-owl:procedure sens-obs:System_4UT01 ; om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_6_55_00 ; om-owl:samplingTime sens-obs:Instant_2003_3_31_6_55_00. ex:CelsiusValue “7.7”^^xsd:float sens-obs:Observation_AirTemperature_4UT01_2003_3_31_7_45_00 a weather:TemperatureObservation ; rdfs: label “Air temperature at 7:45:00”, “Not Verified” ; om-owl:observedProperty weather:_AirTemperature ; om-owl:procedure sens-obs:System_4UT01 ; om-owl:result sens-obs:MeasureData_AirTemperature_4UT01_2003_3_31_7_45_00 ; om-owl:samplingTime sens-obs:Instant_2003_3_31_7_45_00 . ex:CelsiusValue “9.4”^^xsd:float 37 Subject Molecule ….. Structure ID30= a (1, weather:TemperatureObservation) rdfs:label (2) om-wl:observedProperty (1, weather:_AirTemperature ) om-owl:procedure (1,sens-obs:System_4UT01) om-owl:result (1) om-owl:samplingTime (1) ex:CelsiusValue (1) ….. Structural Dictionary … Suubject Molecule …
  • 38. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – ERI Streams Based on: Efficient XML Interchange (EXI) format 38 Block Mole cule Mole cule Mole cule … Block Mole cule Mole cule Mole cule … Block Mole cule Mole cule Mole cule … … Multiplex / Demultiplex Compression/Decompression (per channel) Stream Header Stream Body META DATA COMP CHAN. COMP CHAN. COMP CHAN. COMP CHAN. COMP CHAN. COMP CHAN. META DATA COMP CHAN. COMP CHAN. COMP CHAN. COMP CHAN. META DATA COMP CHAN. COMP CHAN. COMP CHAN. COMP CHAN. COMP CHAN. COMP CHAN. Channels Structural Channels Value Channels … ERI stream
  • 39. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – ERI Streams 39 ERI follows an encoding procedure similar to that of the Efficient XML Interchange (EXI) format. Structural channels: They encode the subjects in each block and, for each one, the structural properties of the related triples, using the dynamic dictionary of structures. • Main Terms of molecules: subject of the grouping. • ID-Structures: ID of the structure of each molecule in the block. The ID points to the entry in the Structural Dictionary. • New Structures: New entries in the Structural Dictionary. – Value channels: They encode the concrete data values held by each predicate in the block in a compact fashion. • One channel per different predicate in the block. • Lists explicit values or use IDs pointing to a sliding object dictionary variations structure
  • 40. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 40 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression
  • 41. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 41 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression Explicit list of values
  • 42. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 42 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression Explicit list of values
  • 43. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 43 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression Explicit list of values
  • 44. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 44 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression Explicit list of values
  • 45. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 45 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression Explicit list of values IDs pointing to a sliding object dictionary
  • 46. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 46 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression Explicit list of values IDs pointing to a sliding object dictionary
  • 47. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 47 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression Explicit list of values IDs pointing to a sliding object dictionary Extraction of types
  • 48. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 48 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression Explicit list of values IDs pointing to a sliding object dictionary Extraction of types
  • 49. EFFICIENT RDF INTERCHANGE (ERI) FORMAT – Practical Deployment 49 ID-Structures New Structure Marker … sens-obs:MeasureData_Air…55_00 sens-obs:Instant_2003…55_00 sens-obs:MeasureData_Air…45_00 sens-obs:Instant_2003…55_00 … … 30 30 … ID-pred1 weather: TemperatureObservation ID-pred2 ID-pred3 weather:_AirTemperature ID-pred4 sensobs: System_4UT01 ID-pred5 ID-pred6 ID-pred7 [IDs of Structures] … om-owl:samplingTime ex:CelsiusValue … [Encoded Structures] [Strings] Structural Channels …. sens-obs:Observation_AirTemperature...55_00 sens-obs:Observation_AirTemperature...45_00 …. ID-pred2 … Air temperature at 6:55:00 Verified Air temperature at 7:45:00 Not Verified … [Object Values] [Meta: strings] ID-pred5 [Term IDs] [Meta: IDs] New Terms [Strings] … 101 245 … ID-pred6 1 2 … [Term IDs] [Meta: IDs] Potential Compression Differential … Prefix compression Zlib Snappy … Main Terms of Molecules [Strings] Prefix compression Zlib Snappy … Prefix compression Zlib Snappy … Zlib Snappy … Differential … Differential … … 1 0 … [Bits] New Structures New Predicates Zlib Snappy … New Object Marker ID-pred5 … 0 1 … [Bits] New Object Marker ID-pred6 1 1 … [Bits] 1 2 1 1 1 1 1 ID-pred7 … 7.7 9.4 …. [Object Values] [Meta: xsd:float] Differential … Value Channels Potential Compression Explicit list of values IDs pointing to a sliding object dictionary Extraction of types
  • 50. Outline Index 1. Introduction & Motivation 2. Background 3. Efficient RDF Interchange (ERI) Format i. Basic Concepts ii. ERI Streams iii. Practical Deployment 4. Evaluation 5. Conclusions and Next steps 50
  • 52. EVALUATION - COMPRESSION 52 ERI excels in space for streaming and statistical dataset
  • 53. EVALUATION - COMPRESSION 53 ERI excels in space for streaming and statistical dataset RDSZ remains comparable to our approach
  • 54. EVALUATION - COMPRESSION 54 ERI excels in space for streaming and statistical dataset RDSZ remains comparable to our approach The object dictionary can overload the representation, although it always obtains comparable compression ratios.
  • 56. EVALUATION - COMPRESSION 56 A smaller buffer in ERI-1k slightly affects the efficiency
  • 58. EVALUATION - PARSING 58 ERI always outperforms the RDSZ compression time (3 and 3.8 times on average for ERI-4k and ERI-4k-Nodict, respectively)
  • 59. EVALUATION - PARSING 59 ERI always outperforms the RDSZ compression time (3 and 3.8 times on average for ERI-4k and ERI-4k-Nodict, respectively) ERI decompression is commonly slower (1.4 times on average in both ERI configurations), typically due to decompressing several channels.
  • 60. EVALUATION - PARSING 60 ERI always outperforms the RDSZ compression time (3 and 3.8 times on average for ERI-4k and ERI-4k-Nodict, respectively) ERI decompression is commonly slower (1.4 times on average in both ERI configurations), typically due to decompressing several channels. Channels could be grouped (as in EXI)
  • 61. EVALUATION – CONSUMING SCENARIO 61 In parsing: transmission + decompression
  • 62. EVALUATION – CONSUMING SCENARIO ERI-4k and ERI-4k-Nodict outperform the baseline in transmission + decompression except for those datasets with less regularities in the structure or the data values, 62 In parsing: transmission + decompression
  • 63. EVALUATION – CONSUMING SCENARIO 63 In a scenario in which we include the compression time
  • 64. EVALUATION – CONSUMING SCENARIO 64 In a scenario in which we include the compression time ERI-4k suffers an expected overhead as we are always including the time to process the information
  • 65. EVALUATION – CONSUMING SCENARIO 65 In a scenario in which we include the compression time ERI-4k suffers an expected overhead as we are always including the time to process the information The time in which the client receives all data in ERI is comparable to the baseline
  • 66. Results 66 • Compressed, efficient RDF interchange (ERI) format • exploit the RDF data stream regularity of their structure and data values • Flexible and extensible ERI configurations • Minimize transmission costs in RDF stream processing • State-of-the-art compression • Remains efficient in performance • Time overheads are relatively low and can be assumed in many scenarios.
  • 67. Next steps 67 • Integration within RDF streaming Engines • e.g. morph-streams, CQELS Cloud • 3 purposes: • scaling to higher input data rates • minimizing the data exchange among processing nodes • serving a small set of operators on the compressed data • Parallel compression/decompression • preliminary proposal on Storm • Align the proposal with the results of W3C RSP group regarding streaming modeling and serialization
  • 68. Efficient RDF Interchange (ERI) Format for RDF Data Streams Javier D. Fernández, Alejandro Llaves, Oscar Corcho Ontology Engineering Group (OEG), Universidad Politécnica de Madrid, Spain purl.org/net/ro-eri-ISWC14 Electronic edition: Research object: