RavenDB

Document databases in practice

Nicola Baldi

http://it.linkedin.com/in/nicolabaldi

Luigi Berrettini

http://it.linkedin.com/in/luigiberrettini

Overview

15/12/2012



Unbounded result sets problem
Unbounded number of requests problem


▪ Document databases favor denormalization over composition and joins
▪ Relations work differently than in RDBMSs
▪ They are schema-less, but documents still need careful design


« a conceptual model should be drawn with
little or no regard for the software that might
implement it » (Martin Fowler, UML Distilled)

A domain model should be independent of
implementation details such as persistence.
In RavenDB this is somewhat true.

▪ RDBMSs are schema-full
• tuples = sets of key-value pairs ⇒ flat structure
• more complex data structures are stored as relations

▪ Document databases are schema-less
• object graphs stored as docs ⇒ no flat structure
• each document is treated as a single entity

RavenDB's suggested approach is to follow the
Aggregate pattern from the DDD book (Eric Evans, Domain-Driven Design)


ENTITY
▪ Some objects are not defined primarily by their attributes
▪ They represent a thread of identity that runs through time and often across distinct representations
▪ Mistaken identity can lead to data corruption

VALUE OBJECT
▪ When you care only about the attributes of an element of the model, classify it as a value object
▪ Make it express the meaning of the attributes it conveys and give it related functionality
▪ Treat the value object as immutable
▪ Don't give it any identity and avoid the design complexities necessary to maintain entities


AGGREGATE
▪ Invariants are consistency rules that must be maintained whenever data changes
▪ They’ll involve relationships within an aggregate (relations & foreign keys: order / order lines)
▪ Invariants applied within an aggregate will be enforced with the completion of each transaction


▪ Cluster entities and value objects into aggregates and define boundaries around each
▪ Choose one entity to be the root of each aggregate and control all access to the objects inside the boundary through the root
▪ Allow external objects to hold references to the root only
▪ Transient references to internal members can be passed out for use within a single operation only


▪ Because the root controls access, it cannot be blindsided by changes to the internals
▪ This arrangement makes it practical to enforce all invariants for objects in the aggregate and for the aggregate as a whole in any state change
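
The aggregate rules above can be sketched in a few lines. This is only an illustration: the `Order` / `OrderLine` classes and the `max_total` invariant are hypothetical and do not come from the slides.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class OrderLine:
    """Value object: immutable, no identity of its own."""
    product_id: str
    quantity: int
    unit_price: float


@dataclass
class Order:
    """Aggregate root: the only entry point to its order lines."""
    id: str
    max_total: float  # hypothetical invariant: the total may not exceed this
    _lines: list = field(default_factory=list)

    def add_line(self, line: OrderLine) -> None:
        # Invariants are checked on every state change, before mutating.
        if line.quantity <= 0:
            raise ValueError("quantity must be positive")
        if self.total() + line.quantity * line.unit_price > self.max_total:
            raise ValueError("order total would exceed the allowed maximum")
        self._lines.append(line)

    def total(self) -> float:
        return sum(l.quantity * l.unit_price for l in self._lines)

    def lines(self) -> tuple:
        # Hand out a read-only snapshot, never the internal list.
        return tuple(self._lines)
```

External code holds a reference to the `Order` only; lines are reached through it, so the root sees every change.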


Nested child document
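
A minimal sketch of what a nested child document might look like, written as a Python dict. The order and its fields are made up for illustration.

```python
import json

# Hypothetical order document: the lines live inside the order itself.
order = {
    "Id": "orders/1",
    "Lines": [
        {"Product": "Milk", "Quantity": 2},
        {"Product": "Bread", "Quantity": 1},
    ],
}

# One load returns the whole aggregate, children included.
print(json.dumps(order, indent=2))
total_quantity = sum(line["Quantity"] for line in order["Lines"])
```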


Document referenced by ID
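
A minimal sketch of a reference by ID, using a dict as a stand-in for the database. The `load` helper and the document contents are hypothetical.

```python
# Toy in-memory "database": document key -> document (all names are made up).
documents = {
    "customers/1": {"Id": "customers/1", "Name": "Ada"},
    "orders/1": {"Id": "orders/1", "CustomerId": "customers/1"},
}


def load(key: str) -> dict:
    """Stand-in for a round trip to the server."""
    return documents[key]


order = load("orders/1")
customer = load(order["CustomerId"])  # second lookup to follow the reference
```

The order stores only the key; reading the customer's details costs an extra lookup.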


Denormalized reference
▪ we clone the properties we care about when displaying or processing a containing document
▪ this avoids many cross-document lookups, and only the necessary data is transmitted over the network
▪ it makes other scenarios more difficult: if we add frequently changing data, keeping the details in sync could become very demanding on the server
▪ use it only for rarely changing data, or for data that can still be dereferenced when the copy is out of sync
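
The trade-off above, sketched with plain dicts. The `denormalized_reference` helper and the customer fields are hypothetical.

```python
customer = {"Id": "customers/1", "Name": "Ada",
            "Address": "1 Main St", "Phone": "555-0100"}


def denormalized_reference(doc: dict, fields: tuple) -> dict:
    """Copy the id plus the few properties the containing document needs."""
    return {"Id": doc["Id"], **{f: doc[f] for f in fields}}


order = {"Id": "orders/1",
         "Customer": denormalized_reference(customer, ("Name",))}

# The order can be displayed without loading the customer,
# but a later rename leaves the copy out of sync until it is refreshed.
customer["Name"] = "Ada L."
is_stale = order["Customer"]["Name"] != customer["Name"]
```

The copied `Id` still points at the right customer even when the copied `Name` is stale.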

Order contains denormalized data from Customer and Product
Full data are saved elsewhere


Querying


▪ DocumentStore
• used to connect to a RavenDB data store
• thread-safe
• one instance per database per application

▪ Session
• used to perform operations on the database
• not thread-safe
• implements the Unit of Work pattern
➢ in a single session, a single document (identified by its key) always resolves to the same instance
➢ change tracking
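
The two Unit of Work properties above (one instance per key, change tracking) can be sketched as follows. `ToySession` is a hypothetical class over a plain dict and is not RavenDB's client API.

```python
import copy


class ToySession:
    """Illustrative unit of work over a dict-backed store (not RavenDB's API)."""

    def __init__(self, store: dict):
        self._store = store
        self._identity_map = {}   # key -> the one instance handed out
        self._snapshots = {}      # key -> copy taken at load time

    def load(self, key: str) -> dict:
        if key not in self._identity_map:
            self._identity_map[key] = copy.deepcopy(self._store[key])
            self._snapshots[key] = copy.deepcopy(self._store[key])
        return self._identity_map[key]

    def save_changes(self) -> list:
        """Write back only the documents that differ from their snapshot."""
        changed = [k for k, doc in self._identity_map.items()
                   if doc != self._snapshots[k]]
        for k in changed:
            self._store[k] = copy.deepcopy(self._identity_map[k])
            self._snapshots[k] = copy.deepcopy(self._identity_map[k])
        return changed
```

Loading the same key twice returns the same object, and `save_changes()` reports only the keys that were actually modified.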

▪ Sequential GUID key
• when the document key is not relevant (e.g. log entries)
• entity Id = sequential GUID (sorts well for indexing)
• Id property missing / not set ⇒ server generates a key

▪ Identity key
• entity Id = prefix + next available integer Id for that prefix
• Id property set to a prefix = value ending with a slash
• new DocumentStore ⇒ server sends a range of HiLo keys

▪ Assign a key yourself
• for documents which already have a native id (e.g. users)
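
A rough sketch of the HiLo idea behind identity keys: the server hands out ranges, and each client consumes its range locally. Both classes and the range size are assumptions made for illustration; RavenDB's actual algorithm and range sizes may differ.

```python
class ToyHiLoServer:
    """Hands out non-overlapping ranges of integers per prefix."""

    def __init__(self, range_size: int = 32):  # range size is an arbitrary choice
        self.range_size = range_size
        self._next_start = {}

    def next_range(self, prefix: str) -> range:
        start = self._next_start.get(prefix, 1)
        self._next_start[prefix] = start + self.range_size
        return range(start, start + self.range_size)


class ToyKeyGenerator:
    """Client side: uses up a range locally, then asks for a new one."""

    def __init__(self, server: ToyHiLoServer):
        self._server = server
        self._iters = {}

    def next_key(self, prefix: str) -> str:
        while True:
            it = self._iters.get(prefix)
            if it is not None:
                try:
                    return f"{prefix}{next(it)}"
                except StopIteration:
                    pass  # range exhausted, fetch another below
            self._iters[prefix] = iter(self._server.next_range(prefix))
```

Two clients never produce the same key, and most keys are generated without a server round trip.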

▪ soft limit = 128
a query with no Take() is treated as Take(128)

▪ hard limit = 1024
if x > 1024, Take(x) returns 1024 documents
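
The two limits amount to a simple clamp. The function name below is hypothetical; the numbers are the ones stated on the slide.

```python
SOFT_LIMIT = 128    # applied when the query has no Take()
HARD_LIMIT = 1024   # upper bound for any Take(x)


def effective_page_size(take=None) -> int:
    """Number of documents requested from the server, per the limits above."""
    if take is None:
        return SOFT_LIMIT
    return min(take, HARD_LIMIT)
```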


▪ RavenDB can skip over some results internally ⇒ TotalResults value invalidated
▪ For proper paging use SkippedResults:
Skip(currentPage * pageSize + SkippedResults)
▪ Assuming a page size of 10…
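
A toy simulation of the paging formula, with a page size of 2 for brevity. `toy_query` is a hypothetical stand-in for the server, and carrying the skipped count forward as a running total is an assumption of this sketch. It only handles duplicates that sit next to each other inside a page.

```python
def toy_query(index_entries, start, page_size):
    """Scan entries from `start`, dropping repeated document ids.

    Returns (document_ids, skipped_results), mimicking a server that
    skips duplicate entries internally and reports how many it skipped.
    """
    docs, skipped, pos = [], 0, start
    while pos < len(index_entries) and len(docs) < page_size:
        doc_id = index_entries[pos]
        if doc_id in docs:
            skipped += 1
        else:
            docs.append(doc_id)
        pos += 1
    return docs, skipped


def all_pages(index_entries, page_size):
    """Page through the entries using Skip(page * size + skipped so far)."""
    pages, total_skipped, current_page = [], 0, 0
    while True:
        start = current_page * page_size + total_skipped
        docs, skipped = toy_query(index_entries, start, page_size)
        if not docs:
            return pages
        pages.append(docs)
        total_skipped += skipped
        current_page += 1
```

With entries `["a", "a", "b", "c", "c", "d"]`, ignoring the skipped count would start page 1 at offset 2 and return `"b"` a second time.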


▪ RavenDB supports Count and Distinct
▪ SelectMany, GroupBy and Join are not supported
▪ The let keyword is not supported
▪ For such operations an index is needed


All queries use an index to return results
▪ Dynamic = created automatically by the server
▪ Static = created explicitly by the user


▪ no matching static index to query ⇒ RavenDB automatically creates a dynamic index on the fly (on first user query)
▪ based on requests coming in, RavenDB can decide to promote a temporary index to a permanent one


Static indexes
▪ permanent
▪ expose much more functionality
▪ low latency: dynamic indexes have performance issues on first run
▪ map / reduce
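
A minimal sketch of the map / reduce idea in plain Python, counting posts per tag. The documents and function names are hypothetical; a real RavenDB index is defined on the server, not like this.

```python
from collections import defaultdict

posts = [
    {"Id": "posts/1", "Tags": ["NoSQL", "RavenDB"]},
    {"Id": "posts/2", "Tags": ["NoSQL"]},
]


def map_post(post):
    # Map: one {Tag, Count} entry per tag of the document.
    return [{"Tag": tag, "Count": 1} for tag in post["Tags"]]


def reduce_entries(entries):
    # Reduce: group by Tag and sum Count. The output has the same shape as
    # the input, so the reduce step can be re-applied to its own results.
    totals = defaultdict(int)
    for e in entries:
        totals[e["Tag"]] += e["Count"]
    return [{"Tag": t, "Count": c} for t, c in sorted(totals.items())]


mapped = [entry for post in posts for entry in map_post(post)]
tag_counts = reduce_entries(mapped)
```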


Advanced topics


▪ an index is made of documents

▪ document
• atomic unit of indexing and searching
• flat ⇒ recursion and joins must be denormalized
• flexible schema
• made of fields


▪ field
• a name-value pair with associated info
• can be indexed if you're going to search on it ⇒ tokenization by analysis
• can be stored in order to preserve the original untokenized value within the document

▪ example of physical index structure
{"__document_id": "docs/1", "tag": "NoSQL"}
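
The difference between an indexed and a stored field, as a toy sketch. The `analyze` function is a deliberately crude stand-in for a real analyzer, and the entry layout (apart from `__document_id`) is invented for illustration.

```python
def analyze(text: str) -> list:
    """Very rough stand-in for an analyzer: lowercase and split on whitespace."""
    return text.lower().split()


def index_document(doc_id: str, fields: dict, stored: set) -> dict:
    """Build one flat index entry: tokens for searching, originals if stored."""
    entry = {"__document_id": doc_id, "tokens": {}, "stored": {}}
    for name, value in fields.items():
        entry["tokens"][name] = analyze(value)
        if name in stored:
            entry["stored"][name] = value
    return entry


entry = index_document("docs/1", {"title": "NoSQL in Practice"}, stored={"title"})
```

Searching works on the tokens; the original text is only recoverable from the index when the field is also stored.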


One to one


One to many ⇒ SELECT N+1
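
Why a one-to-many relation tends toward SELECT N+1, sketched with a store that counts round trips. `CountingStore` and the documents are hypothetical, not a RavenDB API.

```python
class CountingStore:
    """Dict-backed stand-in for a remote database that counts round trips."""

    def __init__(self, documents: dict):
        self._documents = documents
        self.requests = 0

    def load(self, key: str) -> dict:
        self.requests += 1
        return self._documents[key]

    def load_many(self, keys: list) -> list:
        self.requests += 1  # one round trip for the whole batch
        return [self._documents[k] for k in keys]


docs = {
    "orders/1": {"ProductIds": ["products/1", "products/2", "products/3"]},
    "products/1": {"Name": "Milk"},
    "products/2": {"Name": "Bread"},
    "products/3": {"Name": "Eggs"},
}

naive = CountingStore(docs)
order = naive.load("orders/1")
products = [naive.load(pid) for pid in order["ProductIds"]]   # N extra requests

batched = CountingStore(docs)
order = batched.load("orders/1")
products = batched.load_many(order["ProductIds"])             # 1 extra request
```

The naive version issues one request per related document; the batched version stays at two requests regardless of N.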


Value type


▪ indexing: a thread executed on document creation or update
▪ the server responds quickly BUT you may query stale indexes (better stale than offline)
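
A toy model of stale indexes: writes return at once and the index catches up later. `ToyDatabase` is hypothetical, and `run_indexing` stands in for the background thread.

```python
class ToyDatabase:
    """Writes go to the document store at once; the index catches up later."""

    def __init__(self):
        self.documents = {}
        self._index = {}      # name -> set of document keys
        self._pending = []    # keys written but not yet indexed

    def put(self, key: str, doc: dict) -> None:
        self.documents[key] = doc
        self._pending.append(key)  # indexing is deferred, so put() returns fast

    def run_indexing(self) -> None:
        """What the background indexing thread would do."""
        for key in self._pending:
            self._index.setdefault(self.documents[key]["Name"], set()).add(key)
        self._pending.clear()

    def query_by_name(self, name: str):
        """Return (matching keys, is_stale)."""
        return sorted(self._index.get(name, set())), bool(self._pending)
```

A query issued right after `put()` still answers, but flags its result as stale until indexing has run.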


documentStore.Conventions.DefaultQueryingConsistency

▪ ConsistencyOptions.QueryYourWrites
same behavior as WaitForNonStaleResultsAsOfLastWrite

▪ ConsistencyOptions.MonotonicRead
you never go back in time and read older data than what you have already seen
