The document discusses real-time visitor analysis using Couchbase and Elasticsearch. It describes how Hippo collects data about website visitors from various sources and stores it in Couchbase for low-latency access. Elasticsearch is used to enable advanced search and analytics of visitor data. The architecture allows scaling to handle high volumes of visitor data and requests. Couchbase and Elasticsearch are suitable for their flexibility, performance, and ability to scale.
8. NoSQL Matters 2013
How we analyse
visitors @ Hippo
OneHippo @ Goto
follow the Hippo trail
9. NoSQL Matters 2013
Registration
Visitor - entity making HTTP requests
Collector - records data about a visitor or his behaviour
Example: location collector (GeoIPCollector)
Targeting Data - all data about a specific visitor
Example: IP address is located in Amsterdam
follow the Hippo trail
10. NoSQL Matters 2013
Matching
Characteristic - a type of fact about visitors
Example: "comes from a city", "experiences a type of
weather"
Target Group - the specification of a Characteristic
Example: "comes from a European city", "comes from
Amsterdam"
Persona - one or more target groups that describe a
certain type of visitor
Example: "Jim, the European urban consumer",
"Alice, the Pet owner"
follow the Hippo trail
11. NoSQL Matters 2013
What do we store?
Request log
!
Targeting data
!
Statistics
Averages, e.g. how many visitors became which persona
follow the Hippo trail
28. NoSQL Matters 2013
Suitable types
• Key-value store
• Document database
• Column oriented store
follow the Hippo trail
29. NoSQL Matters 2013
Assessment Criteria
Maturity
Data model
Scalability
Replication
Performance
Reliability
Caching model
Query model
Consistency model
Support
Monitoring
follow the Hippo trail