Logitech - LOGITECH ACCELERATES CLOUD ANALYTICS USING DATA VIRTUALIZATION
1. LOGITECH ACCELERATES
CLOUD ANALYTICS USING
DATA VIRTUALIZATION
Avinash Deshpande
Principal, Big data and Advanced Analytics
adeshpande2@Logitech.com
2. Logitech designs products that have an everyday place in people's lives, connecting them
to the digital experiences they care about. Over 30 years ago, Logitech started
connecting people through computers; now it’s designing products that bring people
together through music, gaming, video and computing.
In 1981, Logitech was founded in the village of Apples, Switzerland. The start-up was
based in a farm building – the Swiss equivalent of a Silicon Valley garage. Shortly after,
another office was opened up in the U.S. at 165 University Avenue, Palo Alto. This
address has become famous over the years as a lucky one for start-ups. It’s where
Logitech started, as well as Danger, Inc, PayPal and Google.
At the heart of Logitech’s success lies its ability to design product experiences that tap
into genuine consumer needs. Under a number of different brands, the company offers
PC peripherals; cases and keyboards for tablets; equipment for gamers; mobile speakers
and earphones for music and sports enthusiasts; devices to make video collaboration
simple in the workplace; and entertainment and control products for the home.
THE LOGITECH STORY
3. LOGITECH – PRODUCT PORTFOLIO
• Mice + Keyboards
• Mobile
• Smart Home
• Gaming
• Speakers
• Video
5. LOGITECH DATA USE CASES
Structured Semi-Structured Unstructured
BatchDataVelocityReal-Time
Social Media
Sentiment
Predictive Analytics
Demand Forecasting
Price violations
on Retail sites
Data Warehousing Text Mining
Security Video
Analysis
Retail Data
scrapping
Machine Learning
ioT
Multi site ERP
Marketing Funnel
Sales Channel Mgmt
Smart Home
Natural Language
Processing (NLP) VR Gaming
6. ANALYTICS AT SCALE - SUMMARY
• Create a decentralised self-service analytics environment for traditional business reporting and
analytics (Descriptive and Diagnostic Analytics). Becomes a purely EXPLICIT experience.
• Allowing for a centralised, cross-functional shared advanced analytics service tasked to deliver Predictive
and Prescriptive analytics to the organisation.
• A minimal investment, with leveraged return.
8. Comes in two complementary flavours, both are REAL-TIME ON DEMAND
QUERY
The next step in our SELF SERVICE model. This is essentially a ‘natural language’ search of
our data to return the relevant answer. A simple working example is IBM’s Watson, which can
query a data set and return specific answers. This technology is still new and somewhat limited.
REPORTING
This is physical report writing or narration. These platforms start by understanding what the user
wants to communicate, perform the relevant analysis to highlight what is most interesting and
important, identify and access the data necessary to tell the story, and finally deliver the analysis
in a personalized, easy-to-consume way: as a narrative
NATURAL LANGUAGE GENERATION
Gartner predicts that by 2018, advanced NLG will be
integrated into the majority of smart data discovery
platforms and that 20% of business content will be
generated by machines.
11. LOGITECH CONFIDENTIAL: NOT FOR DISTRIBUTION
REAL-TIME ON DEMAND delivery to your PHONE and DESKTOP and DASHBOARD
• Executive Summaries
• Customer by Product
• Product by Customer
• Demand / Supply updates
• Market Analytics / Market Share
• Marketing Reports
• Competitive Analysis
• Sentiment
• ...
NLP is a scalable self-service environment, meaning
we can open it to business users (self-service) and
allow them to improve and drive business impact
and adoption. It is language agnostic, meaning we
can publish reports in the language they are
written.
13. SOLUTION ARCHITECTURE
Amazon Web Services
AWS GlacierAWS S3 AWS Redshift
Pentaho DI
Pentaho Operations Mart
Cloudwatch SNSIAM Cloudtrail EMR SPARK Python / R
AWS RDS
Denodo Data Virtualization
Tableau Pentaho BA Data Interfaces Web ServicesOBIEE CUBES
14. JOURNEY TO CLOUD
Cloud empowers IT organizations to redefine the way Data
services are produced and delivered
Scalable • Infrastructure scaled up - down on the fly (Elastic)
• Focus on simplicity, security, robustness, and scalability
Efficient • Infrastructure costs are pay as use
Reliable
• AWS managed services
Managed &
Governed
• Transparency on usage patterns
• Breadth of services offered, pricing, performance and
flexibility
15. NEED FOR DATA VIRTUALIZATION
Abstract access to disparate data sources
A single semantic repository
Optimized data availability in real-time to consumers
Centralized, governed and secured data layer
16. IMPACT OF DATA VIRTUALIZATION PLATFORMS
By 2018, organizations with data virtualization
capabilities will spend 40% less on building and
managing data integration processes for
connecting distributed data assets.”
-Gartner
17. MANAGING BIG DATA WITH DATA LAKES
Organizations are exploring data lakes as consolidated repositories of massive volumes of
raw, detailed data of various types and formats to overcome Big Data challenges.
But creating a physical data lake presents its own hurdles, one of which is the need to store
the data twice which can lead to governance challenges with regard to data access and
quality.
Data Virtualization technologies can improve an organization’s ability to govern and
extract more value from its data lakes by extending them as logical data lakes.
- Ventana Research
18. • Federated Approach
o Queries sent to data sources without much intelligence about the overall
query or the cost of the individual parts of the federated query.
o Each underlying data source performs its portion of the workload as
best it can and returns the results.
o The various parts are combined and additional post-processing
performed if necessary, for example to sort the combined result set.
• DV / Denodo Approach
o Denodo tools consider the costs of each part of the individual query and
evaluate trade-offs and decides on the best way to execute the SQL.
DATA VIRTUALIZATION OVER DATA FEDERATION
19. REFERENCE ARCHITECTURE
Metadata Management, Data Governance, Data Security
Cost and Usage Pattern
Sensor Data
Machine Data Logs
Social Data
Clickstream Data
Internet Data
Image and Video
Cloud Applications
Enterprise
Applications
Data Sources Data Insights
Self-Service /
Data Discovery
Reporting
Predictive Analytics
Statistical Analytics
Sentimental Analytics
Text Analytics
Data Mining
Data Virtualization
Data Collection
Real-Time Data Access (On-Demand / Streaming)
C
D
C
E
T
L
EDW
ODS
Cloud DW
NoSQL
Data Warehouse
File Storage (S3)
Batch DW Spark SQL
NoSQLSearch Search
Big Data
In-Memory
Analytical
Appliances
Real-Time
Decision Support
Alerts
Scorecards/
Dashboards
20. • Logical model can be predefined for the data
• Eliminates load processes and the need to update the data
• Uses the security and governance system already in place
• Collects and maintains statistics and determines optimal query execution
• Avails Cache mechanism and pushdown for optimal performance
• Array of connection options from structured to unstructured data
• Business Layer, enabling data Consistency through single object, multiple
consumers
• Rapid prototyping
• Data Audits
VIRTUALIZATION BENEFITS
21. • Catalog exploration
o Graphical representation of data model
o Data lineage
o Integrated catalog search
• Data Discovery
o User friendly query wizards with drill down capabilities
o Export to CSV, Excel and Tableau Data Extracts
• Secure
o Leverages Denodo’s security model and access control
o Available vis SSL/TLS
GOVERNANCE - DENODO INFORMATION SELF
SERVICE
23. CLOUD AND DV BENEFITS
• Proactive – IT has embraced cloud as a model for achieving
innovation through increased efficiency, reliability and agility
• Reusability and template development
• Rapid innovation within governance structure, balanced
costs, risks and service levels
• Greater efficiency and reliability, enabling broader audience
to consume IT services via self-service
24. LESSONS LEARNT
• Reduced Spend
• Live migration
• Flexible and cost effective
• Better business continuity
• Speed to deliver
• Easier to manage
• More efficient IT operations
Cons
• Upfront hardware costs
• Software license costs
• Possible learning curve
• Accountability
• Getting all vendors to gel well
Pros
25. DENODO - DATA VIRTUALIZATION IN THE CLOUD
Accelerate Adoption of Data Virtualization
Ready-to-use and available on AWS Marketplace.
Dynamic and elastic infrastructure.
Complete with all enterprise-grade features at the
lowest cost.
Only data virtualization platform on AWS.
Licensing Options
Number of Data Sources
Number of concurrent queries and results
AWS unlimited options