Transform Salesforce into the system of engagement for your big data. This session discusses best practices and lessons learned in accessing external data sets in Hadoop or Spark using Salesforce Connect. Leave the big data sets behind the firewall, and give your users on-demand access to big data insights using external objects with Salesforce Connect.
In this session we will cover:
Intro to Salesforce Connect
Intro to Big Data Landscape
How to connect Salesforce to Big Data using External Data Sources
Lessons Learned accessing Big Data using External Objects for native reporting, writes, lookups, search and more
Resources (How to learn more)
1. External Objects for External Big Data
Sumit Sarkar
Chief Data Evangelist
Progress
sumit.sarkar@progress.com
@SAsInSumit
2. Agenda
1. Intro to External Objects for Big Data
• What is Salesforce Connect?
• What is Big Data?
• What is OData?
2. How to access Big Data from Salesforce
3. Live Demo (fingers crossed)
4. Lessons Learned
4. What is Salesforce Connect?
Salesforce Connect maps Salesforce external objects to data tables in external systems. Instead of copying the data into your organization, Salesforce Connect accesses the data on demand and in real time. The data is never stale, and we access only what you need.
Recommended when:
• You have a large amount of data that you don’t want to copy into your Salesforce organization.
• You need small amounts of data at any one time.
• You want real-time access to the latest data.
6. What is Big Data?
Salesforce uses Big Data technologies within its platform (HBase, Pig, Phoenix, etc.), both to support features such as Einstein and to expose data through APIs such as BigObjects.
However, most organizations have their own Big Data technologies to analyze and expose data sets similar in scale or variety…
Don’t ETL Big Data sets into the platform!
7. How my team uses Big Data Technology
Enhance the Customer Journey
What support resources were consumed for each evaluation?
What other product or solution pages were visited to create smarter targeted campaigns?
What is the success score for leads and contacts evaluating products?
9. What is OData?
An open protocol to allow the creation and consumption of queryable and interoperable RESTful APIs in a simple and standard way.
OASIS Standard REST API (“SQL for the web”)
Ratified as an OASIS standard in February 2014
Operations built on REST principles
Uniform URL conventions
Surface metadata in a standard way
First member to join OData Technical Committee
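The uniform URL conventions can be illustrated with a small sketch. It builds a query URL from the standard OData system query options $filter, $select and $top. The service root, the entity set name LeadScores and the field names are hypothetical and do not come from the talk.

```python
from urllib.parse import urlencode, quote


def odata_query_url(service_root, entity_set, filter_expr=None, select=None, top=None):
    """Build an OData query URL using standard system query options."""
    params = {}
    if filter_expr:
        params["$filter"] = filter_expr          # e.g. "score gt 80"
    if select:
        params["$select"] = ",".join(select)     # comma-separated property names
    if top is not None:
        params["$top"] = str(top)                # limit the number of rows returned
    url = f"{service_root.rstrip('/')}/{entity_set}"
    if params:
        # Keep "$", "," and "'" readable; encode spaces as %20 rather than "+".
        url += "?" + urlencode(params, quote_via=quote, safe="$,'")
    return url


# Hypothetical service root and entity set, for illustration only.
print(odata_query_url(
    "https://example.com/odata/",
    "LeadScores",
    filter_expr="score gt 80",
    select=["lead_id", "score"],
    top=10,
))
```

The same service root with /$metadata appended is where an OData service describes its entities, which is the "surface metadata in a standard way" point on this slide.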
12. How to produce OData from Big Data?
Use Salesforce Connect External Objects (intro’d Winter ‘15)
Open Source Technologies
DataDirect Cloud
Apex Connector Framework
13. Connect Salesforce to Big Data
Success Scoring
Personalization
Archived Insight
360 Reporting
Corporate Firewall
17. Lessons Learned
1. Mapping OData entities to Big Data objects
2. Primary keys for Big Data entities
3. HiveServer1 vs HiveServer2 for concurrency
4. External Objects have limits and a 2-minute max timeout
5. Native Reporting support being added in Winter ‘17
6. Search considerations
7. Need agile OData service with Data Lake
8. Data Governance and Masking
9. CRM User Experience (strategies to improve performance)
Accessing external Big Data objects
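Lesson 2 can be sketched in code. The slide does not say how keys were handled in the talk, so the following is only one possible approach under a stated assumption: the big data table has no single unique column, and a stable key is derived from a combination of columns that together identify a row. The function name, column names and key length are all hypothetical.

```python
import hashlib
import json


def synthetic_key(row, key_columns):
    """Derive a stable, fixed-length key from the columns that identify a row."""
    parts = []
    for col in key_columns:
        value = row[col]
        if value is None:
            # A missing key part would make the key ambiguous, so refuse it.
            raise ValueError(f"key column {col!r} is null")
        parts.append(str(value))
    # JSON-encode the list so ("a|b", "c") and ("a", "b|c") cannot collide.
    raw = json.dumps(parts)
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()[:32]


# Hypothetical row from a page-visit table.
row = {"lead_id": 42, "page": "/pricing", "visits": 7}
print(synthetic_key(row, ["lead_id", "page"]))
```

Because the key depends only on the identifying columns, the same row yields the same key on every request, which is what lets a record be looked up again later by that key.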
18. Design Patterns for external objects
Enable Separate Loading of Related Lists of External Objects
Performance Tuning Tips for Related Lists in Account (000148978)
19. Decrease latency accessing Big Data over Hive
Stuff Salesforce devs don’t care about
1. Use Apache Tez as execution engine for Hive
2. Use ORC, a newer columnar storage file format
3. Use vectorized query execution (Hive 0.13)
4. Performance Tuning (Partitions, Indexes, Buckets, Block Sizes, etc.)
5. Consider another query interface (e.g. Apache HAWQ)
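The first three tips map onto Hive session settings and a storage clause. The sketch below only assembles the HiveQL statements as strings. Sending them to Hive needs a client connection, which is not shown, and the table names are hypothetical.

```python
def hive_tuning_statements(source_table, orc_table):
    """Return HiveQL statements for Tez, vectorization and an ORC copy of a table."""
    return [
        # Tip 1: run queries on Tez instead of MapReduce.
        "SET hive.execution.engine=tez",
        # Tip 3: process rows in batches (available from Hive 0.13).
        "SET hive.vectorized.execution.enabled=true",
        # Tip 2: rewrite the data into an ORC-backed table.
        f"CREATE TABLE {orc_table} STORED AS ORC AS SELECT * FROM {source_table}",
    ]


# Hypothetical table names, for illustration only.
for statement in hive_tuning_statements("page_visits_text", "page_visits_orc"):
    print(statement + ";")
```

Whether these settings help depends on the cluster and the Hive version, so they are a starting point for testing rather than a guaranteed fix.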