SlideShare a Scribd company logo
1 of 32
Download to read offline
© 2015 ligaDATA, Inc. All Rights Reserved.
Driving Business
Value Through 

Real-Time
Decisioning
Solutions
July 2015
Download, Forums, Docs, Events http://Kamanja.org 
ligaDATA
2
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Outline
Motivation Case Study – Modeling Department
Review Mining and Big Data Tools
Solution: Predictive Markup Modeling Language (PMML)
Reviewing Big Data Space and Real Time
Kamanja Integration (Open Source PMML)
Use Cases, Demo, Architecture
3
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Audience Survey (show of hands)
Data Mining Experience
__% Read or heard about
__% Class or competition
__% Put a model into production
__% Have put 10+ models in production
__% Have put 75+ models in production
Big Data Experience
__% Read or heard about
__% Class or exploration project
__% Put a system into production
__% System with 3+ OSS in prod
__% System with 6+ OSS or PB+ in
production
Extensive Data Mining AND Big Data Experience
__% with 10+ models AND 3+ OSS
__% with 75+ models AND 6+ OSS / PB+
Overlap on extensive experience is rare
This is what Kamanja helps with
4
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Case Study of a Modeling Department
Financial Fraud Detection
CONTEXT
•  3 modelers, 2 data infrastructure people in department
•  Over 3 dozen predictive models in production, high $$$$ and visibility
•  Separate Operations group deploying models
PROBLEM
•  Models were getting stale
•  “Spinning Plates” between short term solutions
•  2 months for a full model training investigation
•  2 months to put a model into production (OUCH)
Had to completely re-code the preprocessing and model scoring
Operations had One process to deploy a regression
Operations had a different process to deploy a decision tree
5
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Case Study of a Modeling Department
Financial Fraud Detection
INDUSTRY REVIEW to answer:
•  How common is it to use many algorithms or tools in a project?
•  What is an easier way to deploy models?
6
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
http://www.kdnuggets.com/polls/2015/analytics-data-mining-data-science-software-used.html
In the industry, many algorithms and
tools are used
Need to simplify DEPLOYMENT
7
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
http://www.kdnuggets.com/2015/06/data-mining-data-science-tools-associations.html
Independent use of tools
8
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
http://www.kdnuggets.com/2015/06/data-mining-data-science-tools-associations.html
Tools used in combination
9
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Scoring
Engine
(Kamanja)
PMML Diagram
Predictive Modeling Markup Language
Training & test data
(batch)
Data
Mining
Tool File, Save As
PMML
PMML
File
PMML
Producer
PMML
FileScoring data
(real time streaming)
Output data has
new score field
Training Project Phase
Production Scoring Project Phase
Full model
specification
PMML Consumer
10
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Given industry fragmentation,
PMML is a solution
PMML Producers (18 companies)
•  R (Rattle, PMML)*
•  RapidMiner
•  KNIME*
PMML Consumers (12 co)
•  Zementis
•  IBM SPSS
•  KNIME
•  Microstrategy
•  SAS
•  Kamanja* (Open Source)
•  Spark (MLib)* * = Open Source
•  Weka*
•  SAS Enterprise Miner
PREDICTIVE
Naïve Bayes
Neural Net
Regression
Rules
Scorecard
Sequence
SVM
Time Series
Trees
DESCRIPTIVE / OTH
Association Rules
Cluster, K-Nearest Nb
Text Models
model ensembles &
composition
(i.e. Gradient
Boosting)
11
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Case Study of a Modeling Department
Financial Fraud Detection
SOLUTION OBJECTIVES
1) Support a wider variety of algorithms and software (increase accuracy)
2) Decrease time on putting models into production (incr analysis time)
12
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Case Study of a Modeling Department
Financial Fraud Detection
SOLUTION OBJECTIVES
1) Support a wider variety of algorithms and software (increase accuracy)
2) Decrease time on putting models into production (incr analysis time)
SOLUTION
1) Train models in SAS Enterprise Miner, R (PMML Producers)
2) Score models with a RESTful call to a PMML Consumer (Zementis)
Predictive Modeling Markup Language (PMML) is a type of XML
RESULT
1) By supporting more software & algorithms – MORE ACCURATE!
2) PUT MODELS INTO PRODUCTION from 8 weeks to down to 2-5 days!
Greatly increased throughput of training new models!
13
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Outline
Motivation Case Study – Modeling Department
Review Mining and Big Data Tools
Solution: Predictive Markup Modeling Language (PMML)
Other Uses of Real Time Decisioning
Reviewing Big Data Space and Real Time
Kamanja Integration (Open Source PMML)
Use Cases, Demo, Architecture
14
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Other Uses of PMML or
Real-Time Decisioning
Complex Event Processing (CEP)
•  Possibly 100’s of concurrent data streams
•  Apply rule logic, select, aggregate
•  Select action on elements in stream
Enterprise Applications, During …
•  customer call or chat: recommendations to improve service
•  card transaction: offer credit increase
•  web application: pre-approval
•  web transaction: recommend other product(s)
•  MOOC: customize training speed for the student
(Custom Java Model)
© 2015 ligaDATA, Inc. All Rights Reserved.
 15
ligaDATA
Real Time Computing
OSS Technology Stack 
Integration with Kamanja
Kamanja
(PMML/Java/Scala Consumer)
High level languages / abstractions
Compute
Fabric
Cloud, EC2
Internal Cloud
Security
Kerberos
Real Time
Streaming
Kafka,
MQ
Spark*
ligaDATA
Data Store
HBase,
Cassandra,
InfluxDB
HDFS
(Create
adaptors to
integrate
others)
Resource
Management
Zookeeper,
Yarn*,
Mesos*
High Level Languages /
Abstractions
MLlib* (PMML Producers)
© 2015 ligaDATA, Inc. All Rights Reserved.
 16
ligaDATA
Real Time Open Source Systems (OSS) 
Kamanja and Spark are good Compliments
Clarify with a feature list,
Use case to work together
17
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Higher Requirements in 

Financial Services or Health Care
Compared to social media or web apps
© 2015 ligaDATA, Inc. All Rights Reserved.
CONFIDENTIAL 
17
Legal Compliance to meet exacting
technical standards
•  Losing (or duplicating) a bank transaction
•  Losing a medical record
•  Executives or employees can GO TO JAIL
What is different about these industries?
•  Regulatory requirements requires 100% data protection
•  Security
•  Auditability
•  Lineage
•  ZERO data loss
ligaDATA
18
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
•  Migrated from mass generalized
communications to real time
personalized alerts
•  Increased messaging effectiveness
of 400% lift in conversion to digital
•  Full integration with Mainframe
•  Leverage streaming transaction
and customer account data
Business Objectives
•  Reduce operating cost of Calling Centers
•  Increase customer adoption of digital channels
•  Interact with Customer at point of transaction
IT Objectives
•  Implement cost effective and scalable platform
•  Satisfy financial services security and 

compliance req.’s
•  Integrate with existing core systems
Driving Digital Adoption:
the Bank’s Call Center
© 2015 ligaDATA, Inc. All Rights Reserved.
CONFIDENTIAL 
18
Results
ligaDATA
© 2015 ligaDATA, Inc. All Rights Reserved.
 19
ligaDATA
Medical Company use
of Kamanja
Lines of Business
Run Medco models
supplying client's
intelligence based upon
model findings 

(using multi tenant deployment when
appropriate)
Run Customer models 

on Medco hardware

(on Medco owned 

customer private net)
Consult/partner with
Medco customers
providing software
solutions to be run on
Customer net
© 2015 ligaDATA, Inc. All Rights Reserved.
 20
ligaDATA
•  Clinicians (knowledge experts) develop heuristic based rule set models
•  The initial model was COPD (Chronic Obstructive Pulmonary Disease) risk
assessment
•  Models are expressed with a Domain Specific Language (DSL) they developed
•  DSL models are transformed to PMML for Kamanja
•  Models consume current + prior related messages over “look back period”
Save the “assertions” of a patient in the database (beyond standard PMML)
•  Medco plans to integrate the DSL with their ontology data modeling effort
•  Goal is to generate new models as their “medical world” ontology evolves
Medical Company use
of Kamanja
© 2015 ligaDATA, Inc. All Rights Reserved.
 21
ligaDATA
DECISION
GATEWAYS
Representative Kamanja 

Solution Architecture
MAINFRAME
DB2
SERVER
INTEGRATION
CDC
Kafka
Inbound
Queue
Kamanja
Security Management
Error Management
Metadata Service/Cache
Storage Service/Cache
Message
Construction
Decision 

Engine
Output 

Handler
Transform
Compute
MakeObj
Parallel
DAG
Executor
DAG
Optimizer
DAG
Generator
Change
Listner
Output
Generator
Output
Disctributor
DBs
Apps
Notification
Engine
DATA SOURCES
DW
Customer
Preferences
Kafka
Outbound
Queue
HBase - History
 HDFS – Long term storage
Zookeeper – Resource Management
ligaDATA
© 2015 ligaDATA, Inc. All Rights Reserved.
 22
ligaDATA
Performance

Characteristics
© 2015 ligaDATA, Inc. All Rights Reserved.
CONFIDENTIAL 
22
Performance
•  Throughput of million messages/second
•  Uses commodity hardware
Scalability
•  Linear scalability vertically and horizontally
•  Data partitioning support
•  Runtime multi-model optimizations to
supports thousands of models
•  Consistent performance on hundreds of
models and thousands of rules
Built for IoT
data volumes
ligaDATA
Data
Transformer
Data History
(Cassandra,
HBase)
Metadata
(expanded in next slides)
Model
Runtime
Output
Dispatcher
Kafka
Queue
Input
Adapter
Output
Adapter
Next
Process
Kamanja Engine
Kamanja Execution
Flow on a Node
Storage
Adapter
ligaDATA
Metadata
Functions
(in PMML, Scala as
User Defined Func (UDF))
Models
(PMML Rule Set, i.e.
fraud, attrition)
Messages
(from input queue, real
time records)
Containers
(i.e. a record or lookup table
to provide context, priors)
Types
(i.e. array of patients, Dr’s,
types of containers)
Concepts
(PMML created fields,
preprocessing, scores)
Metadata API
Elements
ligaDATA
Metadata
Metadata API
Scoring Engine
Manager
(within a model)
Model Manager
(activate, control
a DAG of many
models)
PMML Producer, or
application
Admin App, used by DevOps
Activate PMML Model or DAG
Rest API
Metadata API
Subsystems
Configuration
(Cluster, Engine, Model Compilation)
Kamanja Engine
ligaDATA
Model Runtime
Kamanja Runtime
Model Execution
Transformer
Data History / Metadata,
(HBase, Cassandra, ..)
Msg
Storage
Adapter(s)
Metadata
Instance
Model
Object
Model
Factory
1) Message rec by
runtime engine
2) Metadata is checked
To see what model is
Interested in the message 3) Model object
Is instantiated
5) Msgs committed
to history
4) Model is executed
on the Message obj
6) Output of the model
is returned to the engine
ligaDATA
If the node that crashes is a Kamanja Slave node
•  The Kamanja Leader Node rebalances over all Kamanja nodes
•  Each message is processed EXACTLY ONCE
•  A Bank needs to process a transaction ONCE AND ONLY ONCE
•  Look at the state of every message through each step
If the Kamanja Leader node goes down,
•  The next node on the list becomes the Leader, then rebalance
COMPARE TO:
•  Spark and Storm would execute each message AT LEAST ONCE (but may
process a message 2, 3 or 4 times…).
•  The expectation is for the application to handle possible dup.
What happens when
a node goes down?
ligaDATA
© 2015 ligaDATA, Inc. All Rights Reserved.
 28
ligaDATA
Kamanja Integration
Points
•  We provide with an enterprise friendly license
(No GPL License virus to infect the entire system)
•  Adaptors: for any data flow
Kafka, IBMs MQ, Hbase, Cassandra, InfluxDB, Zookeeper, Spark
•  User Defined Functions:
Provide a JAR file or Scala function
•  Custom Java Model
Can skip PMML, leverage Adaptors and UDFs
Import generated Java code
29
© 2015 ligaDATA, Inc. All Rights Reserved.
ligaDATA
Deploy Predictive Models and
Rules in 1/100th the time it takes
today
•  Kamanja is an open source, real time decisioning
engine
•  Hardened to meet strictest requirements of Financial
Services, Healthcare and scalable to handle IoT
•  Kamanja Enables Developers and Data Scientists to
reduce time to deploy Rules and Predictive Models
•  Kamanja integrates with your Big Data ecosystem
© 2015 ligaDATA, Inc. All Rights Reserved.
 30
ligaDATA
Planned Kamanja
Differentiation
•  Model management, enable DevOps for models

DevOps: automated testing, validation, deployment and rollback

A/B testing to competitively roll out model update, scheduling
•  Enterprise Level Security and Multiple-Tenancy

Integration using Kerberos

Role based security for model management

Security at field level for models, “need to know/access”
•  Multi tenancy

partition internal groups in different tenancies

Data isolation, resource management, SLA support
•  Data Integration

Built-in integrations for social data and third party data

Can consume 100s of different event and document types
© 2015 ligaDATA, Inc. All Rights Reserved.
 31
ligaDATA
Planned Kamanja
Differentiation
•  Performance and Scale

Dynamic scaling – enlarge and shrink as needed, based on load

Leap in performance by generating native code (vs. Java)

Cost aware execution in cloud environment
•  Extensive integrations with enterprise queue, storage and indexing

MQ, HBase, Cassandra, RDBMS, Elastic Search, Zookeeper
•  Domain specific libraries and model templates to speed up preprocessing,
business logic and algorithms
© 2015 ligaDATA, Inc. All Rights Reserved.
Try out

Kamanja
© 2015 ligaDATA, Inc. All Rights Reserved.
CONFIDENTIAL 
Download, Forums, Docs, Events http://Kamanja.org 
ligaDATA

More Related Content

What's hot

Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessInside Analysis
 
Data Democratization at Nubank
 Data Democratization at Nubank Data Democratization at Nubank
Data Democratization at NubankDatabricks
 
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache SparkPolymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache SparkDatabricks
 
Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!DataWorks Summit
 
Deep Credit Risk Ranking with LSTM with Kyle Grove
Deep Credit Risk Ranking with LSTM with Kyle GroveDeep Credit Risk Ranking with LSTM with Kyle Grove
Deep Credit Risk Ranking with LSTM with Kyle GroveDatabricks
 
Accelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time AnalyticsAccelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time AnalyticsArcadia Data
 
Flash session -goldengate--lht1053-lon
Flash session -goldengate--lht1053-lonFlash session -goldengate--lht1053-lon
Flash session -goldengate--lht1053-lonJeffrey T. Pollock
 
Scaling Face Recognition with Big Data
Scaling Face Recognition with Big DataScaling Face Recognition with Big Data
Scaling Face Recognition with Big DataBogdan Bocse
 
GoldenGate and Stream Processing with Special Guest Rakuten
GoldenGate and Stream Processing with Special Guest RakutenGoldenGate and Stream Processing with Special Guest Rakuten
GoldenGate and Stream Processing with Special Guest RakutenJeffrey T. Pollock
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in productionTuri, Inc.
 
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemStrata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemShirshanka Das
 
Data Science Driven Malware Detection
Data Science Driven Malware DetectionData Science Driven Malware Detection
Data Science Driven Malware DetectionVMware Tanzu
 
Deep Learning Image Processing Applications in the Enterprise
Deep Learning Image Processing Applications in the EnterpriseDeep Learning Image Processing Applications in the Enterprise
Deep Learning Image Processing Applications in the EnterpriseGanesan Narayanasamy
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsRyan Gross
 
Big data and its impact on SOA
Big data and its impact on SOABig data and its impact on SOA
Big data and its impact on SOADemed L'Her
 

What's hot (20)

Data engineering design patterns
Data engineering design patternsData engineering design patterns
Data engineering design patterns
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
Msst 2019 v4
Msst 2019 v4Msst 2019 v4
Msst 2019 v4
 
Data Democratization at Nubank
 Data Democratization at Nubank Data Democratization at Nubank
Data Democratization at Nubank
 
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache SparkPolymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
Polymorphic Table Functions: The Best Way to Integrate SQL and Apache Spark
 
Hadoop dev 01
Hadoop dev 01Hadoop dev 01
Hadoop dev 01
 
Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!
 
Deep Credit Risk Ranking with LSTM with Kyle Grove
Deep Credit Risk Ranking with LSTM with Kyle GroveDeep Credit Risk Ranking with LSTM with Kyle Grove
Deep Credit Risk Ranking with LSTM with Kyle Grove
 
Accelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time AnalyticsAccelerating Data Lakes and Streams with Real-time Analytics
Accelerating Data Lakes and Streams with Real-time Analytics
 
Flash session -goldengate--lht1053-lon
Flash session -goldengate--lht1053-lonFlash session -goldengate--lht1053-lon
Flash session -goldengate--lht1053-lon
 
Scaling Face Recognition with Big Data
Scaling Face Recognition with Big DataScaling Face Recognition with Big Data
Scaling Face Recognition with Big Data
 
GoldenGate and Stream Processing with Special Guest Rakuten
GoldenGate and Stream Processing with Special Guest RakutenGoldenGate and Stream Processing with Special Guest Rakuten
GoldenGate and Stream Processing with Special Guest Rakuten
 
Machine learning in production
Machine learning in productionMachine learning in production
Machine learning in production
 
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystemStrata 2016 - Architecting for Change: LinkedIn's new data ecosystem
Strata 2016 - Architecting for Change: LinkedIn's new data ecosystem
 
Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Data Science Driven Malware Detection
Data Science Driven Malware DetectionData Science Driven Malware Detection
Data Science Driven Malware Detection
 
Deep Learning Image Processing Applications in the Enterprise
Deep Learning Image Processing Applications in the EnterpriseDeep Learning Image Processing Applications in the Enterprise
Deep Learning Image Processing Applications in the Enterprise
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data ops
 
Smart data for a predictive bank
Smart data for a predictive bankSmart data for a predictive bank
Smart data for a predictive bank
 
Big data and its impact on SOA
Big data and its impact on SOABig data and its impact on SOA
Big data and its impact on SOA
 

Viewers also liked

Using Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical ApplicationsUsing Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical ApplicationsGreg Makowski
 
Linked In Slides 2009 02 24 B
Linked In Slides 2009 02 24 BLinked In Slides 2009 02 24 B
Linked In Slides 2009 02 24 BGreg Makowski
 
Production model lifecycle management 2016 09
Production model lifecycle management 2016 09Production model lifecycle management 2016 09
Production model lifecycle management 2016 09Greg Makowski
 
SFbayACM ACM Data Science Camp 2015 10 24
SFbayACM ACM Data Science Camp 2015 10 24SFbayACM ACM Data Science Camp 2015 10 24
SFbayACM ACM Data Science Camp 2015 10 24Greg Makowski
 
The 360º Leader (Section 2 of 6)
The 360º Leader (Section 2 of 6)The 360º Leader (Section 2 of 6)
The 360º Leader (Section 2 of 6)Greg Makowski
 
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...Greg Makowski
 
Heuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchHeuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchGreg Makowski
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLPaco Nathan
 
The 360º Leader (Section 1 of 6)
The 360º Leader (Section 1 of 6)The 360º Leader (Section 1 of 6)
The 360º Leader (Section 1 of 6)Greg Makowski
 
Machine Learning In Production
Machine Learning In ProductionMachine Learning In Production
Machine Learning In ProductionSamir Bessalah
 
Three case studies deploying cluster analysis
Three case studies deploying cluster analysisThree case studies deploying cluster analysis
Three case studies deploying cluster analysisGreg Makowski
 
K-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsK-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsVarad Meru
 
Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples Edureka!
 
PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014Sri Ambati
 
Cluster analysis for market segmentation
Cluster analysis for market segmentationCluster analysis for market segmentation
Cluster analysis for market segmentationVishal Tandel
 

Viewers also liked (19)

Using Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical ApplicationsUsing Deep Learning to do Real-Time Scoring in Practical Applications
Using Deep Learning to do Real-Time Scoring in Practical Applications
 
Linked In Slides 2009 02 24 B
Linked In Slides 2009 02 24 BLinked In Slides 2009 02 24 B
Linked In Slides 2009 02 24 B
 
Production model lifecycle management 2016 09
Production model lifecycle management 2016 09Production model lifecycle management 2016 09
Production model lifecycle management 2016 09
 
SFbayACM ACM Data Science Camp 2015 10 24
SFbayACM ACM Data Science Camp 2015 10 24SFbayACM ACM Data Science Camp 2015 10 24
SFbayACM ACM Data Science Camp 2015 10 24
 
The 360º Leader (Section 2 of 6)
The 360º Leader (Section 2 of 6)The 360º Leader (Section 2 of 6)
The 360º Leader (Section 2 of 6)
 
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
Using Deep Learning to do Real-Time Scoring in Practical Applications - 2015-...
 
Heuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchHeuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient search
 
Data Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAMLData Workflows for Machine Learning - Seattle DAML
Data Workflows for Machine Learning - Seattle DAML
 
The 360º Leader (Section 1 of 6)
The 360º Leader (Section 1 of 6)The 360º Leader (Section 1 of 6)
The 360º Leader (Section 1 of 6)
 
Machine Learning In Production
Machine Learning In ProductionMachine Learning In Production
Machine Learning In Production
 
Three case studies deploying cluster analysis
Three case studies deploying cluster analysisThree case studies deploying cluster analysis
Three case studies deploying cluster analysis
 
K-Means, its Variants and its Applications
K-Means, its Variants and its ApplicationsK-Means, its Variants and its Applications
K-Means, its Variants and its Applications
 
Fraud Detection Architecture
Fraud Detection ArchitectureFraud Detection Architecture
Fraud Detection Architecture
 
Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples Application of Clustering in Data Science using Real-life Examples
Application of Clustering in Data Science using Real-life Examples
 
Deep Learning for Fraud Detection
Deep Learning for Fraud DetectionDeep Learning for Fraud Detection
Deep Learning for Fraud Detection
 
PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014PayPal's Fraud Detection with Deep Learning in H2O World 2014
PayPal's Fraud Detection with Deep Learning in H2O World 2014
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Cluster Analysis for Dummies
Cluster Analysis for DummiesCluster Analysis for Dummies
Cluster Analysis for Dummies
 
Cluster analysis for market segmentation
Cluster analysis for market segmentationCluster analysis for market segmentation
Cluster analysis for market segmentation
 

Similar to Kamanja: Driving Business Value through Real-Time Decisioning Solutions

MODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionMODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionIoan Toma
 
MODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionMODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionLDBC council
 
Big Data - Hadoop and MapReduce for QA and testing by Aditya Garg
Big Data - Hadoop and MapReduce for QA and testing by Aditya GargBig Data - Hadoop and MapReduce for QA and testing by Aditya Garg
Big Data - Hadoop and MapReduce for QA and testing by Aditya GargQA or the Highway
 
Data Science at Scale on MPP databases - Use Cases & Open Source Tools
Data Science at Scale on MPP databases - Use Cases & Open Source ToolsData Science at Scale on MPP databases - Use Cases & Open Source Tools
Data Science at Scale on MPP databases - Use Cases & Open Source ToolsEsther Vasiete
 
Paris FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationParis FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationAbdelkrim Hadjidj
 
Motadata - Unified Product Suite for IT Operations and Big Data Analytics
Motadata - Unified Product Suite for IT Operations and Big Data AnalyticsMotadata - Unified Product Suite for IT Operations and Big Data Analytics
Motadata - Unified Product Suite for IT Operations and Big Data Analyticsnovsela
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderDataconomy Media
 
Open Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache GeodeOpen Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache GeodeApache Geode
 
An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)Anthony Baker
 
Big Data, Big Picture: Can You See It?
Big Data, Big Picture: Can You See It?Big Data, Big Picture: Can You See It?
Big Data, Big Picture: Can You See It?CA Technologies
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Pactera_US
 
QCon 2018 | Gimel | PayPal's Analytic Platform
QCon 2018 | Gimel | PayPal's Analytic PlatformQCon 2018 | Gimel | PayPal's Analytic Platform
QCon 2018 | Gimel | PayPal's Analytic PlatformDeepak Chandramouli
 
Technology Primer: Hey IT—Your Big Data Infrastructure Can’t Sit in a Silo An...
Technology Primer: Hey IT—Your Big Data Infrastructure Can’t Sit in a Silo An...Technology Primer: Hey IT—Your Big Data Infrastructure Can’t Sit in a Silo An...
Technology Primer: Hey IT—Your Big Data Infrastructure Can’t Sit in a Silo An...CA Technologies
 
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven EnterprisePivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven EnterpriseVMware Tanzu
 
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?SnapLogic
 
Jupyter in the modern enterprise data and analytics ecosystem
Jupyter in the modern enterprise data and analytics ecosystem Jupyter in the modern enterprise data and analytics ecosystem
Jupyter in the modern enterprise data and analytics ecosystem Gerald Rousselle
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Denodo
 
Migrating from Oracle to Postgres
Migrating from Oracle to PostgresMigrating from Oracle to Postgres
Migrating from Oracle to PostgresEDB
 
Pivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical OverviewPivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical OverviewVMware Tanzu
 
Veritas + MongoDB
Veritas + MongoDBVeritas + MongoDB
Veritas + MongoDBMongoDB
 

Similar to Kamanja: Driving Business Value through Real-Time Decisioning Solutions (20)

MODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionMODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service Selection
 
MODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionMODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service Selection
 
Big Data - Hadoop and MapReduce for QA and testing by Aditya Garg
Big Data - Hadoop and MapReduce for QA and testing by Aditya GargBig Data - Hadoop and MapReduce for QA and testing by Aditya Garg
Big Data - Hadoop and MapReduce for QA and testing by Aditya Garg
 
Data Science at Scale on MPP databases - Use Cases & Open Source Tools
Data Science at Scale on MPP databases - Use Cases & Open Source ToolsData Science at Scale on MPP databases - Use Cases & Open Source Tools
Data Science at Scale on MPP databases - Use Cases & Open Source Tools
 
Paris FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant PresentationParis FOD Meetup #5 Cognizant Presentation
Paris FOD Meetup #5 Cognizant Presentation
 
Motadata - Unified Product Suite for IT Operations and Big Data Analytics
Motadata - Unified Product Suite for IT Operations and Big Data AnalyticsMotadata - Unified Product Suite for IT Operations and Big Data Analytics
Motadata - Unified Product Suite for IT Operations and Big Data Analytics
 
Embedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern StaenderEmbedded-ml(ai)applications - Bjoern Staender
Embedded-ml(ai)applications - Bjoern Staender
 
Open Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache GeodeOpen Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache Geode
 
An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)
 
Big Data, Big Picture: Can You See It?
Big Data, Big Picture: Can You See It?Big Data, Big Picture: Can You See It?
Big Data, Big Picture: Can You See It?
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
 
QCon 2018 | Gimel | PayPal's Analytic Platform
QCon 2018 | Gimel | PayPal's Analytic PlatformQCon 2018 | Gimel | PayPal's Analytic Platform
QCon 2018 | Gimel | PayPal's Analytic Platform
 
Technology Primer: Hey IT—Your Big Data Infrastructure Can’t Sit in a Silo An...
Technology Primer: Hey IT—Your Big Data Infrastructure Can’t Sit in a Silo An...Technology Primer: Hey IT—Your Big Data Infrastructure Can’t Sit in a Silo An...
Technology Primer: Hey IT—Your Big Data Infrastructure Can’t Sit in a Silo An...
 
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven EnterprisePivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
 
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
Intelligent data summit: Self-Service Big Data and AI/ML: Reality or Myth?
 
Jupyter in the modern enterprise data and analytics ecosystem
Jupyter in the modern enterprise data and analytics ecosystem Jupyter in the modern enterprise data and analytics ecosystem
Jupyter in the modern enterprise data and analytics ecosystem
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 
Migrating from Oracle to Postgres
Migrating from Oracle to PostgresMigrating from Oracle to Postgres
Migrating from Oracle to Postgres
 
Pivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical OverviewPivotal Big Data Suite: A Technical Overview
Pivotal Big Data Suite: A Technical Overview
 
Veritas + MongoDB
Veritas + MongoDBVeritas + MongoDB
Veritas + MongoDB
 

More from Greg Makowski

Understanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptxUnderstanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptxGreg Makowski
 
Future of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptxFuture of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptxGreg Makowski
 
A Successful Hiring Process for Data Scientists
A Successful Hiring Process for Data ScientistsA Successful Hiring Process for Data Scientists
A Successful Hiring Process for Data ScientistsGreg Makowski
 
Kdd 2019: Standardizing Data Science to Help Hiring
Kdd 2019:  Standardizing Data Science to Help HiringKdd 2019:  Standardizing Data Science to Help Hiring
Kdd 2019: Standardizing Data Science to Help HiringGreg Makowski
 
Tales from an ip worker in consulting and software
Tales from an ip worker in consulting and softwareTales from an ip worker in consulting and software
Tales from an ip worker in consulting and softwareGreg Makowski
 
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Greg Makowski
 

More from Greg Makowski (6)

Understanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptxUnderstanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptx
 
Future of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptxFuture of AI - 2023 07 25.pptx
Future of AI - 2023 07 25.pptx
 
A Successful Hiring Process for Data Scientists
A Successful Hiring Process for Data ScientistsA Successful Hiring Process for Data Scientists
A Successful Hiring Process for Data Scientists
 
Kdd 2019: Standardizing Data Science to Help Hiring
Kdd 2019:  Standardizing Data Science to Help HiringKdd 2019:  Standardizing Data Science to Help Hiring
Kdd 2019: Standardizing Data Science to Help Hiring
 
Tales from an ip worker in consulting and software
Tales from an ip worker in consulting and softwareTales from an ip worker in consulting and software
Tales from an ip worker in consulting and software
 
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...Predictive Model and Record Description with Segmented Sensitivity Analysis (...
Predictive Model and Record Description with Segmented Sensitivity Analysis (...
 

Recently uploaded

Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 

Recently uploaded (20)

Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 

Kamanja: Driving Business Value through Real-Time Decisioning Solutions

  • 1. © 2015 ligaDATA, Inc. All Rights Reserved. Driving Business Value Through 
 Real-Time Decisioning Solutions July 2015 Download, Forums, Docs, Events http://Kamanja.org ligaDATA
  • 2. 2 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Outline Motivation Case Study – Modeling Department Review Mining and Big Data Tools Solution: Predictive Markup Modeling Language (PMML) Reviewing Big Data Space and Real Time Kamanja Integration (Open Source PMML) Use Cases, Demo, Architecture
  • 3. 3 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Audience Survey (show of hands) Data Mining Experience __% Read or heard about __% Class or competition __% Put a model into production __% Have put 10+ models in production __% Have put 75+ models in production Big Data Experience __% Read or heard about __% Class or exploration project __% Put a system into production __% System with 3+ OSS in prod __% System with 6+ OSS or PB+ in production Extensive Data Mining AND Big Data Experience __% with 10+ models AND 3+ OSS __% with 75+ models AND 6+ OSS / PB+ Overlap on extensive experience is rare This is what Kamanja helps with
  • 4. 4 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Case Study of a Modeling Department Financial Fraud Detection CONTEXT •  3 modelers, 2 data infrastructure people in department •  Over 3 dozen predictive models in production, high $$$$ and visibility •  Separate Operations group deploying models PROBLEM •  Models were getting stale •  “Spinning Plates” between short term solutions •  2 months for a full model training investigation •  2 months to put a model into production (OUCH) Had to completely re-code the preprocessing and model scoring Operations had One process to deploy a regression Operations had a different process to deploy a decision tree
  • 5. 5 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Case Study of a Modeling Department Financial Fraud Detection INDUSTRY REVIEW to answer: •  How common is it to use many algorithms or tools in a project? •  What is an easier way to deploy models?
  • 6. 6 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA http://www.kdnuggets.com/polls/2015/analytics-data-mining-data-science-software-used.html In the industry, many algorithms and tools are used Need to simplify DEPLOYMENT
  • 7. 7 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA http://www.kdnuggets.com/2015/06/data-mining-data-science-tools-associations.html Independent use of tools
  • 8. 8 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA http://www.kdnuggets.com/2015/06/data-mining-data-science-tools-associations.html Tools used in combination
  • 9. 9 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Scoring Engine (Kamanja) PMML Diagram Predictive Modeling Markup Language Training & test data (batch) Data Mining Tool File, Save As PMML PMML File PMML Producer PMML FileScoring data (real time streaming) Output data has new score field Training Project Phase Production Scoring Project Phase Full model specification PMML Consumer
  • 10. 10 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Given industry fragmentation, PMML is a solution PMML Producers (18 companies) •  R (Rattle, PMML)* •  RapidMiner •  KNIME* PMML Consumers (12 co) •  Zementis •  IBM SPSS •  KNIME •  Microstrategy •  SAS •  Kamanja* (Open Source) •  Spark (MLib)* * = Open Source •  Weka* •  SAS Enterprise Miner PREDICTIVE Naïve Bayes Neural Net Regression Rules Scorecard Sequence SVM Time Series Trees DESCRIPTIVE / OTH Association Rules Cluster, K-Nearest Nb Text Models model ensembles & composition (i.e. Gradient Boosting)
  • 11. 11 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Case Study of a Modeling Department Financial Fraud Detection SOLUTION OBJECTIVES 1) Support a wider variety of algorithms and software (increase accuracy) 2) Decrease time on putting models into production (incr analysis time)
  • 12. 12 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Case Study of a Modeling Department Financial Fraud Detection SOLUTION OBJECTIVES 1) Support a wider variety of algorithms and software (increase accuracy) 2) Decrease time on putting models into production (incr analysis time) SOLUTION 1) Train models in SAS Enterprise Miner, R (PMML Producers) 2) Score models with a RESTful call to a PMML Consumer (Zementis) Predictive Modeling Markup Language (PMML) is a type of XML RESULT 1) By supporting more software & algorithms – MORE ACCURATE! 2) PUT MODELS INTO PRODUCTION from 8 weeks to down to 2-5 days! Greatly increased throughput of training new models!
  • 13. 13 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Outline Motivation Case Study – Modeling Department Review Mining and Big Data Tools Solution: Predictive Markup Modeling Language (PMML) Other Uses of Real Time Decisioning Reviewing Big Data Space and Real Time Kamanja Integration (Open Source PMML) Use Cases, Demo, Architecture
  • 14. 14 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Other Uses of PMML or Real-Time Decisioning Complex Event Processing (CEP) •  Possibly 100’s of concurrent data streams •  Apply rule logic, select, aggregate •  Select action on elements in stream Enterprise Applications, During … •  customer call or chat: recommendations to improve service •  card transaction: offer credit increase •  web application: pre-approval •  web transaction: recommend other product(s) •  MOOC: customize training speed for the student (Custom Java Model)
  • 15. © 2015 ligaDATA, Inc. All Rights Reserved. 15 ligaDATA Real Time Computing OSS Technology Stack Integration with Kamanja Kamanja (PMML/Java/Scala Consumer) High level languages / abstractions Compute Fabric Cloud, EC2 Internal Cloud Security Kerberos Real Time Streaming Kafka, MQ Spark* ligaDATA Data Store HBase, Cassandra, InfluxDB HDFS (Create adaptors to integrate others) Resource Management Zookeeper, Yarn*, Mesos* High Level Languages / Abstractions MLlib* (PMML Producers)
  • 16. © 2015 ligaDATA, Inc. All Rights Reserved. 16 ligaDATA Real Time Open Source Systems (OSS) Kamanja and Spark are good Compliments Clarify with a feature list, Use case to work together
  • 17. 17 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Higher Requirements in 
 Financial Services or Health Care Compared to social media or web apps © 2015 ligaDATA, Inc. All Rights Reserved. CONFIDENTIAL 17 Legal Compliance to meet exacting technical standards •  Losing (or duplicating) a bank transaction •  Losing a medical record •  Executives or employees can GO TO JAIL What is different about these industries? •  Regulatory requirements requires 100% data protection •  Security •  Auditability •  Lineage •  ZERO data loss ligaDATA
  • 18. 18 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA •  Migrated from mass generalized communications to real time personalized alerts •  Increased messaging effectiveness of 400% lift in conversion to digital •  Full integration with Mainframe •  Leverage streaming transaction and customer account data Business Objectives •  Reduce operating cost of Calling Centers •  Increase customer adoption of digital channels •  Interact with Customer at point of transaction IT Objectives •  Implement cost effective and scalable platform •  Satisfy financial services security and 
 compliance req.’s •  Integrate with existing core systems Driving Digital Adoption: the Bank’s Call Center © 2015 ligaDATA, Inc. All Rights Reserved. CONFIDENTIAL 18 Results ligaDATA
  • 19. © 2015 ligaDATA, Inc. All Rights Reserved. 19 ligaDATA Medical Company use of Kamanja Lines of Business Run Medco models supplying client's intelligence based upon model findings 
 (using multi tenant deployment when appropriate) Run Customer models 
 on Medco hardware
 (on Medco owned 
 customer private net) Consult/partner with Medco customers providing software solutions to be run on Customer net
  • 20. © 2015 ligaDATA, Inc. All Rights Reserved. 20 ligaDATA •  Clinicians (knowledge experts) develop heuristic based rule set models •  The initial model was COPD (Chronic Obstructive Pulmonary Disease) risk assessment •  Models are expressed with a Domain Specific Language (DSL) they developed •  DSL models are transformed to PMML for Kamanja •  Models consume current + prior related messages over “look back period” Save the “assertions” of a patient in the database (beyond standard PMML) •  Medco plans to integrate the DSL with their ontology data modeling effort •  Goal is to generate new models as their “medical world” ontology evolves Medical Company use of Kamanja
  • 21. © 2015 ligaDATA, Inc. All Rights Reserved. 21 ligaDATA DECISION GATEWAYS Representative Kamanja 
 Solution Architecture MAINFRAME DB2 SERVER INTEGRATION CDC Kafka Inbound Queue Kamanja Security Management Error Management Metadata Service/Cache Storage Service/Cache Message Construction Decision 
 Engine Output 
 Handler Transform Compute MakeObj Parallel DAG Executor DAG Optimizer DAG Generator Change Listner Output Generator Output Disctributor DBs Apps Notification Engine DATA SOURCES DW Customer Preferences Kafka Outbound Queue HBase - History HDFS – Long term storage Zookeeper – Resource Management ligaDATA
  • 22. © 2015 ligaDATA, Inc. All Rights Reserved. 22 ligaDATA Performance
 Characteristics © 2015 ligaDATA, Inc. All Rights Reserved. CONFIDENTIAL 22 Performance •  Throughput of million messages/second •  Uses commodity hardware Scalability •  Linear scalability vertically and horizontally •  Data partitioning support •  Runtime multi-model optimizations to supports thousands of models •  Consistent performance on hundreds of models and thousands of rules Built for IoT data volumes ligaDATA
  • 23. Data Transformer Data History (Cassandra, HBase) Metadata (expanded in next slides) Model Runtime Output Dispatcher Kafka Queue Input Adapter Output Adapter Next Process Kamanja Engine Kamanja Execution Flow on a Node Storage Adapter ligaDATA
  • 24. Metadata Functions (in PMML, Scala as User Defined Func (UDF)) Models (PMML Rule Set, i.e. fraud, attrition) Messages (from input queue, real time records) Containers (i.e. a record or lookup table to provide context, priors) Types (i.e. array of patients, Dr’s, types of containers) Concepts (PMML created fields, preprocessing, scores) Metadata API Elements ligaDATA
  • 25. Metadata Metadata API Scoring Engine Manager (within a model) Model Manager (activate, control a DAG of many models) PMML Producer, or application Admin App, used by DevOps Activate PMML Model or DAG Rest API Metadata API Subsystems Configuration (Cluster, Engine, Model Compilation) Kamanja Engine ligaDATA
  • 26. Model Runtime Kamanja Runtime Model Execution Transformer Data History / Metadata, (HBase, Cassandra, ..) Msg Storage Adapter(s) Metadata Instance Model Object Model Factory 1) Message rec by runtime engine 2) Metadata is checked To see what model is Interested in the message 3) Model object Is instantiated 5) Msgs committed to history 4) Model is executed on the Message obj 6) Output of the model is returned to the engine ligaDATA
  • 27. If the node that crashes is a Kamanja Slave node •  The Kamanja Leader Node rebalances over all Kamanja nodes •  Each message is processed EXACTLY ONCE •  A Bank needs to process a transaction ONCE AND ONLY ONCE •  Look at the state of every message through each step If the Kamanja Leader node goes down, •  The next node on the list becomes the Leader, then rebalance COMPARE TO: •  Spark and Storm would execute each message AT LEAST ONCE (but may process a message 2, 3 or 4 times…). •  The expectation is for the application to handle possible dup. What happens when a node goes down? ligaDATA
  • 28. © 2015 ligaDATA, Inc. All Rights Reserved. 28 ligaDATA Kamanja Integration Points •  We provide with an enterprise friendly license (No GPL License virus to infect the entire system) •  Adaptors: for any data flow Kafka, IBMs MQ, Hbase, Cassandra, InfluxDB, Zookeeper, Spark •  User Defined Functions: Provide a JAR file or Scala function •  Custom Java Model Can skip PMML, leverage Adaptors and UDFs Import generated Java code
  • 29. 29 © 2015 ligaDATA, Inc. All Rights Reserved. ligaDATA Deploy Predictive Models and Rules in 1/100th the time it takes today •  Kamanja is an open source, real time decisioning engine •  Hardened to meet strictest requirements of Financial Services, Healthcare and scalable to handle IoT •  Kamanja Enables Developers and Data Scientists to reduce time to deploy Rules and Predictive Models •  Kamanja integrates with your Big Data ecosystem
  • 30. © 2015 ligaDATA, Inc. All Rights Reserved. 30 ligaDATA Planned Kamanja Differentiation •  Model management, enable DevOps for models DevOps: automated testing, validation, deployment and rollback A/B testing to competitively roll out model update, scheduling •  Enterprise Level Security and Multiple-Tenancy Integration using Kerberos Role based security for model management Security at field level for models, “need to know/access” •  Multi tenancy partition internal groups in different tenancies Data isolation, resource management, SLA support •  Data Integration Built-in integrations for social data and third party data Can consume 100s of different event and document types
  • 31. © 2015 ligaDATA, Inc. All Rights Reserved. 31 ligaDATA Planned Kamanja Differentiation •  Performance and Scale Dynamic scaling – enlarge and shrink as needed, based on load Leap in performance by generating native code (vs. Java) Cost aware execution in cloud environment •  Extensive integrations with enterprise queue, storage and indexing MQ, HBase, Cassandra, RDBMS, Elastic Search, Zookeeper •  Domain specific libraries and model templates to speed up preprocessing, business logic and algorithms
  • 32. © 2015 ligaDATA, Inc. All Rights Reserved. Try out
 Kamanja © 2015 ligaDATA, Inc. All Rights Reserved. CONFIDENTIAL Download, Forums, Docs, Events http://Kamanja.org ligaDATA