Personal Information
Organization / Workplace
Greater New York City Area United States
Occupation
Data Scientist at Bank of America
Industry
Technology / Software / Internet
About
My interests are centered on developing tools to assist people in understanding large sets of data. My approach to is to first understand the organizational, social or psychological context of the problem and then draw on statistics and machine learning tools to develop novel solutions. This encourages a holistic view of the problem under consideration and exposes nuances in user behavior that affect the utility of the solution. The approach also centers on evaluating gains in terms of both the efficiency of the system and its effectiveness in terms of the system's ability to better support the individual(s) using the data.
- Presentations
- Documents
- Infographics
Do More. Do things that were previously impossible!
Tim O'Reilly
•
6 years ago
Capacity Planning with Free Tools
Adrian Cockcroft
•
15 years ago
Text categorization with Lucene and Solr
Tommaso Teofili
•
11 years ago
The Network structure of R packages on CRAN & BioConductor
Revolution Analytics
•
8 years ago
Linking data without common identifiers
Lars Marius Garshol
•
12 years ago
Data modeling for Elasticsearch
Florian Hopf
•
8 years ago
RHadoop, R meets Hadoop
Revolution Analytics
•
12 years ago
Simple Lean Agile KPIs
Yuval Yeret
•
13 years ago
Monitoring Spark Applications
Tzach Zohar
•
8 years ago
03 Modelling
Hadley Wickham
•
14 years ago
Building Data Pipelines for Solr with Apache NiFi
Bryan Bende
•
8 years ago
Scaling SolrCloud to a Large Number of Collections - Fifth Elephant 2014
Shalin Shekhar Mangar
•
9 years ago
New Security Features in Apache HBase 0.98: An Operator's Guide
HBaseCon
•
9 years ago
Apache NiFi- MiNiFi meetup Slides
Isheeta Sanghi
•
7 years ago
Building a near real time search engine & analytics for logs using solr
lucenerevolution
•
10 years ago
HBaseCon 2013: Streaming Data into Apache HBase using Apache Flume: Experience with High Speed Writes
Cloudera, Inc.
•
10 years ago
HBaseCon 2013: Realtime User Segmentation using Apache HBase -- Architectural Case Study
Cloudera, Inc.
•
10 years ago
HBaseCon 2015: S2Graph - A Large-scale Graph Database with HBase
HBaseCon
•
8 years ago
The State of HBase Replication
HBaseCon
•
9 years ago
Gobblin' Big Data With Ease @ QConSF 2014
Lin Qiao
•
9 years ago