Personal Information
Organization / Workplace
Greater New York City Area United States
Occupation
Data Scientist at Bank of America
Industry
Technology / Software / Internet
About
My interests center on developing tools that help people understand large data sets. My approach is to first understand the organizational, social, or psychological context of the problem and then draw on statistics and machine learning to develop novel solutions. This encourages a holistic view of the problem and exposes nuances in user behavior that affect the utility of the solution. The approach also evaluates gains in terms of both the system's efficiency and its effectiveness, that is, how well the system supports the individuals using the data.
How Google Works
Eric Schmidt • 9 years ago
Building a real time, solr-powered recommendation engine
Trey Grainger • 11 years ago
The Impala Cookbook
Cloudera, Inc. • 9 years ago
10 Lessons Learned from Building Machine Learning Systems
Xavier Amatriain • 9 years ago
What is in a Lucene index?
lucenerevolution • 10 years ago
HBaseCon 2012 | HBase Schema Design - Ian Varley, Salesforce
Cloudera, Inc. • 11 years ago
Espresso: LinkedIn's Distributed Data Serving Platform (Paper)
Amy W. Tang • 10 years ago
Netflix Data Pipeline With Kafka
Allen (Xiaozhong) Wang • 9 years ago
Site Search Analytics in a Nutshell
Louis Rosenfeld • 13 years ago
Apache Hadoop and HBase
Cloudera, Inc. • 13 years ago
01 Intro
Hadley Wickham • 14 years ago
Data Workflows for Machine Learning - Seattle DAML
Paco Nathan • 10 years ago
Simple REST-APIs with Dropwizard and Swagger
LeanIX GmbH • 8 years ago
Java application monitoring with Dropwizard Metrics and graphite
Roberto Franchini • 9 years ago
High Performance Solr
Shalin Shekhar Mangar • 9 years ago
Near-realtime analytics with Kafka and HBase
dave_revell • 11 years ago
A Survey of HBase Application Archetypes
HBaseCon • 9 years ago
02 Ddply
Hadley Wickham • 14 years ago
Benchmarking Solr Performance at Scale
thelabdude • 9 years ago
Implementing Conceptual Search in Solr using LSA and Word2Vec: Presented by Simon Hughes, Dice.com
Lucidworks • 8 years ago