Personal Information
Organization / Workplace
Sebastopol, CA United States
Occupation
Evil Mad Scientist
Industry
Technology / Software / Internet
Website
derwen.ai/paco
About
Known as a "player/coach", with core expertise in data science, natural language processing, machine learning, cloud computing; 35+ years tech industry experience, ranging from Bell Labs to early-stage start-ups. Co-chair Rev. Advisor for Amplify Partners, Deep Learning Analytics, Primer, Data Spartan, Recognai. Recent roles: Director, Learning Group @ O'Reilly Media; Director, Community Evangelism @ Databricks and Apache Spark. Cited in 2015 as one of the Top 30 People in Big Data and Analytics by Innovation Enterprise.
Tags
big data
data science
machine learning
hadoop
cascading
spark
mesos
scalding
cascalog
nlp
python
jupyter
scala
use cases
enterprise data workflows
ai
textrank
streaming
twitter
cluster computing
open data
pmml
aws
cloud computing
text analytics
r
active learning
graph algorithms
approximation algorithms
case studies
ipython notebook
functional programming
management
human-in-the-loop
learning
docker
mesosphere
clojure
o'reilly media
publishing
real-time analytics
sql
knime
advanced math
distributed systems
google
predictive modeling
java
disambiguation
ontology
open source
scikit-learn
chicago
history
apache hadoop
analytics
networkx
datasketch
spacy
deep learning
content discovery
media
video
computable content
inverted classroom
education
graphx
community
certification
mooc
graph queries
abstract algebra
datacenter computing
marathon
linux
low latency
graph theory
airbnb
linux containers
isolation
borg
mathematics
statistics
portland
sas
ansi sql
palo alto
mapreduce
algorithms
enterprise
redis
gephi
business strategy
social media
knowledge graph
search
learning experiences
nike
nginx
kaltura
best practices
literate programming
summarization
standards
pfa
accountability
governance
avro
recommender systems
social context
kubernetes
learning curve
continuous learning
computational thinking
philosophy
parquet
thebe
json
oscon
notebooks
brazil
sao paulo
qcon
iot
paco nathan
pagerank
probabilistic data structures
system architecture
business
stanford
functio
cluster scheduling
quasar
probabilistic programming
chronos
cgroups
omega
mbrace
augustus
julia
mlbase
summingbird
titan
genetic programming
metascale
sears
chug
virtualization
university of chicago
ensembles
kdd
hadoop summit
windows azure
texas
pattern language
predictive models
optimization
tdd
optiq
application layer
enterprise architecture
splunk
bigdata
tf-idf
data analysis
pentaho
imvu
continuous deployment
emr
enron
infochimps
datameer
See more
- Presentations
- Documents
- Infographics
Possible Visions for Mahout 1.0
Ted Dunning
•
10 years ago
Whitepaper: Agricultural Systems + Data Outlook 2Q14
The Data Guild
•
10 years ago
Data Science Folk Knowledge
Krishna Sankar
•
10 years ago
Introduction to Apache Mesos
tomasbart
•
10 years ago
Data Wrangling For Kaggle Data Science Competitions
Krishna Sankar
•
10 years ago
Reactive Reatime Big Data with Open Source Lambda Architecture - TechCampVN 2014
Trieu Nguyen
•
10 years ago
Got Chaos? Extracting Business Intelligence from Email with Natural Language Processing and Dynamic Graph Analysis
Digital Reasoning
•
10 years ago
Micro Servers in Big Data
Aater Suleman
•
11 years ago
Fast matrix primitives for ranking, link-prediction and more
David Gleich
•
10 years ago
Customer Behaviour Analytics: Billions of Events to one Customer-Product Property Graph
Paul Lam
•
10 years ago
Adversarial Analytics - 2013 Strata & Hadoop World Talk
Robert Grossman
•
10 years ago
Evolution of The Twitter Stack
Chris Aniszczyk
•
10 years ago
Semantically coherent functional linear data structures
Jack Fox
•
10 years ago
Mesos
Anis Nasir
•
11 years ago
SQL Now! How Optiq brings the best of SQL to NoSQL data.
Julian Hyde
•
10 years ago
Individual movements and geographical data mining. Clustering algorithms for highlighting hotspots in personal navigation routes.
Beniamino Murgante
•
12 years ago
Personalized PageRank based community detection
David Gleich
•
10 years ago
Why Docker
dotCloud
•
10 years ago
Hadoop on-mesos
Henry Cai 蔡明航
•
10 years ago
Functional linear data structures in f#
Jack Fox
•
10 years ago