Personal Information
Organization / Workplace
Sebastopol, CA United States
Occupation
Evil Mad Scientist
Industry
Technology / Software / Internet
Website
derwen.ai/paco
About
Known as a "player/coach", with core expertise in data science, natural language processing, machine learning, cloud computing; 35+ years tech industry experience, ranging from Bell Labs to early-stage start-ups. Co-chair Rev. Advisor for Amplify Partners, Deep Learning Analytics, Primer, Data Spartan, Recognai. Recent roles: Director, Learning Group @ O'Reilly Media; Director, Community Evangelism @ Databricks and Apache Spark. Cited in 2015 as one of the Top 30 People in Big Data and Analytics by Innovation Enterprise.
Tags
big data
data science
machine learning
hadoop
cascading
spark
mesos
scalding
cascalog
nlp
python
jupyter
scala
use cases
enterprise data workflows
ai
textrank
streaming
twitter
cluster computing
open data
pmml
aws
cloud computing
text analytics
r
active learning
graph algorithms
approximation algorithms
case studies
ipython notebook
functional programming
management
human-in-the-loop
learning
docker
mesosphere
clojure
o'reilly media
publishing
real-time analytics
sql
knime
advanced math
distributed systems
google
predictive modeling
java
disambiguation
ontology
open source
scikit-learn
chicago
history
apache hadoop
analytics
networkx
datasketch
spacy
deep learning
content discovery
media
video
computable content
inverted classroom
education
graphx
community
certification
mooc
graph queries
abstract algebra
datacenter computing
marathon
linux
low latency
graph theory
airbnb
linux containers
isolation
borg
mathematics
statistics
portland
sas
ansi sql
palo alto
mapreduce
algorithms
enterprise
redis
gephi
business strategy
social media
knowledge graph
search
learning experiences
nike
nginx
kaltura
best practices
literate programming
summarization
standards
pfa
accountability
governance
avro
recommender systems
social context
kubernetes
learning curve
continuous learning
computational thinking
philosophy
parquet
thebe
json
oscon
notebooks
brazil
sao paulo
qcon
iot
paco nathan
pagerank
probabilistic data structures
system architecture
business
stanford
functio
cluster scheduling
quasar
probabilistic programming
chronos
cgroups
omega
mbrace
augustus
julia
mlbase
summingbird
titan
genetic programming
metascale
sears
chug
virtualization
university of chicago
ensembles
kdd
hadoop summit
windows azure
texas
pattern language
predictive models
optimization
tdd
optiq
application layer
enterprise architecture
splunk
bigdata
tf-idf
data analysis
pentaho
imvu
continuous deployment
emr
enron
infochimps
datameer
See more
- Presentations
- Documents
- Infographics
Data Science with Hadoop - A primer
Ofer Mendelevitch
•
10 years ago
PRISM seed-stage Investor Deck
David Coallier
•
10 years ago
A dynamical system for PageRank with time-dependent teleportation
David Gleich
•
10 years ago
Agile analytics applications on hadoop
Russell Jurney
•
10 years ago
Skills, Reputation, and Search
Peter Skomoroch
•
10 years ago
Sparse matrix computations in MapReduce
David Gleich
•
10 years ago
Functional programming for optimization problems in Big Data
Paco Nathan
•
11 years ago
Visualize Big Graph Data
Mathieu Bastian
•
11 years ago
Data Day Texas 2013
Matthias Broecheler
•
11 years ago
Why clojure
Thomas Goossens
•
11 years ago
Incorporating Regularity into Models of Noncontractual Customer-Firm Relationships
MOSTLY AI
•
14 years ago
Netflix and Open Source
Adrian Cockcroft
•
11 years ago
Microlearning: a strategy for ongoing professional development
eLearning Papers
•
13 years ago
LinkedIn Data Products
Vitaly Gordon
•
11 years ago
Drill / SQL / Optiq
Julian Hyde
•
11 years ago
Scalding
Mario Pastorelli
•
11 years ago
Scalable and Flexible Machine Learning With Scala @ LinkedIn
Vitaly Gordon
•
11 years ago
Enterprise Data Workflows with Cascading
Paco Nathan
•
11 years ago
Optiq: a SQL front-end for everything
Julian Hyde
•
11 years ago
Ember.js for SFHTML5
Anthony Bull
•
11 years ago