Personal Information
Organization / Workplace
San Francisco Bay Area United States
Occupation
Staff Site Reliability Engineer
Industry
Technology / Software / Internet
Website
http://tabledideas.com
About
I am a Staff Site Reliability Engineer at LinkedIn, tasked with keeping Zookeeper, Kafka, and Samza deployments fed and watered. The SRE is a jack-of-all-trades, seamlessly shifting from architecting our system infrastructure, to deploying and running those systems, to developing tools, documentation, and processes to make it all work a little bit better. With well over a petabyte of data flowing through these systems every day, this is no small feat.
My role as an SRE, and especially as a technical leader within the company, allows me to take a strategic view of what is happening with, and what is planned for, LinkedIn's systems. By understanding that we are all one team, working with t...
Tags
sre
apache
kafka
big data
linkedin
operations
monitoring
culture
tuning
performance
leadership
engineering
code yellow
blameless
toil
lisa
usenix
bosre
wayfair
metrics
open source
cloudrr2016
resilience
cloud computing
reliability
devops
java
infrastructure
architecture
data
See more
Presentations
(15)Personal Information
Organization / Workplace
San Francisco Bay Area United States
Occupation
Staff Site Reliability Engineer
Industry
Technology / Software / Internet
Website
http://tabledideas.com
About
I am a Staff Site Reliability Engineer at LinkedIn, tasked with keeping Zookeeper, Kafka, and Samza deployments fed and watered. The SRE is a jack-of-all-trades, seamlessly shifting from architecting our system infrastructure, to deploying and running those systems, to developing tools, documentation, and processes to make it all work a little bit better. With well over a petabyte of data flowing through these systems every day, this is no small feat.
My role as an SRE, and especially as a technical leader within the company, allows me to take a strategic view of what is happening with, and what is planned for, LinkedIn's systems. By understanding that we are all one team, working with t...
Tags
sre
apache
kafka
big data
linkedin
operations
monitoring
culture
tuning
performance
leadership
engineering
code yellow
blameless
toil
lisa
usenix
bosre
wayfair
metrics
open source
cloudrr2016
resilience
cloud computing
reliability
devops
java
infrastructure
architecture
data
See more