Personal Information
Organization / Workplace
San Francisco Bay Area, ca United States
About
There is nothing like the satisfaction of seeing a huge cluster of servers churning through massive amounts of data quickly and efficiently. As a Big Data performance engineer, my playground is massive amounts of memory, CPU, data disks (including NVMEs), fast network connectivity, open source Hadoop stack. I love Spark and Solr. I measure analytic workloads of all kind: Tesla trip data, crime data, tweets, credit card transactions, retail data that is larger than Amazon.com!
Tags
spark
troubleshooting
tpcds
sql
thrift server
ibm
iop
tuning
performance
monitoring
See more