One benefit of Apache Hadoop is the ability to power multiple workloads, across many different users and departments, all within a single, shared cluster. Hear how BT is doing this today and learn about new features in Cloudera Manager to provide better visibility for multi-tenant operations.
To set some context I thought I’d take a slide to give you the backstory to HaaS.
As a business BT has always invested in R&D, our UK research campus Adastral Park was opened 40 years ago.
Ever since we have invested in R&D, last year BT spent over £500 million.
In addition to our in-house research work we have technology scouts in silicon valley and researchers at MIT.
In 2010/11 our customer experience research team were working social media sentiment analysis when they came across Hadoop.
They were working on small data samples on laptops in R-studio.
Hadoops scale out architecture and schema on read made it easy for them to ingest millions of tweets so they built a research cluster.
Pretty soon they were using Hadoop to answer different business questions like
“What proportion of UK phone lines could support 50MB internet ?”
“What would the fault rates be if 80% of customers had 50MB broadband ? How many additional engineers might we need” ?
The business was catching onto big data spurred on by articles like the HBR Oct 2012 and the torrent of analyst waves and hype cycles.
They started to rely on the research hadoop capability as they found they could get answers to big ad-hoc questions much faster from research and hadoop than they could from traditional data warehouses that weren’t setup to quickly ingest new data sets and run statistical models.
Research now had a problem because they’re not set up to offer a production service with support and SLA.
They came to the Chief architects Office for help in getting Hadoop out of Research and into BAU data centres ASAP.
Within CAO we saw the lots of opportunities with Hadoop.
The most significant being the ability to build a single enterprise data hub that we could use to deliver data democratisation, i.e. giving the data back to the business owners
There were other short benefits such as the ability to re-platform old batch apps that needed to be kept running and provide low cost storage & archive.
-oOo-
Design Write Service description based on customer needs. MVP !
Sign Offs (Data centre Operations, Info Security)
Try it out, use Cloudera Manager to setup & monitor services
Reuse what the business already had
Order Gateway, Active Directory
Automate Provisioning
Market & Communicate