In my previous years’ talks at DevOps Enterprise Summit, I spoke about starting and scaling DevOps at Capital One, and about the importance of Open Source, Open Technology, and Innovation in DevOps.
This year, I will present Capital One’s journey of maturing in DevOps and Continuous Delivery. My presentation will cover our current areas of focus: Delivery Pipeline, Flow and Measurements. I will also share some of the problems we faced and what we did to solve them.
14. @TopoPal
Deliver High Quality Working Software Faster
• Across LOBs, Shared Services and 3rd Parties
• Tested end-to-end
• All dependencies are satisfied
• How fast? ASAP?
Pipeline must have 16 gates
Source code version control
Optimum branching strategy
Static analysis
> 80% Code coverage
Vulnerability scan
Open source scan
Artifact version control
Auto provision
Immutable servers
Integration testing
Performance testing
Build, Deploy, Testing automated for every commit
Automated Change Order
Zero downtime release
Feature Toggle
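To make the gate idea concrete, here is a minimal sketch of a gated pipeline runner. It is illustrative only, not Capital One’s actual tooling; the gate names, thresholds, and the `metrics` dictionary are hypothetical stand-ins for a few of the gates listed above.

```python
# Minimal sketch of a gated pipeline runner (illustrative, not the real tool).
# Each gate inspects build metrics and returns True/False; a commit is
# promotable only if every gate passes.

def static_analysis_gate(metrics):
    """Fail the build on any blocker-level static analysis finding."""
    return metrics["blocker_issues"] == 0

def coverage_gate(metrics):
    """Enforce the > 80% code coverage gate from the slide."""
    return metrics["coverage"] > 0.80

def vulnerability_gate(metrics):
    """Fail on any high-severity finding from the vulnerability scan."""
    return metrics["high_vulns"] == 0

GATES = [static_analysis_gate, coverage_gate, vulnerability_gate]

def run_pipeline(metrics):
    """Run every gate in order; return (passed, first failing gate or None)."""
    for gate in GATES:
        if not gate(metrics):
            return False, gate.__name__
    return True, None

passed, failed_gate = run_pipeline(
    {"coverage": 0.85, "blocker_issues": 0, "high_vulns": 1}
)
```

The point of the structure is that adding a sixteenth gate is just adding a function to the list; the runner itself never changes.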
Risks are real
• Intentional damage
• Unintentional damage
• Untested code in production
But….
There is a better way
Hypothesis
• DevOpsSec & CI/CD provide better controls
• A model with ~30 practices can satisfy audit and compliance
• If everything is source code, no one needs access to production
• For emergency, “Break Glass”
Result
Production Release: Once / sprint → 1+ / day
# of Applications with Release Automation: 20+
Max. # of Releases in 1 day for 1 Application: 34
With “Segregation of Duties”
Good Morning everyone.
My name is Tapabrata Pal, I go by Topo.
My twitter handle is @TopoPal.
And I am an engineer at Capital One.
This is my third year at this conference. And let me tell you that it’s an honor to be back on this stage for the third time and speak to you all. It’s like a homecoming for me.
Most of you know Capital One is a credit card company. We are one of the largest in the US with over 70 million accounts. Many know that we are also one of the nation’s largest banks. Fewer, however, realize that we are a founder-led, 20-year-old technology company. Our “youngest” competitor is 108 years old. In that sense, we are a startup in this industry.
A typical bank organization will largely procure third-party software for its internal and customer-facing operations. Over the past five years we have transformed ourselves into an organization that truly builds its own software and develops its own solutions.
That is a different DNA.
Today, we are hyper-focused on how we can get more productive, move quicker, get things to market faster, and constantly iterate.
• We build on the public cloud, leveraging continuous integration and delivery methods to deploy our products into production.
• We build using microservices architecture and RESTful APIs, using Open Source, and practicing DevOpsSec and Continuous Delivery.
I joined Capital One six years ago.
I started as an Enterprise Architect, and I have been involved with Capital One’s DevOps journey from the beginning. Many times I led key efforts around DevOpsSec adoption, scaling DevOps across the enterprise, and the Open Source governance that formalizes open source adoption. I led the creation of our Enterprise DevOpsSec Strategy and helped stand up our Shared Delivery Tools platform. That led to my move from Enterprise Architecture to the Shared Technology organization.
Currently I am the product manager of our Shared Continuous Delivery Tools Platform that offers typical DevOps tools for the enterprise as services.
In the meantime, a few of us at Capital One developed the Hygieia DevOps Dashboard and open sourced it. It’s the first open source product from Capital One. I am the community manager and one of the core contributors. I can’t tell you enough about my excitement around Hygieia. Since its launch in July 2015, it has become very popular. Google “DevOps dashboard” and the first non-ad hit you get is the Hygieia GitHub repo! Many large enterprises are either using it or testing it out. It won “Open Source Rookie of 2015”. If any of you are using it or thinking of it and need some help or have new feature ideas, send me a quick note – @TopoPal – or open a GitHub issue.
Overall, I am loving it. Let me tell you the best part of my job: I get to learn new things every day. This is especially true when you are in the middle of an awesome transformation and have been a part of it.
Our Agile and DevOps transformation over the last 5 years has been quite successful. At a high level, we have transformed ourselves from waterfall to agile, across the board; from manual build, code promotion, testing, and release to full automation. The exceptions are off-the-shelf products and prehistoric legacy products. Every new product created over the past 4 years follows agile and DevOpsSec principles. We have moved from vertical silos of Dev, Ops, QA, and Support to autonomous product-based teams. We are not fully there yet, but we are getting there.
The biggest of these three transformations is the fact that we went from a mostly outsourced company to a mostly in-sourced one. We are continuously hiring skilled engineers.
From vertical silos such as Dev Team, Ops Team, QA Team…. we now have Product Teams. Autonomous teams with everyone needed to develop a product.
As I said this is my third year on this stage.
In 2014, I shared with you our DevOpsSec Strategy, Initial successes of our automation efforts and also shared our success story around scaling DevOpsSec in an Enterprise Scale.
In 2015, last year, I shared our success stories around an Engineering Transformation – not just DevOps. An awesome transformation that I will always be proud of being involved in. I shared how Open Source, Open Technology, Innovation and Sharing changed Capital One culture drastically.
This time, I am going to share our learnings around DevOps maturity through measurement and continuous improvements.
Let me start with where we left off last time – our typical success story – before and after. This is the before-and-after for one of our biggest product lines. It has more than 250 engineers on the product team, which includes dev, qa, ops – everyone. A single GitHub repository with application code, test code, and infrastructure and configuration code. The application runs on public cloud infrastructure.
As you can imagine the result of automation and shift-left are quite apparent in these numbers. Builds every 15 minutes, automated testing, automated deployment to all the environments – it’s all good; and it really is. But this is not where we want to stop. In particular, we are not happy with our deployment frequency.
Let me put a disclaimer right here. For us, a deployment means real application code change. It does not include content changes, style changes, network changes, database changes, system resource changes and so on. Whether we should count those too or not is a different topic of discussion. All I am saying is that a deployment here represents a set of new application code, in whatever form, being installed in production. In other words, the deployment number here is a small subset of production changes that are going on.
2016, the year of “What’s in your pipeline”.
We have been asking our teams a very simple question “What’s in your pipeline”. We have been doing DevOpsSec for a while – for about 5 years now. Our engineers know what DevOps means – or not!
In my honest opinion, we need to stop defining DevOps. Instead of asking what DevOps is, we should be asking Why do we need DevOps.
In my point of view, the answer is very straightforward: the goal is to deliver high quality working software faster. Now, we all know what each of these words means. But “faster”? How fast? This is confusing in many ways… What is a good number? Why do we need to go that fast? We used to do 1 release per quarter… now we release every month – isn’t that fast enough?
To be honest, I cannot answer how fast is optimum or how fast is feasible. But there is scientific proof that faster is better. There is also evidence that frequent deployment is better. And so faster and more frequent is better – an indicator of a high-performing IT organization.
It came through loud and clear in the DevOps Survey. If you have not read it yet, do it tonight.
So, faster is better.
I kept thinking… is there a scientific proof? I know Nicole will tell me that the survey is scientific. And it is. I am thinking I have read this proof somewhere… years ago when I was a kid.
Let me digress here a little bit.
It was Bernoulli in the early 18th century.
Based on Bernoulli’s work, you can explain why, when the flow of an incompressible fluid is constricted, the fluid velocity increases, the dynamic fluid pressure decreases, and the energy remains constant.
In essence, science proved long ago that smaller chunks of change delivered in a continuous flow through a pipeline increase velocity and create less pressure…
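For the record, the physics behind the analogy is Bernoulli’s equation for steady, incompressible flow along a streamline, together with the continuity equation:

```latex
% Bernoulli's equation: pressure + dynamic pressure + hydrostatic term is constant
p + \tfrac{1}{2}\rho v^{2} + \rho g h = \text{constant}

% Continuity: for incompressible flow, volumetric flow rate is conserved
A_{1} v_{1} = A_{2} v_{2}
```

When the cross-section $A$ shrinks, continuity forces the velocity $v$ up, and Bernoulli’s equation then forces the pressure $p$ down – smaller pipe, faster flow, less pressure.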
In order to increase delivery speed, we started looking at pipelines. Continuous Delivery pipelines. With all the automation, shift-left, and practices that we developed over the years, we now wanted to build pipelines that magically take a commit from the commit stage to production with zero touch.
Easier said than done. We looked at some sample pipelines that people started creating – both inside at Capital One and outside…
We found some rather interesting ones…
Like some pipelines that never end.
Some pipelines are so complex that you don’t know where they start and where they end.
And then, this is the most popular type of pipelines – you need an army to support the pipeline.
I have also seen pipelines that just build and deploy – but as far as I am concerned, a pipeline that does not have security embedded and does not have test automation is not a pipeline. Period. I really don’t care about the rest of the things the pipeline does.
So, to summarize, we had these tasks for ourselves
1. Design and implement a pipeline
2. Measure and identify bottlenecks
3. Fix bottlenecks
Let me share what we did on each of these areas and then I will share with you the outcome.
I call them the 10 commandments – in hexadecimal. We as an enterprise have come together on these criteria to measure our DevOpsSec success. Every product team is tracked on these – all the way up to the CIO, who sees the progress at the line-of-business level.
We spent a lot of time on discussing what to measure, how to measure and how to interpret what we measured. We attacked it from different angles.
Around this time last year, Jez Humble showed me the survey that he and Nicole Forsgren were working on. It looked very promising and we agreed to participate.
We did some proof of concepts, went back and forth on many things; I think we had some good influence on two aspects of the survey: Security and Test Data Management.
We went to the executive leaders, got a green signal from our CIO, and then ran the survey in many teams across the enterprise. It produced some interesting and encouraging numbers. First, and most importantly, we are moving in the right direction and we need to keep doing what we have been doing… in fact, some of our new large product initiatives are on par with the top industry performers. The survey also pointed out a few areas where we need to double down.
We also used our own Hygieia dashboard and the newest features in Hygieia that we developed to improve “speed”. We spent a lot of time brainstorming this topic. Speed of what? What is flowing through the pipeline? Business value? Features? Intent? Code? We could not come up with a foolproof method to track the speed of delivery of business value and features.
What we have is a way to track each code commit through delivery lifecycle stages – from Commit stage to Production Deployed stage. The beauty of this is that we can now see the “wait time” between two stages. Why is this important?
1. In our opinion, you can speed up by reducing these wait times. Why are commits waiting X hours before being deployed to the Dev environment? Maybe a lack of automation? Maybe the infrastructure is unstable?
2. The team can decide which “wait time” needs to be reduced to speed up the pipeline. You do not want the team to spend time reducing a 10-minute build cycle to 5 minutes when the test cases take a few hours to finish.
3. The teams decide what to do and they will do it. Believe me. You just need to make it transparent. In some cases, you need a bigger effort when it comes to processes.
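The wait-time idea can be sketched in a few lines. This is illustrative only – it is not Hygieia’s actual data model – and the stage names and timestamps are hypothetical.

```python
# Illustrative sketch of per-commit wait-time tracking (not Hygieia's actual
# data model): record a timestamp when a commit enters each lifecycle stage,
# then compute the gap between consecutive stages.
from datetime import datetime

# Hypothetical stage timestamps for a single commit.
stages = [
    ("commit",      datetime(2016, 5, 2, 9, 0)),
    ("build",       datetime(2016, 5, 2, 9, 15)),
    ("deploy_dev",  datetime(2016, 5, 2, 13, 15)),  # a 4-hour wait: worth a look
    ("deploy_prod", datetime(2016, 5, 2, 14, 15)),
]

def wait_times(stages):
    """Return {(from_stage, to_stage): hours waited} for consecutive stages."""
    return {
        (a[0], b[0]): (b[1] - a[1]).total_seconds() / 3600
        for a, b in zip(stages, stages[1:])
    }

waits = wait_times(stages)
bottleneck = max(waits, key=waits.get)  # the transition the team should attack first
```

Once every commit carries timestamps like these, the biggest gap between stages is visible to everyone – which is exactly the transparency that lets the team decide what to fix.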
Both measurements showed us a few things that we needed to address at an enterprise level.
First are the process and technical bottlenecks to getting to production. By decisively selecting the public cloud, we had our arms around the technical bottleneck. Technically, we can now deploy to production by clicking a button. But who knew clicking a button is so difficult?
The core of this bottleneck is what all big enterprises face… CAB! The Change Approval Board. Before going to CAB, you need pre-approvals before getting approvals, and then change management, and review of change management. I am sure it is much more complicated than it sounds. This year we worked very closely with our Audit, Compliance and Risk offices to take a deep dive into our processes and how we can do a better job. We have developed a hypothesis and we are testing it out. Let me share, at a high level, what that hypothesis is… but before that, let me state that we started from a set of common beliefs.
We proved via empirical data that trunk-based development is better for Continuous Delivery, and this is what we want our teams to follow. But it is hard to enforce this on hundreds of product teams. So what we came up with is this simple formula:
If the team’s goal is to deploy 3 times a day, CI takes 30 minutes, CD takes 3 hours, and the production deployment takes 1 hour, then the team must merge code to the release branch within about 3 hours of the original commit.
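One way to read that arithmetic – an assumption on my part, not the talk’s exact model – is that within an 8-hour working day, the last merge must still leave enough time for CI, CD, and the production deployment to finish:

```python
# Sketch of the merge-window arithmetic, assuming an 8-hour working day.
ci = 0.5           # hours for continuous integration
cd = 3.0           # hours for the continuous delivery stages
prod_deploy = 1.0  # hours for the production deployment
workday = 8.0      # hours (an assumed working day)

pipeline = ci + cd + prod_deploy   # 4.5 hours from merge to production
merge_window = workday - pipeline  # 3.5 -> "about 3 hours" to merge a commit
```

The useful property of stating the goal this way is that it turns “merge to trunk often” from a policy into simple arithmetic each team can compute from its own pipeline timings.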
These risks are real. You cannot deny that. But… there is a better way to mitigate them.
Remarkable Results.
Production Release (code releases only): from 1 / sprint to 10
Preparing to test this hypothesis was by no means easy. We ended up developing our own release automation tool, which has an onboarding process that ensures a team follows the required practices, creates a change order automatically, and approves it.
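To give a feel for what “creates a change order automatically and approves it” might look like, here is a hypothetical sketch. It is not the actual tool; the practice names, record fields, and auto-approval rule are all invented for illustration.

```python
# Hypothetical sketch of an auto-approved change order (not the actual tool):
# the release automation builds a change order from pipeline evidence and
# auto-approves it only when every required practice has passing evidence.

REQUIRED_PRACTICES = ["static_analysis", "code_coverage", "vulnerability_scan"]

def create_change_order(app, version, evidence):
    """Build a change order record; auto-approve when all evidence passes."""
    approved = all(evidence.get(p) == "pass" for p in REQUIRED_PRACTICES)
    return {
        "app": app,
        "version": version,
        "evidence": evidence,
        "status": "approved" if approved else "needs_review",
    }

order = create_change_order(
    "demo-app", "1.4.2",
    {"static_analysis": "pass", "code_coverage": "pass",
     "vulnerability_scan": "pass"},
)
```

The design point is that the approval decision is derived from recorded pipeline evidence rather than a meeting, which is what lets the change order satisfy segregation-of-duties controls without a manual CAB.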
We also forked the LGTM GitHub review tool and enhanced it with many configurable rules that helped us in this work. We wanted to give it back to LGTM, but they proposed that we host it as a new product.
In the very near future, we are going to open source the modified LGTM as a new tool.
We are also going to open source those 30 practices as a model.
And in case you haven’t noticed in DevOps news, there is another DevOpsSec tool that we open sourced this year – it’s called Cloud Custodian.
I will end by sharing a picture of my favorite T-Shirt.
“All of Chuck Norris’s Change Controls are FullCycle… and they are always APPROVED”.