Posted: 25 May 2023 10 min. read

A new framework for reducing DevOps complexity

A primer on the Deloitte DevOps Cloud Platform

A blog post by Varun Shah, Lead, Consulting Deloitte Canada;   Jay McDonald, Managing Director, Deloitte Consulting LLP; Eddy Krumholz, Technology Fellow, Deloitte Consulting LLP

 

The Deloitte DevOps Cloud Platform (DCP) is a unified, multi-cloud platform for software delivery, automation, operations, and management that combines the best of DevOps tools and streamlines configurations, allowing teams to focus on core software design and development while leaving the rest up to DCP.

Our experience implementing DCP has shown us that, in addition to optimizing engineering bandwidth and increasing time spent on high-value activities by as much as 65%, DCP can help companies reduce spend by as much as 30% and can help them automate as much as 60% faster than with traditional DevSecOps. With DCP, teams can potentially shorten the tool enablement and adoption process from three to six months to three to five days and track feature delivery progress and KPIs in real time, directly addressing performance visibility challenges organizations often face.

DCP was created from years of research and focus to reduce the DevOps complexity of too many tools, too many custom configurations, and not enough visibility into performance across software and infrastructure life cycles end to end. The Deloitte DevOps Cloud Platform features continuous security, compliance, logging, and monitoring—all behind a single pane of glass. With DCP, managing infrastructure and application deployments to help companies rapidly spin up an industry-grade pipeline and onboard applications for continuous delivery is easier.

We explore DCP’s application and unique value in five key areas:

  1. Risk and security
  2. Compliance
  3. Observability
  4. Site reliability engineering (SRE)
  5. Governance
[Click on this image to expand]

Risk and security

Problem statement

As cybersecurity threats continue to increase, organizations are facing immense enterprise security challenges. DevOps requires the collaboration of two different teams: development and operations. As these teams traditionally work in silos, unifying their processes can be challenging. And without collaboration and information sharing, organizations can inadvertently create security vulnerabilities that can provide a path for attackers to compromise sensitive data and disrupt critical business operations.

DCP value

Given the current landscape of cybersecurity across organizations today, DevSecOps—which is the next phase of DevOps that integrates security practices into every phase of DevOps—has been identified as a key way to securely move forward with DevOps. With DCP, DevSecOps can be facilitated through automated and configurable controls that allow for adherence to organization security standards and policies at every step in the release process for every application, and business unit, rolled up all the way to the organization level. Additionally, DCP can aggregate several critical security tests, integrating visibility with risk and security elements for every product and feature across the organization.

Compliance

Problem statement

Over the years, organizations have invested heavily in continuous integration/continuous delivery (CI/CD) pipelines to automate their build and deployment processes. However, audit and security requirements tend to complicate these automated processes, eventually resulting in heavy manual intervention to meet critical regulatory and compliance standards. 

DCP value

Deloitte built DCP to address this specific industry challenge, and through thoughtful feedback and solutioning, the compliance process has been automated to improve the software delivery process. The Deloitte DevOps Cloud Platform helps configure compliance rules in the form of workflows allowing for compliance-specific jobs to execute that track the results so every production deployment ensures the application is compliant with the defined standards. 

Observability

Problem statement

To meet ever-growing business demands, IT service providers and enterprises need to aggressively manage business service performance, improve stability, and predict and prevent performance degradation and outages. This requires closely observing metrics and datasets and monitoring service performance vulnerabilities, particularly during upgrades and code launches. System observability plays a critical role for enhanced end-to-end visibility.

Historically, the challenge in implementing observability in IT has been the volume, variety, and velocity of data, combined with having the computational power and domain knowledge needed to analyze and make sense of it in real time. 

DCP value

Deloitte’s observability—powered by Dynatrace—helps companies design, implement, and operate observability through orchestration and automation into their product development, operations, and business processes. DCP provides:

  • Observability delivery to enable tracing, logs, and metrics across the entire environment.
  • Advanced practices such as monitoring-as-code (monoco) to shift observability left.
  • Enhanced site reliability engineering capabilities via automated root-cause analysis and self-healing automation.

The DCP’s observability integration and orchestration, coupled with strategic delivery and AIOps, provides holistic end-to-end business services insights and helps drive autonomous remediation. 

Site reliability engineering

Problem statement

The main goal of site reliability engineering (SRE) is to mitigate operational challenges like production failures, infrastructure issues, and security monitoring. The focus of the SRE team is to maintain the reliability of applications and servers and automate tasks to reduce manual work. Standard expectations for an enterprise SRE require these capabilities:

  • Monitoring and alerting: Selecting the right tools for monitoring and configuring the correct metrics to monitor the infrastructure and applications.
  • Reliability: Maintaining the reliability of infrastructure and applications is another challenge that the team needs to overcome to meet service level agreement (SLA) expectations. These expectations are largely observed through service level objectives (SLO), service level indicators (SLI), and error budgets.
  • Incident management: Detecting incidents and performing root-cause analysis (RCA). Maintaining records of incidents and defining policies and procedures for managing incidents inside the organization, enabling quick resolution without violating SLAs.
  • Automating manual tasks: Eliminating time-consuming and repetitive operations. Site reliability engineering uses time more efficiently by automating repetitive manual tasks. It also reduces toil (waste) in the process and automates repetitive operations and releases.
  • Debugging and troubleshooting skills: Leveraging site reliability engineers as “detectives” in order to figure out why things don’t work as expected. When working with distributed computing systems, observability functionality to detect, measure, trace, and inspect is a critical capability to troubleshoot in order to resolve and optimize.

DCP value

The DCP has easy-to-read dashboarding and reporting capabilities across distributed computing and data centers that can be easily customized. The platform allows companies to implement critical processes of the software delivery life cycle such as secrets management, sophisticated stage gated workflows for proper CI/CD, enabling DevSecOps, automation pipelines, and repeatable “blueprints” for tool configurations, jobs, and rules.

These workflows include user tasks and approved tasks in the pipelines, which function as automated compliance gatekeepers. The DCP’s configurable stage gates enable guardrails to set specific pass-or-fail conditions on every task. Through these powerful workflows, companies can also rapidly initiate assessments and view the reports in near-real time. With DCP, organizations can increase time to value, reduce major security risks, and help tackle application and infrastructure issues.

Governance 

Problem statement

DevOps teams are often burdened by change management in large organizations due to the quantity of environments and rapidly evolving application architecture, creating friction between DevOps and developers. Gaining access to infrastructure and leveraging infrastructure-as-code platforms can be insufficient in offering the necessary collaboration, governance, and cost controls to run a modern software organization. An infrastructure control plane that supports these needs throughout the life cycle of the application environment, from development to production, is crucial.

DCP value

The Deloitte DevOps Cloud Platform is a DevSecOps maturity enablement platform that can help automatically enforce governance policies by translating business requirements into technical rules. It has automated and accelerated traditional governance processes, enabling organizations to improve and mature their DevSecOps processes, understand where risks and improvements can be made without wasted time and energy, and unlock enormous productivity and business value.

A final word

With cloud adoption growing, and companies seeking higher value from their cloud investments, it’s critical to have a solution to helps organizations combine their cloud environment into a single platform for software delivery, automation, operations, and management. The DCP combines leading DevOps practices into a simplified, aggregated view across a complex ecosystem of tools and teams, enabling product teams to focus on their core mission to build and release software features that produce better business outcomes and value-add to cloud investments.

Interested in exploring more on cloud?

Get in touch

Jason McDonald

Jason McDonald

Managing Director | Deloitte Consulting LLP

Jay is a Managing Director in Deloitte Consulting LLP, spearheading the Deloitte Engineering portfolio as the leader of the Developer Experience practice in Core Business Operations. He has 26 years of IT engineering transformations, scaling teams, mentoring and coaching to help attain elite performance. Since joining Deloitte in 2018, Jay has been instrumental in establishing Deloitte's engineering capabilities, leading offerings in DevOps, SRE, Agile, Observability and Artificial Intelligence (AI). Prior to Deloitte, Jay was President of a venture capital-backed Cloud DevOps Engineering consultancy and a senior leader at AWS. Under his stewardship, Deloitte Engineering has flourished, with Jay serving as both the visionary, architect and ambassador of Deloitte Engineering Advantage storefront. His strategic management has led to sustained industry leading growth. Jay also leads strategic alliances with DevOps, SRE and Observability software platforms.