Skip to content

The Multi-team Software Delivery Assessment is a simple, easy-to-execute approach to assessing software delivery across many different teams within an organisation.


Notifications You must be signed in to change notification settings


Repository files navigation

Multi-team Software Delivery Assessment

The Multi-team Software Delivery Assessment is a simple, easy-to-execute approach to assessing software delivery across many different teams within an organisation. Devised by Matthew Skelton of Conflux, it is used as a key part of the Software Delivery Assessment at Conflux, but can be used freely by anyone (subject to the CC BY-SA license below).

The assessment uses and builds on the well-known and proven Spotify Squad Health Check model.

Translations: Japanese (ja ūüáĮūüáĶ)

The assessment covers ten dimensions in total:

  1. Team Health
  2. Deployment
  3. Flow
  4. Continuous Delivery
  5. Operability
  6. Testing and Testability
  7. Reliability and SRE
  8. On-call
  9. Security and Securability
  10. Team Topologies team interactions

These ten dimensions cover key aspects of modern software delivery in a form that enables teams to self-assess their strengths and practices.

ūüöÄ Overview: see slides 32-38 in Continuous Delivery at scale

ūüÉŹ Card deck: make the assessment fun and interactive by using the Software Delivery Assessment printed card deck from Agile Stationery. Developed in collaboration with Conflux, the card deck has Tired and Inspired indicators for each of the assessment criteria, together with emoji cards for quick-fire voting from team members. The card deck works for remote assessments too!

Assessment cards from Agile Stationery Five emoji voting cards

Copyright © 2018-2021 Conflux Digital Ltd

Licenced under CC BY-SA 4.0 CC BY-SA 4.0


Purpose of the assessments

The aim of the assessments is to promote and sustain a positive working environment for building and running software systems where:

  • Changes to software are built, tested, and deployed to Production rapidly and safely using Continuous Delivery practices
  • Processes and practices are optimised for the flow of change towards Production
  • Software is designed and built to enable independent, decoupled deployments for separate families of systems
  • Software is designed and built in a way that addresses operability, testability, releasability, security, and reliability
  • Problems in Production are always detected by teams before customers and users notice
  • Responsibility and accountability for software changes lead to empowerment and ownership
  • Working with software is rewarding and interesting
  • Being on-call and supporting the software is sustainable and valuable
  • People feel confident to challenge poor practices and approaches
  • Teams have a clear mission and well-defined interaction patterns with other teams

Fundamentally, the assessments should help to unblock and enable teams so they can succeed. The assessments should help teams to improve how they build, test, and deploy software systems through identifying different kinds of improvements:

  1. Team-focused improvements
  2. Product-focused and Service-focused improvements
  3. Organisation-wide improvements

The assessments should NOT be used to penalize teams, but to provide a shared drive towards improving practices and quality.

Teams included in the assessments

Every team writing code, scripts, and/or configuration for application software or infrastructure will benefit from being included in the assessments:

  • Teams building user- and customer-facing websites and services
  • Teams building internal services
  • Teams building infrastructure to support other systems (including Platform teams)
  • Teams building build & deployment tooling and scripts
  • Teams configuring and testing COTS products as part of the software & infrastructure estate
  • Any other teams with a primary focus on building, configuring, and testing software and infrastructure

By "team", we mean 6-10 person group that works together closely, usually called a Squad, Scrum team, Product team, or Stream-aligned team. 

Assessment criteria

The criteria for each dimension are taken from existing published books and online sources:

How to run the assessments

The assessment session itself should feel somewhat like a team retrospective session. The main difference from a normal retrospective session is that in the team assessment session the facilitator guides the discussion more firmly. There are many questions to discuss and it's important that the team discusses all of the criteria in the time available.

At the end of the assessment session, the team should feel encouraged and empowered to decide on what actions they want to take to improve their processes and practices based on the discussions.


Many organisations find that running team assessments every 3 months provides a good result.


  1. Find someone to Facilitate the assessment. This should be someone from outside the team, who is familiar with running team retrospectives. 
  2. Book a time slot for 2 or 3 hours: for online sessions you could split this over 2 or more video call sessions, and for in-person sessions book a room large enough for the team.
  3. For online sessions, either use the card deck from Agile Stationery and screenshot or note down the results in a spreadsheet, or use an online poll tool to capture responses from people in the session. For in-person sessions, either use the card deck from Agile Stationery, or print the assessment sheets for each set of criteria, either using the ready-made A1 PDF (see Releases), or the individual assessment pages at A1 size if possible (use small margins):
  4. For online sessions, show the Tired and Inspired criteria on the screen alongside the question. For in-person sessions, either print the details pages as a guide or have the pages open on-screen to understand the context and details of each of the assessment criteria:
    1. Team Health
    2. Deployment
    3. Flow
    4. Continuous Delivery
    5. Operability
    6. Testing and Testability
    7. Reliability and SRE
    8. On-call
    9. Security and Securability
    10. Team Topologies team interactions
  5. It is valuable to capture details and nuances of the discussions around each question. For online sessions, have someone take notes in a document or shared whiteboard. For in-person sessions, bring lots of marker pens or whiteboard markers: red, blue, and green are best.
  6. Include someone who is familiar with facilitating retrospectives (possibly a scrum master) in the session. They will be shadowing the facilitator during the session so the person from your team can facilitate other assessment sessions later.

Make sure that the Facilitator understands the purpose of the session and is familiar with the assessment pages and questions.


The facilitator should familiarise themselves with the Spotify Squad Health Check approach before running the session. See How I Used the Spotify Squad Health Check Model for a good experience report, Squad Health Checks from SkyBet, and download instructions from Spotify (PDF).

During the assessment:

  • Keep the team on schedule by asking for some discussions to be held outside the session
  • Write down the team scores and notes on the printed assessment sheets or ensure that scores and notes are captured in a digital tool.
  • Take photographs of the completed assessment sheets for in-person sessions.
  • Get feedback from the team on the VALUE and EXECUTION of the engineering assessment - smiley faces are sufficient!


Each team assessment runs for around 3-4 hours, and the facilitator will run the team through 10 sets of questions:

  1. Team health check - 35 mins
  2. Deployment health check - 10 mins
  3. Flow check - 10 mins
  4. Continuous Delivery check - 20 mins
  5. Operability check - 20 mins
  6. Test coverage check - 20 mins
  7. Reliability and SRE - 30 mins
  8. On-call - 15 mins
  9. Security check - 30 mins
  10. Team Topologies check - 20 mins

These timings leave space for two 10 minute breaks during the assessment.

Running the Assessment session

Each section has several questions. Each question should be answered as follows:

  • The team (either as individuals or as a team) rate each of the criteria using¬†SAD (1 OR 2)¬†/¬†MEH (3)¬†/¬†YAY (4 OR 5)¬†based on the¬†Tired¬†and¬†Inspired¬†guidelines

    • Tired¬†aligns to a low rating (1), and¬†Inspired¬†aligns to a high rating (5)

    • If you used individual ratings, tally the ratings and/or decide on a single team score from 1 to 5. You may find it useful to use different coloured pens on the printed sheet to indicate visually the different ratings.

  • The¬†Trend¬†since the previous time is identified (going up, staying roughly the same, going down), if applicable

  • An¬†Action¬†agreed to improve the score for that question over the coming months.

  • Use the¬†Notes¬†column to indicate further information that you think is valuable for the coordinating team to know about.

  • Make sure to complete to the¬†Date/Name/Facilitator¬†details

  • For in-person sessions, take a photo of each completed sheet and send to¬†the person coordinating the assessments

  • ¬†Get team members to rate the assessment session itself in terms of: Value, Execution (sad, meh, happy faces)

Viral facilitation

Each team assessment session has present one person who is shadowing the facilitator so that they can themselves facilitate future sessions. Each new facilitator should facilitate at least 2 sessions with other teams. In this way, the number of facilitators expands rapidly, enabling a minimal burden on the initial facilitators.

Coordinating and interpreting the results

After teams have each run an assessment session and sent their results, the coordinating group should collate the results from different teams to identify any areas that need improving across the organisation. Ask questions such as:

  • Why does team ABC rate themselves as 1 for test coverage? What is hindering them?
  • What can we do as an organisation to help more teams with deployments?
  • Is there an aspect of the Platform that needs improving so teams can go faster?

Do not attempt to rank or compare teams directly. Instead, use the signals from the teams to understand the organisational dynamics better and then prioritise organisation-wide improvements.


The Multi-team Software Delivery Assessment is a simple, easy-to-execute approach to assessing software delivery across many different teams within an organisation.



Security policy