Pairing Leading and Lagging Indicators in Software Engineering Metrics
This is one of those ideas I find myself coming back to over and over as I work with software development metrics. I believe that engineering teams that keep this in mind will have an easier time improving themselves based on data.

I have seen this concept used in Product in the past, but I don't think it's as widespread in Engineering. Metrics are a delicate and dangerous matter in software development, and the more we understand the domain, the better.

A simple definition is that leading indicators hint at something that might happen in the future, while lagging indicators tell a story about what has already happened. They really clicked for me when I started understanding what makes each of them work that way:

Imagine that your customers are finding bugs frequently, and your development team, tired of testing manually, suggests introducing automated tests.

In this example, test coverage is a good candidate for our leading indicator, one the team can iterate on weekly (see the sketch after this list). And this is because:

  • We believe it can help us predict the future: increasing test coverage can lead to fewer bugs found by our customers.
  • The team has direct control over it.
  • There is a short feedback loop: they can see the coverage immediately each time they run the tests on their local machines.
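
To make that feedback loop concrete, here is a minimal sketch of a coverage gate a team might run locally or in CI. It assumes a coverage.py XML report (e.g. from `pytest --cov --cov-report=xml`); the report path and the 80% threshold are illustrative assumptions, not a recommendation:

```python
# coverage_gate.py -- a minimal sketch, not a prescribed tool.
# Assumes a coverage.py XML report, e.g. produced by:
#   pytest --cov --cov-report=xml
# The report path and the 80% threshold are illustrative assumptions.
import sys
import xml.etree.ElementTree as ET

THRESHOLD = 0.80  # hypothetical weekly target the team iterates towards


def line_coverage(report_path: str) -> float:
    """Read the overall line-rate attribute from a coverage.py XML report."""
    root = ET.parse(report_path).getroot()
    return float(root.attrib["line-rate"])


if __name__ == "__main__":
    coverage = line_coverage("coverage.xml")
    print(f"Line coverage: {coverage:.1%}")
    # Failing the run below the target keeps the feedback loop short:
    # the team sees the indicator move on every test run.
    sys.exit(0 if coverage >= THRESHOLD else 1)
```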

All of that sounds good. But notice what can happen if you pair it with a lagging indicator, one like escaped defects, the number of bugs found in production (see the sketch after this list):

  • Several weeks down the line, you can look at this metric and see whether your investment delivered the impact you were hoping for.
  • It can help non-technical people understand the business value that your technical investment brought.
  • It can act as a north star for decision making. Is this test really worth writing, or am I increasing coverage for the sake of it? Are we writing the right kind of tests? You'll have a much easier time deciding on these things if you have a clear idea of the greater goal you want to achieve.
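
And here is the companion sketch for the lagging side: counting escaped defects per week. It assumes a CSV export from your issue tracker with hypothetical `created_at` (ISO date) and `environment` columns; adapt the names to whatever your tracker actually produces:

```python
# escaped_defects.py -- a minimal sketch, assuming a CSV export from your
# issue tracker with hypothetical `created_at` (ISO date) and `environment`
# columns; adapt the names to your tracker's real export format.
import csv
from collections import Counter
from datetime import date


def escaped_defects_per_week(path: str) -> Counter:
    """Count bugs reported against production, grouped by ISO year/week."""
    counts: Counter = Counter()
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            if row["environment"] == "production":
                year, week, _ = date.fromisoformat(row["created_at"]).isocalendar()
                counts[(year, week)] += 1
    return counts


if __name__ == "__main__":
    for (year, week), n in sorted(escaped_defects_per_week("bugs.csv").items()):
        print(f"{year}-W{week:02d}: {n} escaped defects")
```

Tracked over several weeks, this gives you the long-term baseline you need to check whether the coverage investment actually reduced escaped defects.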

This last point is particularly important. In my experience, lagging indicators take longer to move, but they are closer to business value and are a good representation of the current health of the project. They give you an overview, a long-term baseline that you can improve over time.

One of the reasons for that is that they tend to be the symptom of multiple factors. There are many things that can affect escaped defects beyond a poor automation suite. Focusing on improving the ultimate goal will help you pick the right battles.

But this trait of lagging indicators being affected by multiple factors is also a challenge. If the metric moved, how do I know it was because of the specific change we made and not something else? This is an attribution problem, and I'm afraid I don't have a silver bullet for it. You can rely on industry research, which might not apply to your specific case, or experiment within your organisation to validate it yourself, though you might not have the means for that.

And these are the reasons why I believe that pairing a leading indicator to iterate quickly with a lagging indicator to guide us and validate success is a good practice.


About me

I'm a fractional CTO who enjoys working with AI-related technologies. I have dedicated more than 15 years to serving SaaS companies, working with small startups and international scale-ups in Europe, the UK, and the USA, including renowned companies like Typeform.

I now help startups achieve high growth and performance through best practices and Generative AI.
