12 Places to Intervene – Rethink FinOps Using a Systems Thinking … – InfoQ.com

Key Takeaways

Systems Thinking provides ways to extract simple but useful models of complex things, but I think only a few people think that way. Could be nature or nurture, but we dont try to teach very often.

- Adrian Cockcrofts tweet on 11 Sep 2018.

A recent study on FinOps by McKinsey reveals that 69% of the organizations prioritize tactical initiatives over higher-impact strategic initiatives.

One reason could be that most of those 69% organizations are unaware of those strategic initiatives or unaware of their importance.

So, in continuation to my article on Medium (that explores cloud cost management problems from a socio-technical perspective), here I will analyze via a System Thinking lens how to prioritize the efforts to address those problems.

Any organization, or a part of it, can be considered a socio-technical system because any organization employs people with a certain skill set, who work to achieve set goals, follow laid down processes, use particular technology, operate on a foundational infrastructure, and share certain cultural norms.Thus, Cloud Financial Management is a socio-technical system that needs to be analyzed from a social and technical angle via Systems Thinking. But before diving in, let me introduce some terminology of Systems Thinking.

Cloud Financial Management via Systems Thinking lens

When we analyze a problem, we often try to look for certain points or places where we can focus our efforts to gain maximum leverage on the system to achieve our desired outcome. For example, if you are in a situation to lift a motorbike thats fallen down, you dont pull up from every part that you can hold on to. Instead, you find the best part and a way to hold (based on the bike design and your strengths) to pull up with the least effort and damage.

Similarly, for complex socio-technical systems, Donnella Meadows, in her book Thinking in Systems, proposes 12 places where you can "intervene" to achieve maximum impact. These are known as leverage points or points of intervention.

A system intervention is a deliberate effort to change or improve a systems behavior, processes, or outcomes. It involves identifying problems and implementing changes to improve the overall functioning of the system.

I will explore each of the 12 interventions and elaborate on how each intervention correlates with cloud financial management along with corresponding measures to optimize the spend.

The System Iceberg - seen and unseen interventions

Donella Meadows introduces the 12 leverage points in the increasing order of leverage (from least to highest). We can classify these 12 points into four, namely, Parameters, Feedbacks, Social Design, and Mental Models, based on their area of impact.

While Parameters classification is concerned with relatively tangible parts of the system, Feedbacks are about the systems internal dynamics. And Social Design relates to the social structure within the system. The last one, Mental Models, refers to the values, goals, and mindset of the people involved.

12 Leverage points (from least to highest)

Most organizations are operated on the assumption that setting targets is sufficient. But the reality is that setting targets is the least effective place to intervene. And in some situations, setting targets will become counter-intuitive. For example, number of resources, duration of usage, and price of resources are a few of the parameters that can determine the cost of the cloud system. Setting targets on those parameters is a leverage point.

Optimization measures in this category are enforcing

Parameters usually are not an efficient leverage point that can bring change in the systems behavior unless they contribute to pushing some other higher leverage points like goals. Well cover more of that in the goals section.

Buffers are all about increasing the capacity of the system to handle changes in the in/out flows.

For example, improving the skill level and providing dedicated time or additional staff can increase the buffer capacity of the team.

Optimization measures that can increase the capacity of the buffers are

Creating most of the buffers takes time and isnt easy to change, so this leverage point is kept down on the list.

The structure of the stock-and-flow in a system plays a vital role in how the system operates. This leverage point refers to building or modifying the structure of a system (like infrastructure, products, or processes) to lower the effort of acting on the problem.For example, the structure of teams that manage the cloud, the structure of the cloud, and the structure of accountability among the teams come under this category.

Measures that fall under this leverage point are

The structure of stock-and-flows is rarely quick or simple to change, so this leverage point ends up at the bottom of the list.

Feedback loops - the basic operating units of a system - Donella Meadows

Feedback Loops

Delays refer to the length of time it takes to correct the system in relation to the rate at which the system changes. It plays a vital role in the behavior of the system. Delays will create oscillations between the desired target that you want to achieve and the actual state.

For example, if the cost overrun information is available to your team only at the end of the month, then optimization actions will happen only in a monthly cycle. Similarly, response to the feedback information should also be timely without any delay.

Measures that will reduce delays in the system are

If the systems delays are changeable, changing them can cause high impacts.

Balancing feedback loops are the ones that keep the system state within safe boundaries based on the difference between the actual and desired level of stock. Strengthening the balancing feedback loops is primarily improving the systems self-correcting abilities.

Any balancing feedback loop consists of a goal, an observer to check on deviations from the goal, and a response act.

Here are some measures that can strengthen the systems self-correcting abilities:

A FinOps tool can enable most of the automation mentioned above.

Reinforcing feedback loops are the ones that keep the system growing or collapsing. The more it works, the more it grows to work even more. There are two kinds of reinforcing loops - Vicious cycles, and Virtuous cycles.

In Cloud, a Vicious cycle occurs during a DDoS (Distributed Denial-of-Service) attack, where the attack traffic triggers auto-provisioning of resources which invites more attack traffic that in turn restarts the cycle as more resources are provisioned. This exponential resource usage ends up in a huge bill. Even a performance testing script mistake can cause such a situation.

Steps that can limit the growth of these reinforcing loops are

Implementing effective cloud financial management can trigger a Virtuous cycle where cost savings leads to increased investment in cloud services, which increases its overall value, leading to more motivation to optimize spending and realize even more cost savings.

Any system that has uninterrupted growth will self-destruct. A better way to stabilize the system is to weaken the reinforcing loops.

The flow of feedback information to the right set of people can result in very different outcomes, which are different from parameter adjustment or strengthening/weakening an existing feedback loop. This is about cascading feedback information to those who can act on it immediately and appropriately.

Here are some of the measures you can take to improve the structure of the information flow:

Since this intervention creates accountability by providing the missing feedback information, Donella observes that this intervention is always popular with the masses rather than with the powerful.

Rules form a high leverage point. Standards, guidelines, and policies can be referred to as rules here. For example, the Architecture Board must sign off on the design and architecture beforehand for the coding to start.

A few of the measures that you can employ to make use of this leverage point are

Self-organization is a powerful intervention that enables the system to evolve. It amounts to changing any aspect of the system that has less leverage than this, such as rules, physical structure, information flow, etc.

A few of the measures you can do are

Framing those rules that help develop and maintain self-organization within the system is a powerful intervention. And the power to frame those rules should lie with the team.

Goals are much higher interventions than the stock-flows, feedback loops, or even self-organization because a wrong goal will lead to very different outcomes even when other aforesaid interventions are in place.

So, aligning the system with higher goals will lead to better outcomes.

One of the measures that can improve the purpose of the cost management system is to pursue improving the value of cloud consumption rather than controlling cloud spend.

Based on this higher goal, a change that can be introduced in the hosting strategy is choosing the hosting model based on the role played by the workload in the organizations value chain. For example, a core business process application that is customer facing and is a differentiator shall be hosted on IaaS (even if it incurs more cost). But an application that belongs to a supportive business process in the value chain shall be hosted on PaaS/SaaS (incurring less cost).

Unit economics is a way to measure the value of cloud spend. For example, in a logistics organization, the cloud cost for processing one parcel can show the real cost of the parcel processing even if the number of parcels increases or decreases. Unit cost metrics like per-parcel cost can be leveraged as a north star for optimization initiatives towards improving the value of cloud consumption.

Everything, from goals and rules to delays and parameters of a system, emerges out of a mindset, aka paradigm about that system. Thus, this leverage point, a shift in mindset, can initiate a change in the whole system that lies underneath.

Paradigm change is often viewed as hard to accomplish, but all it takes for mindset change is a moment of realization.

Cloud spend management needs a shift in mindset (that was there while managing on-premises infrastructure).

Here are the measures you can take to bring in the shift in mindset

And some of the misconceptions (mindset) that should be dealt with are

Paradigms arent constant. So, you need to look beyond your current paradigm. In this case, Cloud is your current paradigm, but its not final.

You need to think beyond cloud spend optimization. You need to think of it as infrastructure spend optimization. If you start on that note, you will explore other available options, like what Dropbox did back in 2016 (its present state) - Dropbox shifted its majority of workloads from the public cloud to co-location facilities saving nearly $75M over two years which contributed to increased gross margins from 33% to 67%. For DropBox, the public cloud was cheaper early on but costlier later on when the company grew.

One point to note is that as the effectiveness of the leverage points increases, so does the resistance to change, meaning, an initiative at a higher leverage point will face higher resistance from the system.

In this article, I have talked about where you can intervene to change the course of your Cloud Financial Management system and the effectiveness of each intervention point. But how to intervene is up to you because each system is unique and constantly evolving. All the best!

Excerpt from:
12 Places to Intervene - Rethink FinOps Using a Systems Thinking ... - InfoQ.com

Related Posts

Comments are closed.