Architecting for profit: A blueprint for modern cloud economics

In partnership with

If the role of a good cloud architect is to design and build cost-effective software, is there a formula to achieve that goal every time?

A successful enterprise software application must effectively reflect the appropriate business context. But how do you ensure this is the case at the architecture phase?

I find it helpful to start by creating an oversimplified business model at the design stage. For example, using this model, a business owner invests capital to establish a factory that transforms raw materials into finished products sold at a reasonable price. Additionally, the business has ongoing operational costs, such as rent and utilities.

The role of technical architects is to consider the initial outlay, the per-request fees, and the operational costs to build the most profitable system possible.

Figure 1: The economics of running a software business. Funds allocated to establish the system represent the initial capital investment. Initial capital sets up the system. Overhead costs remain stable no matter the traffic. Costs and earnings per request show the business's profit margins.

Funds allocated to establish the system represent the initial capital investment. Initial capital sets up the system. Overhead costs remain stable no matter the traffic. Costs and earnings per request show the business's profit margins.

To represent this business model, I've formulated an equation that I'll reference in this article as the "profit equation."

profit - n X (r – c) – C – O

Cost Centers

If the primary responsibility of an architect is to increase profitability, the profit equation helps in identifying costs, which include:

Initial Capital (C)

Historically, tech companies spent a lot of money on data centers. Amazon Web Services (AWS) changed the game by allowing businesses to rent parts of its expansive cloud infrastructure and only pay for what they use.

Hourly Costs (Operational Costs) (O)

Once your code is written, it's time to deploy. In a physical factory, there are machines; in the digital realm, we have servers. An application may need to be constantly accessible, requiring consistent infrastructure and incurring steady costs. As user numbers grow, additional servers might be needed, increasing costs. These costs rise in chunks, similar to factories adding machines during high demand, and are known as "step costs."

Request Costs (Raw Materials) (c)

While operational costs have this step-like pattern, request costs are linear. Whenever a user sends a request, think of it as using a unit of raw material. You pay for components like bandwidth and data transfer. If your traffic skyrockets, these costs rise in direct proportion.

Managing Resources

In designing applications, architects must balance cost, efficiency, and scalability. Selecting inappropriate resources can disrupt this harmony. You can choose to outsource some tasks to your cloud provider – like scaling and administration based on chosen resources – while also maintaining control in areas you prioritize.

The AWS Shared Responsibility Model

AWS specifically operates a Shared Responsibility Model (SRM), where it clearly defines the areas of control it undertakes. This gives architects three clear options regarding how much they manually control and where they hand off to the cloud provider.

First, you can opt to provision servers for incoming traffic manually. These servers come with a fixed hourly rate regardless of the number of requests. When these servers are near their capacity, you must scale them up. Depending on the resource, you might have to do this manually, impacting efficiency.

Figure: With a steady load, instant-based resources are more efficient due to planned costs. But if the load changes, manual scaling may be required.

Alternatively, many cloud resources offer autoscaling, letting you adjust server capacity automatically based on factors like utilization or a set schedule. While initial setup is necessary, the cloud provider tweaks your server capacity according to your autoscaling guidelines.

Figure: If traffic growth is steady, a simple autoscaling policy can scale up the system automatically and downscale when there are periods of no load.

At the other extreme are serverless resources, where the cloud provider takes the reins, handling scaling, management, and security. You are charged a premium based on the number of requests, and while these resources excel in specific functions, they're not as flexible.

Figure: These systems are designed for spiky traffic.

The system scales up when traffic spikes. In the event of zero load, the system scales down to zero without charging you.

While resources with instances have an initial cost, serverless resources can get expensive with a higher volume of requests. Therefore, the cost-effectiveness of your setup often hinges on your request volume and your ability to predict it.

Figure: Pizza as a service vs. Software as a service

Architecture strategies

The key to selecting the best strategy is to identify where your business aims to stand out and where it's willing to make concessions.

Upfront investment

If you are making large investments upfront, success depends on accurately forecasting traffic trends. Architects and developers work together to predict future traffic needs. They then invest accordingly to support expected traffic, ensuring enough capacity.

This approach suggests a substantial investment in (C) to reduce our profit equation's per-request cost (c). When (n) is large, the cost of (C) gets distributed over the multiple requests, leveraging economies of scale.

Steady stalwarts

Another strategy is to be steady stalwarts, which seeks to balance the flexible pay-as-you-go model with a fixed upfront investment. This starts with an estimated demand and adjusts capacity as needed if changes are steady and foreseeable. Autoscaling is generally used to adjust capacity in response to traffic changes.

This approach adjusts (O) by balancing between (c) and (C). If there are unexpected traffic surges, you might face situations of having too much or too little capacity. Thus, this strategy best suits platforms with consistent traffic.

Flexible spending

For customers who prioritize flexibility and the ability to scale rapidly, the primary challenge is accurately predicting traffic or load. Such customers could benefit from using serverless resources to mitigate the risk of unreliable forecasting.

If you exclude initial costs (C) and overhead (O), your expenses align directly with the volume of requests you get. More requests (n) mean higher costs. Serverless options usually cost more than fully utilized instance-based resources. This method allows businesses to delay scaling until they experience maximum traffic.

Scalability

The magic wand that lets your application handle more user requests is scalability. Think of it like a highway. The more lanes (servers) you have, the more cars (requests) you can handle. But every lane (scale) comes with a price. While aiming for higher profits, managing the number of lanes is crucial.

To break it down, the profit equation is (n) times the difference between revenue (r) and cost (c). You'll only make a good profit if you earn more (r) than you spend (c) for each car on the highway (request).

However, as you add more lanes, the maintenance cost rises, shrinking the gap between revenue and cost. Thus, the benefit gained from each added lane reduces over time. Eventually, adding another lane might not be worth the cost. For architects, this poses a dilemma. Should they continue to add lanes even if it's not profitable?

And here's a twist: not every loss is about money. What if a car (request) gets turned away? It could tarnish the brand's image or affect other system parts. While counting pennies, remember to weigh the non-monetary costs, too.

In my experience, it's vital for architects to consider expanding the highway or closing temporarily for maintenance. Sometimes, the best growth strategy is to pause and recalibrate.

What should architects do?

Imagine building software as a high-stakes game of chess. You've got to make trade-offs or strategic choices tailored to your business's unique playbook.

Do you have a crystal ball that predicts growth? Go ahead and double down on beefy instance-based infrastructure. Not sure what the future holds? You might want the agility of a serverless setup. In my playbook, it's often wise to dip your toes in the serverless waters first. This gives you the freedom to experiment without going all-in.

If you end up picking the wrong path, hit pause and retrace your steps. When you get it right, the best systems work like well-oiled machines – each piece complements the other, and the whole thing just purrs.

Architecting for profit: A blueprint for modern cloud economics

Posted in:

Written by:

Share:

In partnership with

Figure 1: The economics of running a software business. Funds allocated to establish the system represent the initial capital investment. Initial capital sets up the system. Overhead costs remain stable no matter the traffic. Costs and earnings per request show the business's profit margins.

profit - n X (r – c) – C – O

Cost Centers

Initial Capital (C)

Hourly Costs (Operational Costs) (O)

Request Costs (Raw Materials) (c)

Managing Resources

The AWS Shared Responsibility Model

Figure: With a steady load, instant-based resources are more efficient due to planned costs. But if the load changes, manual scaling may be required.

Figure: If traffic growth is steady, a simple autoscaling policy can scale up the system automatically and downscale when there are periods of no load.

Figure: These systems are designed for spiky traffic.

Figure: Pizza as a service vs. Software as a service

Architecture strategies

Upfront investment

Steady stalwarts

Flexible spending

Scalability

What should architects do?

Related content

Why Zig is one of the hottest programming languages to learn

How to build an effective technical strategy

Why OpenFeature is central to modern feature management

Understanding feature flags

What is retrieval-augmented generation (RAG) and are you ready for it?

How to standardize codebases across teams

WebAssembly is still waiting for its moment

Minimum viable architecture is the backbone of a successful product

A buyer’s checklist for AI coding assistants

5 mistakes to avoid when picking an AI coding assistant

The best AI coding assistants 2024

How to argue with the AI coding assistant skeptics

PostgreSQL: The database that quietly ate the world

Partner Content: The Engineering Leader’s Guide to Goals and Reporting

AI models can’t understand code. Does that matter?

6 questions to ask when buying a software developer metrics tool

How to combat generative AI security risks

How Zalando uses its own Tech Radar to make better technology choices

4 things you need to know from the latest Thoughtworks Tech Radar

9 women in AI you need to know about

AI and Kubernetes are pushing cloud costs out of control

How to write better AI prompts

A buyer’s checklist for software developer analytics tools

5 mistakes to avoid when choosing a software developer analytics tool

How to plan for and mitigate different types of tech debt

The best software development analytics tools 2024

Who holds the edge in the JavaScript framework wars?

11 generative AI programming tools for developers

Researchers say generative AI isn't replacing devs any time soon

Mastering tough technical decisions

Unlocking productivity with developer platforms

12 things to consider when assessing open source software

Choose a contextualized AI coding assistant

What developers need to know about generative AI in 2024

Leading open-source teams in large organizations

Whatever happened to Big Data?

6 steps to addressing legacy enterprise code

Learning to live with legacy code

A journey to tackle legacy code in online travel

How test coverage can improve code quality

What you need to know about Biden’s AI executive order

How OpenAI fought off security threats and GPU shortages to scale ChatGPT

Balancing build vs buy decisions in a post-boom world

3 strategies for maximizing your cloud savings

Building a cloud architecture that can scale to any challenge

How are engineering orgs achieving reliability in 2023?

Tech debt for engineering leaders: How a shortcut today impacts tomorrow

What AI has to offer: Using LLM tools in interviews

Tech debt traps to avoid

The 6 biggest generative AI risks for developers

7 generative AI productivity hacks for developers

SRE for engineering managers

Can platform engineering help you do more with less?

When to migrate from a monolithic to a distributed frontend architecture

The essential tools for software engineering managers

Let's mitigate bias in tech